Clc11G08940 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc11G08940
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: Transposable element protein
Location: ClcChr11: 11030844 .. 11033129 (+)
RNA-Seq Expression: Clc11G08940
Synteny: Clc11G08940
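The location field packs a chromosome name, a start, an end, and a strand into one string. The sketch below shows one way to split it and compute the span length. It assumes the coordinates are 1-based and inclusive, which is the usual convention for GFF-style annotation but is not stated on this page; the function names `parse_location` and `feature_length` are illustrative only.

```python
import re

def parse_location(text):
    """Split a string such as 'ClcChr11: 11030844 .. 11033129 (+)'
    into (chromosome, start, end, strand)."""
    m = re.match(r"\s*(\S+?):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def feature_length(start, end):
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1

chrom, start, end, strand = parse_location("ClcChr11: 11030844 .. 11033129 (+)")
print(chrom, strand, feature_length(start, end))
```

Under that assumption the gene spans 2286 bases on the forward strand of ClcChr11.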
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCTCCCTTACTAATGCTCTTTCTAAATTGATTGCAGGGGGCCAAGCTCAAGCAAGTCCACCATCCATAGCATCTCTTCCCACCTTGGCATCGGAGATGTCTACACAAAAGGACTTGGAACAAAGAGACAATGAGACGATCAATTATGTTGATCGAGGACACTATAGAGGCCACCAACAACAACTTCCAACTCATTACCATCCTAACTTGAGGAATCATGAAAACTTTTCTTATGCTAACAATAGAAATGTTTTGCAAGGTCCTCCCAGTTTTTCCAGTGTAGGTCCATCACACACCAAAAAGAAGTCCAACTTAAAAAACTTAATGGTGGAGTTCATAAAAGAGTCAAGAGGAAGGACCACGCATTGGAGCGCATGGTACAAAGTCATGGCAAGGCTATTCACAACATTGAGGTACAAATTAGCCAAATAGCCACTTCTCTTCAAACAATGCAAAAGGGTAAGTTTCCTAGTTGCCCCAAAAGGAATCCAAAGCAGGAGTGCAAGGTCGTGACTTTGAGGAGTGGGAAAAAGTTGTCCACTCCCTTGATTGATGATGAGGATGAAGAGCAAGAGGTAGATGAGACCATCCAAAAGCCTATCTTAGAAGATGAACCCAAGGCGGTCTTAGAAAAAGAGTAAAAGGTAAAGGAGTCGGCATATTCAAGTAAGTCATCTAACCCTCTACCTTTTGTACCTAATACCTTGTCCTATCCTCAATGTTTTAAAAAGAAAGAAATAGACTCCCAATTTGCTAAATTTTTAGAGTTATTTAAAAAGCTTCACATTAATATCCCATTTGCTGAGGCTTTAGAGCAAATGCCCAATTATGCTAAATTTATGAATGATGTCTTGTCTAAAAAGAAGAAGTTTGGGAAATATGAGATGATTAGCATGGCTGAAGAATGCAATGTTGTGCTACAAAAGAAGCTACCACAAAAATTGAAAGATCCAGGGAGTTTTACTATACCTTGCACTGTTGGTTTCTTTAAATGTTACTAGAGCTCTCTGTGATCTTGGTACTTGTATTAATTTAATGCCTTTATCTATTTATAGGAAGCTTGACATTGGAGAAGTGCAACCTATCACTATCACATTACAAATAGCCGATAGATCCTTAGCTTATCTTAAAGGTATTGTTGAGGATGTATTAGTTAAAGTTGACAAATTTATCTTTCCTATAGATTTTGTAGTTTTGGACATGGAGGAGGACTCTGAGGTTCCTATCATTCTTGGGCGCCCATTCCTAGTAATTAGGAAAGCTATCATAGATGGTGAATATGTTGTCTTTAATATTTATAAGTCCTTGAGTCACCATGATGAGGGTCGTACTTGCCATGCTATAGACATGATTGATCATACTATCTCTGAGCATGTTGTCAAATCATGTGATAGGTGCCAACGTACTGACAATATTTCTAGACAACATGAGCTTCCAATGAAACCTATCTTAGAAGTGGAGCTCTTTGATGTCTGGGGTATTGACTTTATGGGGCCTTTTCCTATGTCTTCTGATGGCTACCTATATATTCTAGTTGCAGTTGATTATGTATCTAAATGGGTAGAAGCCATGGCTACTAGGACCAATGATGCTCGCACTGTTTTAAAATTCTTGCATAAAAACATCTTCACACGTTTTGGTACACCTGGAGCTATTATTAGTGATGAGGGTTCTCACTTTTGCAATAAATTATTTGAATCCATGATGCAAAAATATAATGTTAATCATAAAATTGCTACAATTTATTATCCTGAAACTAATGGTCTTGCTGAGTTATCTAATAGGGAAATCAAGCAAGTTTTGGAAAAGACTGTCAAGATCAATAGGAAGGATTGGGCCCTAAAGCTTGATGATGCATTGTGAGCCCACTGTACATCTTTCAAAACCCCAATAGGTACTTCACCGTACAGGTTGGTGTTTGAAAAGGCTTGTCACTTACCCGTAGAGCTCGAGCATAGAGCTTATTGGGCTATCAAGAAGTTGAACATGGATTTTGA
GAAGGCCGGTGAGAAGCGCCTCTTGGAACTCAATGAGATGGAGGAGTTTCGTGCTCAAGCTTATGAGAATGCCAAACTTTATAAGGAGTGCACTACCAGATGGCATGATAAGAAGATCAACTCACAGACCTTTCTTCTTGGACAAAGAGTATTACTTTTCAACTCACGTTTACGTTTGTTTCCAGGTAAGCTTAGGACACGATGGTCGGGACCCTTTGTCATTGTCAAGTGTCCCCACATGGAGTCATGGAATTGCAAAGCGACGATGGGACAATCTTCAAAGTAA

mRNA sequence

ATGGCCTCCCTTACTAATGCTCTTTCTAAATTGATTGCAGGGGGCCAAGCTCAAGCAAGTCCACCATCCATAGCATCTCTTCCCACCTTGGCATCGGAGATGTCTACACAAAAGGACTTGGAACAAAGAGACAATGAGACGATCAATTATGTTGATCGAGGACACTATAGAGGCCACCAACAACAACTTCCAACTCATTACCATCCTAACTTGAGGAATCATGAAAACTTTTCTTATGCTAACAATAGAAATGTTTTGCAAGTTCATAAAAGAGTCAAGAGGAAGGACCACGCATTGGAGCGCATGGTACAAAGTCATGGCAAGGCTATTCACAACATTGAGGTACAAATTAGCCAAATAGCCACTTCTCTTCAAACAATGCAAAAGGGTAAGTTTCCTAGTTGCCCCAAAAGGAATCCAAAGCAGGAGTGCAAGGTCGTGACTTTGAGGAGTGGGAAAAAGTTGTCCACTCCCTTGATTGATGATGAGGATGAAGAGCAAGAGGTAGATGAGACCATCCAAAAGCCTATCTTAGAAGATGAACCCAAGGCGGTCTTAGAAAAAGAGAAGCTTGACATTGGAGAAGTGCAACCTATCACTATCACATTACAAATAGCCGATAGATCCTTAGCTTATCTTAAAGGTATTGTTGAGGATGTATTAGTTAAAGTTGACAAATTTATCTTTCCTATAGATTTTGTAGTTTTGGACATGGAGGAGGACTCTGAGGTTCCTATCATTCTTGGGCGCCCATTCCTAGTAATTAGGAAAGCTATCATAGATGGTGAATATGTTGTCTTTAATATTTATAAGTCCTTGAGTCACCATGATGAGGGTCGTACTTGCCATGCTATAGACATGATTGATCATACTATCTCTGAGCATGTTGTCAAATCATGTGATAGGTGCCAACGTACTGACAATATTTCTAGACAACATGAGCTTCCAATGAAACCTATCTTAGAAGTGGAGCTCTTTGATGTCTGGGGTATTGACTTTATGGGGCCTTTTCCTATGTCTTCTGATGGCTACCTATATATTCTAGTTGCAGTTGATTATGTATCTAAATGGGTAGAAGCCATGGCTACTAGGACCAATGATGCTCGCACTGTTTTAAAATTCTTGCATAAAAACATCTTCACACGTTTTGGTACACCTGGAGCTATTATTAGTGATGAGGGTTCTCACTTTTGCAATAAATTATTTGAATCCATGATGCAAAAATATAATGTTAATCATAAAATTGCTACAATTTATTATCCTGAAACTAATGGTCTTGCTGAGTTATCTAATAGGGAAATCAAGCAAGTTTTGGAAAAGACTGTCAAGATCAATAGGAAGGATTGGGCCCTAAAGCTTGATGATGCATTGTTGGTGTTTGAAAAGGCTTGTCACTTACCCGTAGAGCTCGAGCATAGAGCTTATTGGGCTATCAAGAAGTTGAACATGGATTTTGAGAAGGCCGGTGAGAAGCGCCTCTTGGAACTCAATGAGATGGAGGAGTTTCGTGCTCAAGCTTATGAGAATGCCAAACTTTATAAGGAGTGCACTACCAGATGGCATGATAAGAAGATCAACTCACAGACCTTTCTTCTTGGACAAAGAGTATTACTTTTCAACTCACGTTTACGTTTGTTTCCAGGTAAGCTTAGGACACGATGGTCGGGACCCTTTGTCATTGTCAAGTGTCCCCACATGGAGTCATGGAATTGCAAAGCGACGATGGGACAATCTTCAAAGTAA

Coding sequence (CDS)

ATGGCCTCCCTTACTAATGCTCTTTCTAAATTGATTGCAGGGGGCCAAGCTCAAGCAAGTCCACCATCCATAGCATCTCTTCCCACCTTGGCATCGGAGATGTCTACACAAAAGGACTTGGAACAAAGAGACAATGAGACGATCAATTATGTTGATCGAGGACACTATAGAGGCCACCAACAACAACTTCCAACTCATTACCATCCTAACTTGAGGAATCATGAAAACTTTTCTTATGCTAACAATAGAAATGTTTTGCAAGTTCATAAAAGAGTCAAGAGGAAGGACCACGCATTGGAGCGCATGGTACAAAGTCATGGCAAGGCTATTCACAACATTGAGGTACAAATTAGCCAAATAGCCACTTCTCTTCAAACAATGCAAAAGGGTAAGTTTCCTAGTTGCCCCAAAAGGAATCCAAAGCAGGAGTGCAAGGTCGTGACTTTGAGGAGTGGGAAAAAGTTGTCCACTCCCTTGATTGATGATGAGGATGAAGAGCAAGAGGTAGATGAGACCATCCAAAAGCCTATCTTAGAAGATGAACCCAAGGCGGTCTTAGAAAAAGAGAAGCTTGACATTGGAGAAGTGCAACCTATCACTATCACATTACAAATAGCCGATAGATCCTTAGCTTATCTTAAAGGTATTGTTGAGGATGTATTAGTTAAAGTTGACAAATTTATCTTTCCTATAGATTTTGTAGTTTTGGACATGGAGGAGGACTCTGAGGTTCCTATCATTCTTGGGCGCCCATTCCTAGTAATTAGGAAAGCTATCATAGATGGTGAATATGTTGTCTTTAATATTTATAAGTCCTTGAGTCACCATGATGAGGGTCGTACTTGCCATGCTATAGACATGATTGATCATACTATCTCTGAGCATGTTGTCAAATCATGTGATAGGTGCCAACGTACTGACAATATTTCTAGACAACATGAGCTTCCAATGAAACCTATCTTAGAAGTGGAGCTCTTTGATGTCTGGGGTATTGACTTTATGGGGCCTTTTCCTATGTCTTCTGATGGCTACCTATATATTCTAGTTGCAGTTGATTATGTATCTAAATGGGTAGAAGCCATGGCTACTAGGACCAATGATGCTCGCACTGTTTTAAAATTCTTGCATAAAAACATCTTCACACGTTTTGGTACACCTGGAGCTATTATTAGTGATGAGGGTTCTCACTTTTGCAATAAATTATTTGAATCCATGATGCAAAAATATAATGTTAATCATAAAATTGCTACAATTTATTATCCTGAAACTAATGGTCTTGCTGAGTTATCTAATAGGGAAATCAAGCAAGTTTTGGAAAAGACTGTCAAGATCAATAGGAAGGATTGGGCCCTAAAGCTTGATGATGCATTGTTGGTGTTTGAAAAGGCTTGTCACTTACCCGTAGAGCTCGAGCATAGAGCTTATTGGGCTATCAAGAAGTTGAACATGGATTTTGAGAAGGCCGGTGAGAAGCGCCTCTTGGAACTCAATGAGATGGAGGAGTTTCGTGCTCAAGCTTATGAGAATGCCAAACTTTATAAGGAGTGCACTACCAGATGGCATGATAAGAAGATCAACTCACAGACCTTTCTTCTTGGACAAAGAGTATTACTTTTCAACTCACGTTTACGTTTGTTTCCAGGTAAGCTTAGGACACGATGGTCGGGACCCTTTGTCATTGTCAAGTGTCCCCACATGGAGTCATGGAATTGCAAAGCGACGATGGGACAATCTTCAAAGTAA

Protein sequence

MASLTNALSKLIAGGQAQASPPSIASLPTLASEMSTQKDLEQRDNETINYVDRGHYRGHQQQLPTHYHPNLRNHENFSYANNRNVLQVHKRVKRKDHALERMVQSHGKAIHNIEVQISQIATSLQTMQKGKFPSCPKRNPKQECKVVTLRSGKKLSTPLIDDEDEEQEVDETIQKPILEDEPKAVLEKEKLDIGEVQPITITLQIADRSLAYLKGIVEDVLVKVDKFIFPIDFVVLDMEEDSEVPIILGRPFLVIRKAIIDGEYVVFNIYKSLSHHDEGRTCHAIDMIDHTISEHVVKSCDRCQRTDNISRQHELPMKPILEVELFDVWGIDFMGPFPMSSDGYLYILVAVDYVSKWVEAMATRTNDARTVLKFLHKNIFTRFGTPGAIISDEGSHFCNKLFESMMQKYNVNHKIATIYYPETNGLAELSNREIKQVLEKTVKINRKDWALKLDDALLVFEKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAKLYKECTTRWHDKKINSQTFLLGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKCPHMESWNCKATMGQSSK
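The protein sequence is the codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (the page does not name a translation table), the first and last codons of the CDS can be checked against the protein like this. The helper `translate` is hypothetical and handles only unambiguous A/C/G/T input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 15 bases of the CDS listed above.
print(translate("ATGGCCTCCCTTACTAATGCTCTTTCTAAA"))
print(translate("CAATCTTCAAAGTAA"))
```

The two fragments correspond to the start (MASLTNALSK) and the end (QSSK) of the protein sequence shown above; the terminal TAA is the stop codon and produces no residue.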
Homology
BLAST of Clc11G08940 vs. NCBI nr
Match: XP_023874613.1 (uncharacterized protein LOC111987139 [Quercus suber])

HSP 1 Score: 411.4 bits (1056), Expect = 1.3e-110
Identity = 199/284 (70.07%), Positives = 232/284 (81.69%), Query Frame = 0

Query: 296  VVKSCDRCQRTDNISRQHELPMKPILEVELFDVWGIDFMGPFPMSSDGYLYILVAVDYVS 355
            +VK+CDRCQR  NISR+ ELP+K ILEVELFDVWGIDFMGPFP  S G++YIL+AVDYVS
Sbjct: 1429 LVKTCDRCQRMGNISRRQELPLKNILEVELFDVWGIDFMGPFP-PSFGFVYILLAVDYVS 1488

Query: 356  KWVEAMATRTNDARTVLKFLHKNIFTRFGTPGAIISDEGSHFCNKLFESMMQKYNVNHKI 415
            KWVEA+AT TNDA+ VLKFLHKNIFTRFGTP AIISDEG+HFCNKLF++++ KY V HKI
Sbjct: 1489 KWVEAIATTTNDAKVVLKFLHKNIFTRFGTPRAIISDEGTHFCNKLFDNLLSKYGVKHKI 1548

Query: 416  ATIYYPETNGLAELSNREIKQVLEKTVKINRKDWALKLDDAL-----------------L 475
            A  Y+P+TNG AE+SNREIK +LEKTV  NRKDWA KLDDAL                 L
Sbjct: 1549 ALAYHPQTNGQAEISNREIKNILEKTVNTNRKDWAKKLDDALWAYRTAFKTPIGMSPYRL 1608

Query: 476  VFEKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAKLYKECTT 535
            VF KACHLPVELEH+AYWA+KK N+D + AGEKRLL+LNEM+EFR  AYENAK+YKE T 
Sbjct: 1609 VFGKACHLPVELEHKAYWAVKKFNLDLKAAGEKRLLQLNEMDEFRNDAYENAKIYKERTK 1668

Query: 536  RWHDKKINSQTFLLGQRVLLFNSRLRLFPGKLRTRWSGPFVIVK 563
            +WHDK+I  + F  GQ+VLLFNSRL+LFPGKLR+RW+GP+ I K
Sbjct: 1669 KWHDKQILRREFAPGQQVLLFNSRLKLFPGKLRSRWTGPYTIDK 1711
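The percentages in an HSP header are the counts of identical or positively scoring columns divided by the alignment length. The following sketch parses such a header line and recomputes both values; the function name `parse_hsp_stats` is illustrative, and the pattern also tolerates the spelling "Postives" that some exports of this report use.

```python
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return counts plus recomputed percentages for one HSP header line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, length, _, pos, pos_length, _ = m.groups()
    ident, length, pos, pos_length = map(int, (ident, length, pos, pos_length))
    return {
        "identical": ident,
        "positives": pos,
        "alignment_length": length,
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / pos_length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 199/284 (70.07%), Positives = 232/284 (81.69%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the first Quercus suber HSP this recovers 70.07% identity and 81.69% positives over 284 aligned columns, matching the reported figures.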

BLAST of Clc11G08940 vs. NCBI nr
Match: XP_023874613.1 (uncharacterized protein LOC111987139 [Quercus suber])

HSP 1 Score: 150.2 bits (378), Expect = 5.4e-32
Identity = 130/470 (27.66%), Positives = 197/470 (41.91%), Query Frame = 0

Query: 7   ALSKLIAGGQAQASPPSIASLPTLASEMSTQK---DLEQRDNETINYVDRGHYRGHQQQL 66
           ALS  +A    Q S  +   +P  A  ++       + +   E + Y++  +Y      +
Sbjct: 239 ALSAQVASLSHQVSALTTQRIPQGAEYVAASSMTVPMNEASQEQVQYINNRNYNYRGNPM 298

Query: 67  PTHYHPNLRNHENFSYANNRNVLQ--------------------------VHKRVKRKDH 126
           P +YHP LRNHENFSY N +NVLQ                               K+ D 
Sbjct: 299 PNYYHPGLRNHENFSYGNTKNVLQPPPGFDSQPSEKKMSLEDAMVSFVEETKATFKKSDS 358

Query: 127 ALERMVQSH----GKAIHNIEVQISQIATSLQTMQKGKFPSCPKRNPKQECKVVTLRSGK 186
            L+  +++H    G  + N+EVQI Q+AT++   Q+G FPS  + NPK++CK +TLRSG+
Sbjct: 359 QLDN-IETHCSNMGATMKNLEVQIGQLATTINAQQRGTFPSNTEVNPKEQCKAITLRSGR 418

Query: 187 KL---------STPLIDDE-------DEEQEVDETIQK-------------PI------- 246
           ++         +TP   +        +EE+ V++T+++             PI       
Sbjct: 419 EIERSPSKETETTPTAPNNGQSKNKVEEEEIVEDTLRETDMPPSISFPDNPPILSTPLPY 478

Query: 247 ------------------------------------------------------------ 295
                                                                       
Sbjct: 479 PQRFQKQKLDKQFSKFLDIFKKIHINIPFADALEQMPNYAKFLKDIISKKRRLEEFETVK 538


HSP 2 Score: 406.0 bits (1042), Expect = 5.5e-109
Identity = 204/310 (65.81%), Positives = 234/310 (75.48%), Query Frame = 0

Query: 275  HHDEGRTCHAI---DMIDHTI---SEHVVKSCDRCQRTDNISRQHELPMKPILEVELFDV 334
            HH E RT   +        T+   S   VK CDRCQRT N+S + ++P+  + EVELFDV
Sbjct: 1190 HHGESRTAAKVLQSGFFWPTLFRDSYEFVKRCDRCQRTGNLSNKSQMPLNNMQEVELFDV 1249

Query: 335  WGIDFMGPFPMSSDGYLYILVAVDYVSKWVEAMATRTNDARTVLKFLHKNIFTRFGTPGA 394
            WGIDFMGPFP SS+G LYIL+AVDYVSKWVEA+AT  NDARTVLKF HKNIF+RFGTP A
Sbjct: 1250 WGIDFMGPFP-SSNGKLYILLAVDYVSKWVEAIATTANDARTVLKFFHKNIFSRFGTPRA 1309

Query: 395  IISDEGSHFCNKLFESMMQKYNVNHKIATIYYPETNGLAELSNREIKQVLEKTVKINRKD 454
            IISDEGSHFCNKLF ++  K  + HKIA  Y+P+TNGLAELSNREIKQ+LEKTV  NRKD
Sbjct: 1310 IISDEGSHFCNKLFANLTNKLGIRHKIALAYHPQTNGLAELSNREIKQILEKTVSTNRKD 1369

Query: 455  WALKLDDAL-----------------LVFEKACHLPVELEHRAYWAIKKLNMDFEKAGEK 514
            WALKLDDAL                 LVF KACHLPVELEHRAYWA+KKLN D    G +
Sbjct: 1370 WALKLDDALWAFRTAFKTPIGMSPYKLVFGKACHLPVELEHRAYWAVKKLNFDQTATGNR 1429

Query: 515  RLLELNEMEEFRAQAYENAKLYKECTTRWHDKKINSQTFLLGQRVLLFNSRLRLFPGKLR 562
            RLL+LNEMEEFR  AYENAK+YKE T +WHDK+I  + F  G +VLLFNSRLRLFPGKL+
Sbjct: 1430 RLLQLNEMEEFRNDAYENAKIYKEKTKKWHDKRITKREFRAGDQVLLFNSRLRLFPGKLK 1489

BLAST of Clc11G08940 vs. NCBI nr
Match: XP_012831341.1 (PREDICTED: uncharacterized protein LOC105952343 [Erythranthe guttata])

HSP 1 Score: 140.6 bits (353), Expect = 4.3e-29
Identity = 140/485 (28.87%), Positives = 193/485 (39.79%), Query Frame = 0

Query: 1   MASLTNALSKLIAGGQAQASPPSIASLPTLASEMSTQKDLEQRD--NETINYVDRGHYRG 60
           +A+L+N ++++   G      P    +   ++  +T  D EQ    N   N     ++RG
Sbjct: 267 LATLSNQVAQISVRG------PQTERVAAASTSQATNDDWEQAHFMNHRFN-----NFRG 326

Query: 61  --HQQQLPTHYHPNLRNHENFSYANNRNVLQV-----HKRVKR------KDHALERMVQS 120
             +Q Q PTHYHP +RNHENFSYAN +N LQ      H+R +R      + H  E+ ++ 
Sbjct: 327 THNQNQNPTHYHPGIRNHENFSYANPKNALQPPPDFNHQREQRGPTYDERLHRQEQEMEG 386

Query: 121 HGKAIHNIEVQISQIATSLQTMQKGKFPSCPKRNPKQECKVVTLRSGKKLS-TPLIDDED 180
               + N+E QI QIA S+ TM KG FPS  + NPK+ C+ +T RSG +++  P   DE 
Sbjct: 387 LKSTMKNMEKQIGQIAQSMSTMAKGGFPSNTEVNPKESCQAITTRSGLQMTDPPYPTDES 446

Query: 181 EEQEVDETIQKP------------------ILEDEP------------------------ 240
               +  T  +P                     D P                        
Sbjct: 447 PRPAIQPTPVEPEITISGSGTKEASKPNNIFFPDNPPLLITPIPFPERQKKKKFQNQLKK 506

Query: 241 ----------------------------KAVLEKE------------------------- 300
                                       K VL K+                         
Sbjct: 507 FIEKIKQIRINIPFAEALEVMPNYTKFMKEVLSKKIRIEEDIPVTLTATCSAILQSNLPP 566

Query: 301 ---------------------------------------KLDIGEVQPITITLQIADRSL 321
                                                  KL +G +    +TLQ+ADRSL
Sbjct: 567 KMKDPGSYTIPCIIGNSTFDKALCDLGASINLMPMSVFLKLGLGNLNRTRMTLQLADRSL 626


HSP 2 Score: 402.1 bits (1032), Expect = 8.0e-108
Identity = 231/431 (53.60%), Positives = 284/431 (65.89%), Query Frame = 0

Query: 161  DDEDEEQEVDETIQKPILEDEPKAVLEKEKLDIGEVQPI-TITLQIADRSLAYLKGIVED 220
            D +  E  V + + + ILE+ P     +E     ++  I T T   AD +     GI+ D
Sbjct: 1654 DKKGSENVVADHLSRLILEEVPAEGNIQESFPDEQLLAISTHTPWYADVANFLASGIIPD 1713

Query: 221  VLV--KVDKFIFPIDFVVLDMEEDSEVPIILGRPFLVIRKAIIDGEY--VVFNIYKSL-- 280
             L   +  KF+    F + D     E  +    P  VIR+ + + E   ++ + + S   
Sbjct: 1714 DLSYHQKKKFLHDSRFYLWD-----EPLLFRTGPDRVIRRCVPETEVREILTHCHSSPCG 1773

Query: 281  SHHDEGRTCHAI---DMIDHTI---SEHVVKSCDRCQRTDNISRQHELPMKPILEVELFD 340
             HH E RT   +        T+   S   VK CDRCQRT N+S + ++P+  + EVELFD
Sbjct: 1774 GHHGESRTAAKVLQSGFFWPTLFRDSYEFVKRCDRCQRTGNLSNKSQMPLNNMQEVELFD 1833

Query: 341  VWGIDFMGPFPMSSDGYLYILVAVDYVSKWVEAMATRTNDARTVLKFLHKNIFTRFGTPG 400
            VWGIDFMGPFP SS+G LYIL+AVDYVSKWVEA+AT TNDARTVLKF HKNIF+RFGTP 
Sbjct: 1834 VWGIDFMGPFP-SSNGKLYILLAVDYVSKWVEAIATTTNDARTVLKFFHKNIFSRFGTPR 1893

Query: 401  AIISDEGSHFCNKLFESMMQKYNVNHKIATIYYPETNGLAELSNREIKQVLEKTVKINRK 460
            AIISDEGSHFCNKL  ++  K  + HKIA  Y+P+TNGLAELSNREIKQ+LEKTV  NRK
Sbjct: 1894 AIISDEGSHFCNKLLTNLTNKLGIRHKIALAYHPQTNGLAELSNREIKQILEKTVSTNRK 1953

Query: 461  DWALKLDDAL-----------------LVFEKACHLPVELEHRAYWAIKKLNMDFEKAGE 520
            DWALKLDDAL                 LV+ KACHLPVELEHRAYWA+KKLN D    G+
Sbjct: 1954 DWALKLDDALWAYRTAFKTPIGMSPYKLVYGKACHLPVELEHRAYWAVKKLNFDQTATGD 2013

Query: 521  KRLLELNEMEEFRAQAYENAKLYKECTTRWHDKKINSQTFLLGQRVLLFNSRLRLFPGKL 562
            +RLL+LNEMEEFR  AYENAK+YKE T +WHDK+I  + F  G +VLLFNSRLRLFPGKL
Sbjct: 2014 RRLLQLNEMEEFRNDAYENAKIYKEKTKKWHDKRITKREFRAGDQVLLFNSRLRLFPGKL 2073

BLAST of Clc11G08940 vs. NCBI nr
Match: XP_012833448.1 (PREDICTED: uncharacterized protein LOC105954320 [Erythranthe guttata])

HSP 1 Score: 86.7 bits (213), Expect = 7.4e-13
Identity = 68/192 (35.42%), Positives = 100/192 (52.08%), Query Frame = 0

Query: 1   MASLTNALSKLIAGGQAQASPPSIASLPTLASEMSTQKDLEQRD--NETINYVDRGHYRG 60
           +A L+N ++++   G      P    +   ++  +T  D EQ    N   N     ++RG
Sbjct: 650 LAILSNQVAQISVRG------PQTERVAAASTSQATNDDWEQAHFMNHRFN-----NFRG 709

Query: 61  --HQQQLPTHYHPNLRNHENFSYANNRNVLQV-----HKRVKR------KDHALERMVQS 120
             +Q Q PTHYHP +RNHENFSYAN +N LQ      H+R +R      + H  E+ ++ 
Sbjct: 710 TNNQNQNPTHYHPGIRNHENFSYANPKNALQPPPDFNHQREQRGPTYDERLHRQEQEMEG 769

Query: 121 HGKAIHNIEVQISQIATSLQTMQKGKFPSCPKRNPKQECKVVTLRSGKKLS-TPLIDDED 177
               + N+E QI QIA S+ TM KG FPS  + NPK+ C+ +T RSG +++  P   DE 
Sbjct: 770 LKSTMKNMEKQIGQIAQSMSTMAKGGFPSNTEVNPKESCQAITTRSGLQMTDPPYPTDEP 829


HSP 2 Score: 402.1 bits (1032), Expect = 8.0e-108
Identity = 231/431 (53.60%), Positives = 284/431 (65.89%), Query Frame = 0

Query: 161  DDEDEEQEVDETIQKPILEDEPKAVLEKEKLDIGEVQPI-TITLQIADRSLAYLKGIVED 220
            D +  E  V + + + ILE+ P     +E     ++  I T T   AD +     GI+ D
Sbjct: 1271 DKKGSENVVADHLSRLILEEVPAEGNIQESFPDEQLLAISTHTPWYADVANFLASGIIPD 1330

Query: 221  VLV--KVDKFIFPIDFVVLDMEEDSEVPIILGRPFLVIRKAIIDGEY--VVFNIYKSL-- 280
             L   +  KF+    F + D     E  +    P  VIR+ + + E   ++ + + S   
Sbjct: 1331 DLSYHQKKKFLHDSRFYLWD-----EPLLFRTGPDRVIRRCVPETEVREILTHCHSSPCG 1390

Query: 281  SHHDEGRTCHAI---DMIDHTI---SEHVVKSCDRCQRTDNISRQHELPMKPILEVELFD 340
             HH E RT   +        T+   S   VK CDRCQRT N+S + ++P+  + EVELFD
Sbjct: 1391 GHHGESRTAAKVLQSGFFWPTLFRDSYEFVKRCDRCQRTGNLSNKSQMPLNNMQEVELFD 1450

Query: 341  VWGIDFMGPFPMSSDGYLYILVAVDYVSKWVEAMATRTNDARTVLKFLHKNIFTRFGTPG 400
            VWGIDFMGPFP SS+G LYIL+AVDYVSKWVEA+AT TNDARTVLKF HKNIF+RFGTP 
Sbjct: 1451 VWGIDFMGPFP-SSNGKLYILLAVDYVSKWVEAIATTTNDARTVLKFFHKNIFSRFGTPR 1510

Query: 401  AIISDEGSHFCNKLFESMMQKYNVNHKIATIYYPETNGLAELSNREIKQVLEKTVKINRK 460
            AIISDEGSHFCNKL  ++  K  + HKIA  Y+P+TNGLAELSNREIKQ+LEKTV  NRK
Sbjct: 1511 AIISDEGSHFCNKLLTNLTNKLGIRHKIALAYHPQTNGLAELSNREIKQILEKTVSTNRK 1570

Query: 461  DWALKLDDAL-----------------LVFEKACHLPVELEHRAYWAIKKLNMDFEKAGE 520
            DWALKLDDAL                 LV+ KACHLPVELEHRAYWA+KKLN D    G+
Sbjct: 1571 DWALKLDDALWAYRTAFKTPIGMSPYKLVYGKACHLPVELEHRAYWAVKKLNFDQTATGD 1630

Query: 521  KRLLELNEMEEFRAQAYENAKLYKECTTRWHDKKINSQTFLLGQRVLLFNSRLRLFPGKL 562
            +RLL+LNEMEEFR  AYENAK+YKE T +WHDK+I  + F  G +VLLFNSRLRLFPGKL
Sbjct: 1631 RRLLQLNEMEEFRNDAYENAKIYKEKTKKWHDKRITKREFRAGDQVLLFNSRLRLFPGKL 1690

BLAST of Clc11G08940 vs. NCBI nr
Match: XP_012833687.1 (PREDICTED: uncharacterized protein LOC105954563 [Erythranthe guttata] >XP_012857704.1 PREDICTED: uncharacterized protein LOC105976985 [Erythranthe guttata])

HSP 1 Score: 87.0 bits (214), Expect = 5.7e-13
Identity = 68/192 (35.42%), Positives = 100/192 (52.08%), Query Frame = 0

Query: 1   MASLTNALSKLIAGGQAQASPPSIASLPTLASEMSTQKDLEQRD--NETINYVDRGHYRG 60
           +A L+N ++++   G      P    +   ++  +T  D EQ    N   N     ++RG
Sbjct: 267 LAILSNQVAQISVRG------PQTERVAAASTSQATNDDWEQAHFMNHRFN-----NFRG 326

Query: 61  --HQQQLPTHYHPNLRNHENFSYANNRNVLQV-----HKRVKR------KDHALERMVQS 120
             +Q Q PTHYHP +RNHENFSYAN +N LQ      H+R +R      + H  E+ ++ 
Sbjct: 327 TNNQNQNPTHYHPGIRNHENFSYANPKNALQPPPDFNHQREQRGPTYDERLHRQEQEMEG 386

Query: 121 HGKAIHNIEVQISQIATSLQTMQKGKFPSCPKRNPKQECKVVTLRSGKKLS-TPLIDDED 177
               + N+E QI QIA S+ TM KG FPS  + NPK+ C+ +T RSG +++  P   DE 
Sbjct: 387 LKSTMKNMEKQIGQIAQSMSTMAKGGFPSNTEVNPKESCQAITTRSGLQMTDPPYPTDES 446


HSP 2 Score: 402.1 bits (1032), Expect = 8.0e-108
Identity = 229/431 (53.13%), Positives = 283/431 (65.66%), Query Frame = 0

Query: 161  DDEDEEQEVDETIQKPILEDEPKAVLEKEKLDIGEVQPITI-TLQIADRSLAYLKGIVED 220
            D +  E  V + + + ILE+ P     +E     ++  I+  T   AD +     GI+ D
Sbjct: 1413 DKKGSENVVADHLSRLILEEVPAEGNIQESFPDEQLLAISAHTPWYADVANFLASGIIPD 1472

Query: 221  VLV--KVDKFIFPIDFVVLDMEEDSEVPIILGRPFLVIRKAIIDGEY--VVFNIYKSL-- 280
             L   +  KF+    F + D     E  +    P  VIR+ + + E   ++ + + S   
Sbjct: 1473 DLSYHQKKKFLHDSRFYLWD-----EPLLFRTGPDRVIRRCVPETEVREILTHCHSSPCG 1532

Query: 281  SHHDEGRTCHAIDMID------HTISEHVVKSCDRCQRTDNISRQHELPMKPILEVELFD 340
             HH E RT   +  +          S   VK CDRCQRT N+S + ++P+  + EVELFD
Sbjct: 1533 GHHGESRTAAKVLQLGFFWPTLFRDSYEFVKRCDRCQRTGNLSNKSQMPLNNMQEVELFD 1592

Query: 341  VWGIDFMGPFPMSSDGYLYILVAVDYVSKWVEAMATRTNDARTVLKFLHKNIFTRFGTPG 400
            VWGIDFMGPFP SS+G LYIL+AVDYVSKWVEA+AT  NDARTVLKF HKNIF+RFGTP 
Sbjct: 1593 VWGIDFMGPFP-SSNGKLYILLAVDYVSKWVEAIATTANDARTVLKFFHKNIFSRFGTPR 1652

Query: 401  AIISDEGSHFCNKLFESMMQKYNVNHKIATIYYPETNGLAELSNREIKQVLEKTVKINRK 460
            AIISDEGSHFCNKL  ++  K  + HKIA  Y+P+TNGLAELSNREIKQ+LEKTV  NRK
Sbjct: 1653 AIISDEGSHFCNKLLTNLTNKLGIRHKIALAYHPQTNGLAELSNREIKQILEKTVSTNRK 1712

Query: 461  DWALKLDDAL-----------------LVFEKACHLPVELEHRAYWAIKKLNMDFEKAGE 520
            DWALKLDDAL                 LV+ KACHLPVELEHRAYWA+KKLN D   AG+
Sbjct: 1713 DWALKLDDALWAYRTAFKTPIGMSPYKLVYGKACHLPVELEHRAYWAVKKLNFDQTAAGD 1772

Query: 521  KRLLELNEMEEFRAQAYENAKLYKECTTRWHDKKINSQTFLLGQRVLLFNSRLRLFPGKL 562
            +RLL+LNEMEEFR  AYENAK+YKE T +WHDK+I  + F  G +VLLFNSRLRLFPGKL
Sbjct: 1773 RRLLQLNEMEEFRNDAYENAKIYKEKTKKWHDKRITKREFRAGDQVLLFNSRLRLFPGKL 1832

BLAST of Clc11G08940 vs. ExPASy Swiss-Prot
Match: P10272 (Gag-Pol polyprotein OS=Baboon endogenous virus (strain M7) OX=11764 GN=pol PE=3 SV=2)

HSP 1 Score: 95.5 bits (236), Expect = 2.1e-18
Identity = 83/279 (29.75%), Positives = 125/279 (44.80%), Query Frame = 0

Query: 291  TISEHVVKSCDRCQRTDNISRQHELPMKPILEVELFDVWGIDFMGPFPMSSDGYLYILVA 350
            T+ E V  +C  CQ+  N         K          W IDF    P  + GY Y+LV 
Sbjct: 1409 TLIEQVTSACKVCQQV-NAGATRVPAGKRTRGNRPGVYWEIDFTEVKPHYA-GYKYLLVF 1468

Query: 351  VDYVSKWVEAMATRTNDARTVLKFLHKNIFTRFGTPGAIISDEGSHFCNKLFESMMQKYN 410
            VD  S WVEA  TR   A  V K + + IF RFG P  I SD G  F +++ + + +   
Sbjct: 1469 VDTFSGWVEAFPTRQETAHIVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARILG 1528

Query: 411  VNHKIATIYYPETNGLAELSNREIKQVLEK-TVKINRKDWALKLDDALLVFEKACH---- 470
            +N K+   Y P+++G  E  NR IK+ L K T++   KDW   L  ALL      +    
Sbjct: 1529 INWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNTPNRFGL 1588

Query: 471  LPVELEHRAYWAIKKLNMDFEKAGEKRLLE--LNEMEEFRAQAYEN-AKLYKECTTRWHD 530
             P E+ +     +  L   F  +  K  L+  L  ++  +AQ +   A+LY+   ++   
Sbjct: 1589 TPYEILYGGPPPLSTLLNSFSPSNSKTDLQARLKGLQAVQAQIWAPLAELYRPGHSQ--- 1648

Query: 531  KKINSQTFLLGQRVLLFNSRLRLFPGKLRTRWSGPFVIV 562
                S  F +G  V +   R +     L  RW GP++++
Sbjct: 1649 ---TSHPFQVGDSVYVRRHRSQ----GLEPRWKGPYIVL 1675

BLAST of Clc11G08940 vs. ExPASy Swiss-Prot
Match: P31792 (Pol polyprotein (Fragment) OS=Feline endogenous virus ECE1 OX=11766 GN=pol PE=3 SV=1)

HSP 1 Score: 92.4 bits (228), Expect = 1.8e-17
Identity = 84/279 (30.11%), Positives = 125/279 (44.80%), Query Frame = 0

Query: 291 TISEHVVKSCDRCQRTDNISRQHELPMKPILEVELFDVWGIDFMGPFPMSSDGYLYILVA 350
           T+ E V  +C  CQ+  N         K          W IDF    P  + GY Y+LV 
Sbjct: 728 TLIEQVTSACKVCQQV-NAGATRVPEGKRTRGNRPGVYWEIDFTEVKPHYA-GYKYLLVF 787

Query: 351 VDYVSKWVEAMATRTNDARTVLKFLHKNIFTRFGTPGAIISDEGSHFCNKLFESMMQKYN 410
           VD  S WVEA  TR   A  V K + + IF RFG P  I SD G  F +++ + + +   
Sbjct: 788 VDTFSGWVEAYPTRQETAHMVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARTLG 847

Query: 411 VNHKIATIYYPETNGLAELSNREIKQVLEK-TVKINRKDWALKLDDALLVFEKACH---- 470
           +N K+   Y P+++G  E  NR IK+ L K T++   KDW   L  ALL      +    
Sbjct: 848 INWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNTPNRFGL 907

Query: 471 LPVELEHRAYWAIKKLNMDFEKAGEKRLLE--LNEMEEFRAQAYEN-AKLYKECTTRWHD 530
            P E+ +     +  L   F  +  K  L+  L  ++  +AQ +   A+LY+      H 
Sbjct: 908 TPYEILYGGPPPLSTLLNSFSPSDPKTDLQARLKGLQAVQAQIWTPLAELYRP----GHP 967

Query: 531 KKINSQTFLLGQRVLLFNSRLRLFPGKLRTRWSGPFVIV 562
           +   S  F +G  V +   R +     L  RW GP++++
Sbjct: 968 Q--TSYPFQVGDSVYVRWHRSQ----GLEPRWKGPYIVL 994

BLAST of Clc11G08940 vs. ExPASy Swiss-Prot
Match: P03359 (Gag-Pol polyprotein OS=Woolly monkey sarcoma virus OX=11970 GN=pol PE=3 SV=2)

HSP 1 Score: 90.9 bits (224), Expect = 5.1e-17
Identity = 81/294 (27.55%), Positives = 124/294 (42.18%), Query Frame = 0

Query: 280  RTCHAIDMIDHTISEHVVKSCDRCQRTDNISRQHELPMKPILEVELFDVWGIDFMGPFPM 339
            RT   I  +   + E V   C  C  T+ ++   E   +   +      W +DF    P 
Sbjct: 1355 RTSLLIPNLQSAVRE-VTSQCQACAMTNAVTTYRETGKRQRGD-RPGVYWEVDFTEVKP- 1414

Query: 340  SSDGYLYILVAVDYVSKWVEAMATRTNDARTVLKFLHKNIFTRFGTPGAIISDEGSHFCN 399
               G  Y+LV +D  S WVEA  T+T  A TV K + + I  RFG P  + SD G  F  
Sbjct: 1415 GRYGNRYLLVFIDTFSGWVEAFPTKTETALTVCKKILEEILPRFGIPKVLGSDNGPAFVA 1474

Query: 400  KLFESMMQKYNVNHKIATIYYPETNGLAELSNREIKQVLEK-TVKINRKDWALKLDDALL 459
            ++ + +  +  +N K+   Y P+++G  E  NR IK+ L K  ++   KDW   L  ALL
Sbjct: 1475 QVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLALETGXKDWVALLPLALL 1534

Query: 460  VFEKACHLPVELEHRAYWAIKKLNMDFEKAG----------EKRLLELNEMEEFRAQAYE 519
               +A + P       Y  +        ++G                L  +E  R Q ++
Sbjct: 1535 ---RARNTPGRFGLTPYEILYGGPPPILESGGTLGPDDNFLPVLFTHLKALEVVRTQIWD 1594

Query: 520  NAK-LYKECTTRWHDKKINSQTFLLGQRVLLFNSRLRLFPGKLRTRWSGPFVIV 562
              K +YK  T            F +G +VL+   R    PG L  RW GP++++
Sbjct: 1595 QIKEVYKPGTV------AIPHPFQVGDQVLVRRHR----PGSLEPRWKGPYLVL 1632

BLAST of Clc11G08940 vs. ExPASy Swiss-Prot
Match: Q9TTC1 (Gag-Pol polyprotein OS=Koala retrovirus OX=394239 GN=pro-pol PE=3 SV=2)

HSP 1 Score: 89.7 bits (221), Expect = 1.1e-16
Identity = 83/295 (28.14%), Positives = 123/295 (41.69%), Query Frame = 0

Query: 279  GRTCHAIDMIDHTISEHVVKSCDRCQRTDNISRQHELPMKPILEVELFDVWGIDFMGPFP 338
            GRT   I  +   + E +   C  C  T+ ++   E P +          W +DF    P
Sbjct: 1354 GRTSFHIPNLQSVVRE-ITSKCQVCAVTNAVTTYRE-PGRRQRGDRPGVYWEVDFTEVKP 1413

Query: 339  MSSDGYLYILVAVDYVSKWVEAMATRTNDARTVLKFLHKNIFTRFGTPGAIISDEGSHFC 398
                G  Y+LV +D  S WVEA  T+T  A TV K + + I  RFG P  + SD G  F 
Sbjct: 1414 -GRYGNRYLLVFIDTFSGWVEAFPTKTETALTVCKKILEEILPRFGIPKVLGSDNGPAFV 1473

Query: 399  NKLFESMMQKYNVNHKIATIYYPETNGLAELSNREIKQVLEK-TVKINRKDWALKLDDAL 458
             ++ + +  +  ++ K+   Y P+++G  E  NR IK+ L K  ++   KDW   L  AL
Sbjct: 1474 AQVSQGLATQLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLALETGGKDWVTLLPLAL 1533

Query: 459  LVFEKACH----LPVELEH------RAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQAY 518
            L            P E+ H       A   +   N DF          L  +E  R Q +
Sbjct: 1534 LRARNTPGQFGLTPYEILHGGPPPVLASGEVVGSNGDFFPV---LFTHLKALEVVRTQIW 1593

Query: 519  ENAK-LYKECTTRWHDKKINSQTFLLGQRVLLFNSRLRLFPGKLRTRWSGPFVIV 562
            +  K  Y+  T            F +G RVL+   R     G L  RW GP++++
Sbjct: 1594 DQIKEAYRPGTV------AIPHPFQVGDRVLVRRHR----SGSLEPRWKGPYLVL 1632

BLAST of Clc11G08940 vs. ExPASy Swiss-Prot
Match: P21414 (Gag-Pol polyprotein OS=Gibbon ape leukemia virus OX=11840 GN=pol PE=3 SV=2)

HSP 1 Score: 88.2 bits (217), Expect = 3.3e-16
Identity = 82/294 (27.89%), Positives = 125/294 (42.52%), Query Frame = 0

Query: 280  RTCHAIDMIDHTISEHVVKSCDRCQRTDNISRQHELPMKPILEVELFDVWGIDFMGPFPM 339
            RT   I  +   + E V   C  C  T+ ++   E   +   +      W +DF    P 
Sbjct: 1354 RTSLLIPNLQSAVRE-VTSQCQACAMTNAVTTYRETGKRQRGD-RPGVYWEVDFTEIKP- 1413

Query: 340  SSDGYLYILVAVDYVSKWVEAMATRTNDARTVLKFLHKNIFTRFGTPGAIISDEGSHFCN 399
               G  Y+LV +D  S WVEA  T+T  A  V K + + I  RFG P  + SD G  F  
Sbjct: 1414 GRYGNKYLLVFIDTFSGWVEAFPTKTETALIVCKKILEEILPRFGIPKVLGSDNGPAFVA 1473

Query: 400  KLFESMMQKYNVNHKIATIYYPETNGLAELSNREIKQVLEK-TVKINRKDWALKLDDALL 459
            ++ + +  +  +N K+   Y P+++G  E  NR IK+ L K  ++   KDW   L  ALL
Sbjct: 1474 QVSQGLATQLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLALETGGKDWVTLLPLALL 1533

Query: 460  VFEKACHLPVELEHRAYWAIKKLNMDFEKAGE-----KRLL-----ELNEMEEFRAQAYE 519
               +A + P       Y  +        ++GE      R L      L  +E  R Q ++
Sbjct: 1534 ---RARNTPGRFGLTPYEILYGGPPPILESGETLGPDDRFLPVLFTHLKALEIVRTQIWD 1593

Query: 520  NAK-LYKECTTRWHDKKINSQTFLLGQRVLLFNSRLRLFPGKLRTRWSGPFVIV 562
              K +YK  T            F +G +VL+   R    P  L  RW GP++++
Sbjct: 1594 QIKEVYKPGTV------TIPHPFQVGDQVLVRRHR----PSSLEPRWKGPYLVL 1631

BLAST of Clc11G08940 vs. ExPASy TrEMBL
Match: A0A6P6G9R2 (LOW QUALITY PROTEIN: uncharacterized protein LOC112492084 OS=Ziziphus jujuba OX=326968 GN=LOC112492084 PE=4 SV=1)

HSP 1 Score: 395.6 bits (1015), Expect = 3.6e-106
Identity = 247/586 (42.15%), Positives = 306/586 (52.22%), Query Frame = 0

Query: 190 KLDIGEVQPITITLQIADRSLAYLKGIVEDVLVKVDKFIFPIDFVVLDMEEDSEVPIILG 249
           KL +G+V+P T TLQ+ADRS+   +GI+EDVLVKV+KFIFP DFV+LDMEED  +PIILG
Sbjct: 401 KLGVGDVKPTTGTLQMADRSIKRPRGILEDVLVKVNKFIFPADFVILDMEEDDNIPIILG 460

Query: 250 RPFLVIRKAIID------------------------------------GEY-------VV 309
           RPFL   +A+ID                                    GE        ++
Sbjct: 461 RPFLATGRALIDVQQRQVTLRVLNEEEDHKPTVEHQRRLNPNLKEVVHGEILKLLDVGII 520

Query: 310 FNIYKS-------------------------------------------------LSHHD 369
           + I  S                                                 ++  D
Sbjct: 521 YPISDSNWVSPIQVVPKKGGMTVQENDKSELIPTRLAGKEYYCFLDGYSGYNQIAIALDD 580

Query: 370 EGRT------------------CHAI------------DMI------------------- 429
           + +T                  C+A+            DM+                   
Sbjct: 581 QEKTTFTCPYGTFAYRRMPFGLCNALATFQRCMMSIFSDMVEDIIEIFMADFSVFGDSFT 640

Query: 430 -------------DHTISEH---------------------------------------- 489
                        DH+  ++                                        
Sbjct: 641 SCLQNLSRTIVYTDHSAIKYPMSKKESKPRLIKWGHFGTRKTIAKILNSGFYWPSMFKDT 700

Query: 490 --VVKSCDRCQRTDNISRQHELPMKPILEVELFDVWGIDFMGPFPMSSDGYLYILVAVDY 549
              V+ CDRCQRT NISR++E+P+K ILEVELFDVWGIDFMGPFP S  G  YILVAVDY
Sbjct: 701 NIYVQGCDRCQRTRNISRKNEMPLKNILEVELFDVWGIDFMGPFPPSC-GNKYILVAVDY 760

Query: 550 VSKWVEAMATRTNDARTVLKFLHKNIFTRFGTPGAIISDEGSHFCNKLFESMMQKYNVNH 563
           VSKWVEA A  TNDA  V+KFL K IFTRFGTP AIISD G+HFCNK FES++ KY V H
Sbjct: 761 VSKWVEASALPTNDAWVVVKFLKKYIFTRFGTPXAIISDGGTHFCNKQFESLLAKYGVRH 820

BLAST of Clc11G08940 vs. ExPASy TrEMBL
Match: A0A6P6GGL5 (LOW QUALITY PROTEIN: uncharacterized protein LOC112492878 OS=Ziziphus jujuba OX=326968 GN=LOC112492878 PE=4 SV=1)

HSP 1 Score: 385.6 bits (989), Expect = 3.7e-103
Identity = 191/283 (67.49%), Positives = 223/283 (78.80%), Query Frame = 0

Query: 297  VKSCDRCQRTDNISRQHELPMKPILEVELFDVWGIDFMGPFPMSSDGYLYILVAVDYVSK 356
            V+ CDRCQRT NISR++E+P+K ILEVELFDVWGIDFMGPFP SS G  YILVAVDYVSK
Sbjct: 1152 VQGCDRCQRTGNISRKNEMPLKNILEVELFDVWGIDFMGPFP-SSCGNKYILVAVDYVSK 1211

Query: 357  WVEAMATRTNDARTVLKFLHKNIFTRFGTPGAIISDEGSHFCNKLFESMMQKYNVNHKIA 416
            WVEA    TNDAR V+KFL K IFTRFGTP AIISD G+HFCNK FES++ KY V HKIA
Sbjct: 1212 WVEASVLPTNDARVVVKFLKKYIFTRFGTPRAIISDGGTHFCNKQFESLLAKYGVRHKIA 1271

Query: 417  TIYYPETNGLAELSNREIKQVLEKTVKINRKDWALKLDDAL-----------------LV 476
            T Y+P+T+G  E+SNREIK++LEKTV  +RKDW+LKLDDAL                 LV
Sbjct: 1272 TPYHPQTSGQVEISNREIKRILEKTVNASRKDWSLKLDDALWAYRTAYKTPIGTSPYKLV 1331

Query: 477  FEKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAKLYKECTTR 536
            F K CHLPVELEH+AYWA K LN D E  G+ RLL+L+E+EEFR  A+ENAK+YKE T R
Sbjct: 1332 FGKECHLPVELEHKAYWATKFLNFDQEAMGKNRLLQLDELEEFRMDAFENAKIYKEKTKR 1391

Query: 537  WHDKKINSQTFLLGQRVLLFNSRLRLFPGKLRTRWSGPFVIVK 563
            WHDK I  +TF +GQ+VLL+NSRL+LFPGKLR+RWSGP+ IV+
Sbjct: 1392 WHDKMIKKRTFHVGQKVLLYNSRLKLFPGKLRSRWSGPYDIVQ 1433

BLAST of Clc11G08940 vs. ExPASy TrEMBL
Match: A0A6P6GGL5 (LOW QUALITY PROTEIN: uncharacterized protein LOC112492878 OS=Ziziphus jujuba OX=326968 GN=LOC112492878 PE=4 SV=1)

HSP 1 Score: 102.1 bits (253), Expect = 8.2e-18
Identity = 58/123 (47.15%), Positives = 77/123 (62.60%), Query Frame = 0

Query: 190 KLDIGEVQPITITLQIADRSLAYLKGIVEDVLVKVDKFIFPIDFVVLDMEEDSEVPIILG 249
           KL +G+V+P T+TLQ+ADRS+   +GI+EDVLVKV+KFIFP DFV+LDMEED  +PIILG
Sbjct: 488 KLGVGDVKPTTVTLQMADRSIKRPRGILEDVLVKVNKFIFPADFVILDMEEDDNIPIILG 547

Query: 250 RPFLVIRKAIID-----------GEYVVFNI--YKSLSHHDEGRTCHAIDMIDHTISEHV 300
           RPFL   +A+ID            E V F I       + DE   C  +D  D  +++  
Sbjct: 548 RPFLATGRALIDVQQRQVTLRVLNEEVSFQIPNVVKFPNLDEISYCFLVDACDELVNDMA 607


HSP 2 Score: 381.3 bits (978), Expect = 7.0e-102
Identity = 182/286 (63.64%), Positives = 227/286 (79.37%), Query Frame = 0

Query: 298 KSCDRCQRTDNISRQHELPMKPILEVELFDVWGIDFMGPFPMSSDGYLYILVAVDYVSKW 357
           ++CDRCQRT  I+++HE+P++ IL VELFDVWGIDFMGPFP  S+G+ YILVAVDYVSKW
Sbjct: 121 ENCDRCQRTGTITKKHEMPLQNILAVELFDVWGIDFMGPFPY-SNGHRYILVAVDYVSKW 180

Query: 358 VEAMATRTNDARTVLKFLHKNIFTRFGTPGAIISDEGSHFCNKLFESMMQKYNVNHKIAT 417
           VEA+A  TNDA+ V+ F+ K+IFTRFGTP  +ISD G+HFCNKL ++++ KY V HK+AT
Sbjct: 181 VEAIALPTNDAKVVVSFVKKHIFTRFGTPRVLISDGGTHFCNKLLKNVLAKYGVRHKVAT 240

Query: 418 IYYPETNGLAELSNREIKQVLEKTVKINRKDWALKLDDAL-----------------LVF 477
            Y+P+T+G  E+SNRE+KQ+LEKTV  NRKDW+ KL+DAL                 LV+
Sbjct: 241 AYHPQTSGQVEVSNREVKQILEKTVSANRKDWSGKLEDALWAYRTAYKTPIGTSPYRLVY 300

Query: 478 EKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAKLYKECTTRW 537
            KACHLPVE+EH+AYWAIKKLNM+ + AGEKRLL+LNE++EFR  AYENAKLYKE T +W
Sbjct: 301 GKACHLPVEIEHKAYWAIKKLNMNMDLAGEKRLLQLNELDEFRLHAYENAKLYKEKTKKW 360

Query: 538 HDKKINSQTFLLGQRVLLFNSRLRLFPGKLRTRWSGPFVIVK-CPH 566
           HDK I  + F  GQ VLLFNSRL+LFPGKL++RW+GPFV+V   PH
Sbjct: 361 HDKHIQHREFEPGQEVLLFNSRLKLFPGKLKSRWAGPFVVVSVTPH 405

BLAST of Clc11G08940 vs. ExPASy TrEMBL
Match: A0A4Y1QYH5 (Transposable element protein OS=Prunus dulcis OX=3755 GN=Prudu_005832 PE=4 SV=1)

HSP 1 Score: 378.6 bits (971), Expect = 4.6e-101
Identity = 184/279 (65.95%), Positives = 217/279 (77.78%), Query Frame = 0

Query: 300 CDRCQRTDNISRQHELPMKPILEVELFDVWGIDFMGPFPMSSDGYLYILVAVDYVSKWVE 359
           CDRCQR  NISR++ELP+K IL VELFDVWGIDFMGPFP SS GY YILVAVDYVSKWVE
Sbjct: 574 CDRCQRMGNISRRNELPLKNILFVELFDVWGIDFMGPFP-SSFGYTYILVAVDYVSKWVE 633

Query: 360 AMATRTNDARTVLKFLHKNIFTRFGTPGAIISDEGSHFCNKLFESMMQKYNVNHKIATIY 419
           A+AT+TND + VLKFL  NIFTRFGTP A+ISD GSHFCNKLFE++M+KYN+ H+++T Y
Sbjct: 634 AIATKTNDHKVVLKFLRDNIFTRFGTPRAVISDGGSHFCNKLFEALMKKYNITHRVSTPY 693

Query: 420 YPETNGLAELSNREIKQVLEKTVKINRKDWALKLDDAL-----------------LVFEK 479
           +P+T+G  E+SNREIKQ+LEK V   RKDWA KL+DAL                 LVF K
Sbjct: 694 HPQTSGQVEISNREIKQILEKVVNSTRKDWAAKLNDALWAYRTAYKTPIGMSPYRLVFGK 753

Query: 480 ACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAKLYKECTTRWHD 539
           ACHLP+ELEH A+WAIKKLN D +KAG  R  +LNE+EE R ++YENAKLYKE T  +HD
Sbjct: 754 ACHLPMELEHNAFWAIKKLNFDLDKAGHVRKFQLNELEEIRHESYENAKLYKERTKSYHD 813

Query: 540 KKINSQTFLLGQRVLLFNSRLRLFPGKLRTRWSGPFVIV 562
           + I  + F  G  VLLFNSRLRLFPGKL++RW GPF +V
Sbjct: 814 RNIQRKEFTKGMSVLLFNSRLRLFPGKLKSRWLGPFTVV 851

BLAST of Clc11G08940 vs. ExPASy TrEMBL
Match: A0A5H2XID6 (Reverse transcriptase OS=Prunus dulcis OX=3755 GN=Prudu_268S000200 PE=4 SV=1)

HSP 1 Score: 378.6 bits (971), Expect = 4.6e-101
Identity = 184/279 (65.95%), Positives = 217/279 (77.78%), Query Frame = 0

Query: 300  CDRCQRTDNISRQHELPMKPILEVELFDVWGIDFMGPFPMSSDGYLYILVAVDYVSKWVE 359
            CDRCQR  NISR++ELP+K IL VELFDVWGIDFMGPFP SS GY YILVAVDYVSKWVE
Sbjct: 1221 CDRCQRMGNISRRNELPLKNILFVELFDVWGIDFMGPFP-SSFGYTYILVAVDYVSKWVE 1280

Query: 360  AMATRTNDARTVLKFLHKNIFTRFGTPGAIISDEGSHFCNKLFESMMQKYNVNHKIATIY 419
            A+AT+TND + VLKFL  NIFTRFGTP A+ISD GSHFCNKLFE++M+KYN+ H+++T Y
Sbjct: 1281 AIATKTNDHKVVLKFLRDNIFTRFGTPRAVISDGGSHFCNKLFEALMKKYNITHRVSTPY 1340

Query: 420  YPETNGLAELSNREIKQVLEKTVKINRKDWALKLDDAL-----------------LVFEK 479
            +P+T+G  E+SNREIKQ+LEK V   RKDWA KL+DAL                 LVF K
Sbjct: 1341 HPQTSGQVEISNREIKQILEKVVNSTRKDWAAKLNDALWAYRTAYKTPIGMSPYRLVFGK 1400

Query: 480  ACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAKLYKECTTRWHD 539
            ACHLP+ELEH A+WAIKKLN D +KAG  R  +LNE+EE R ++YENAKLYKE T  +HD
Sbjct: 1401 ACHLPMELEHNAFWAIKKLNFDLDKAGHVRKFQLNELEEIRHESYENAKLYKERTKSYHD 1460

Query: 540  KKINSQTFLLGQRVLLFNSRLRLFPGKLRTRWSGPFVIV 562
            + I  + F  G  VLLFNSRLRLFPGKL++RW GPF +V
Sbjct: 1461 RNIQRKEFTKGMSVLLFNSRLRLFPGKLKSRWLGPFTVV 1498

BLAST of Clc11G08940 vs. TAIR 10
Match: ATMG00750.1 (GAG/POL/ENV polyprotein )

HSP 1 Score: 53.9 bits (128), Expect = 5.0e-07
Identity = 23/38 (60.53%), Positives = 29/38 (76.32%), Query Frame = 0

Query: 297 VKSCDRCQRTDNISRQHELPMKPILEVELFDVWGIDFM 335
           V SCD CQR  N ++++E+P   ILEVE+FDVWGI FM
Sbjct: 53  VSSCDACQRKGNFTKRNEMPQHFILEVEVFDVWGIYFM 90
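
The identity figure reported for this HSP can be re-derived directly from the two aligned strings. The snippet below is a minimal sketch, not part of the database output: the function name `hsp_identity` is hypothetical, and it assumes the query and subject rows are copied exactly as shown, with gaps written as `-`. It counts only identical residues; the positives figure additionally depends on the substitution matrix used by BLAST and is not reproduced here.

```python
# Aligned residues copied from the TAIR 10 HSP above.
QUERY = "VKSCDRCQRTDNISRQHELPMKPILEVELFDVWGIDFM"
SBJCT = "VSSCDACQRKGNFTKRNEMPQHFILEVEVFDVWGIYFM"


def hsp_identity(query, sbjct):
    """Return (identical columns, alignment length, percent identity).

    Hypothetical helper: both strings must be the same length, and a gap
    character '-' never counts as a match.
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have equal length")
    matches = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    length = len(query)
    return matches, length, round(100.0 * matches / length, 2)


if __name__ == "__main__":
    matches, length, percent = hsp_identity(QUERY, SBJCT)
    print(f"Identity = {matches}/{length} ({percent:.2f}%)")
```

Run on the strings above, this agrees with the reported identity of 23/38 (60.53%).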

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_023874613.1  1.3e-110  70.07         uncharacterized protein LOC111987139 [Quercus suber]
XP_023874613.1  5.4e-32   27.66         uncharacterized protein LOC111987139 [Quercus suber]
XP_012831341.1  4.3e-29   28.87         PREDICTED: uncharacterized protein LOC105952343 [Erythranthe guttata]
XP_012833448.1  7.4e-13   35.42         PREDICTED: uncharacterized protein LOC105954320 [Erythranthe guttata]
XP_012833687.1  5.7e-13   35.42         PREDICTED: uncharacterized protein LOC105954563 [Erythranthe guttata] >XP_012857...


Match Name  E-value   Identity (%)  Description
P10272      2.1e-18   29.75         Gag-Pol polyprotein OS=Baboon endogenous virus (strain M7) OX=11764 GN=pol PE=3 ...
P31792      1.8e-17   30.11         Pol polyprotein (Fragment) OS=Feline endogenous virus ECE1 OX=11766 GN=pol PE=3 ...
P03359      5.1e-17   27.55         Gag-Pol polyprotein OS=Woolly monkey sarcoma virus OX=11970 GN=pol PE=3 SV=2
Q9TTC1      1.1e-16   28.14         Gag-Pol polyprotein OS=Koala retrovirus OX=394239 GN=pro-pol PE=3 SV=2
P21414      3.3e-16   27.89         Gag-Pol polyprotein OS=Gibbon ape leukemia virus OX=11840 GN=pol PE=3 SV=2

Match Name  E-value   Identity (%)  Description
A0A6P6G9R2  3.6e-106  42.15         LOW QUALITY PROTEIN: uncharacterized protein LOC112492084 OS=Ziziphus jujuba OX=...
A0A6P6GGL5  3.7e-103  67.49         LOW QUALITY PROTEIN: uncharacterized protein LOC112492878 OS=Ziziphus jujuba OX=...
A0A6P6GGL5  8.2e-18   47.15         LOW QUALITY PROTEIN: uncharacterized protein LOC112492878 OS=Ziziphus jujuba OX=...
A0A4Y1QYH5  4.6e-101  65.95         Transposable element protein OS=Prunus dulcis OX=3755 GN=Prudu_005832 PE=4 SV=1
A0A5H2XID6  4.6e-101  65.95         Reverse transcriptase OS=Prunus dulcis OX=3755 GN=Prudu_268S000200 PE=4 SV=1


Match Name   E-value  Identity (%)  Description
ATMG00750.1  5.0e-07  60.53         GAG/POL/ENV polyprotein

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term   IPR Description                        Source       Source Term      Source Description                                                        Alignment
IPR001584  Integrase, catalytic core              PFAM         PF00665          rve                                                                       coord: 327..421; e-value: 2.7E-14; score: 53.3
IPR001584  Integrase, catalytic core              PROSITE      PS50994          INTEGRASE                                                                 coord: 315..506; score: 20.523663
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10       Acid Proteases                                                            coord: 183..273; e-value: 8.9E-10; score: 40.3
IPR036397  Ribonuclease H superfamily             GENE3D       3.30.420.10                                                                                coord: 320..512; e-value: 1.8E-41; score: 143.6
None       No IPR available                       PANTHER      PTHR24559:SF373  REVERSE TRANSCRIPTASE DOMAIN, RIBONUCLEASE H-LIKE DOMAIN PROTEIN-RELATED  coord: 297..482
None       No IPR available                       PANTHER      PTHR24559        TRANSPOSON TY3-I GAG-POL POLYPROTEIN                                      coord: 297..482
None       No IPR available                       CDD          cd00303          retropepsin_like                                                          coord: 189..254; e-value: 6.74935E-8; score: 48.4868
IPR012337  Ribonuclease H-like superfamily        SUPERFAMILY  53098            Ribonuclease H-like                                                       coord: 325..461

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Clc11G08940.1  Clc11G08940.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding