ClCG11G009740 (gene) Watermelon (Charleston Gray) v2.5

Overview
NameClCG11G009740
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGag-pol polyprotein
LocationCG_Chr11: 16028155 .. 16030351 (-)
RNA-Seq ExpressionClCG11G009740
SyntenyClCG11G009740
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGTCTATCAGGGAAGGTGGATCAACAACTTGTCCTTCTGTACTTGATGGTTCAAACTATTCGTATTGGAAGGCTAGGATGACAACCTTCTTAAAATCTATCGACAACAAGACCTGGAAAGTCGTCATTTTCGGATGGACTCCTCCTCAAGTCACTGATACAGATGGAAATGTGAGTCTTAAGCTTGAGAAAGACTTTACGGAAGATGAAGATGAGGTGTTAGGGAATTGTCAGGCTCTTAATGCCATCTTTAACAGAGTAGATAAGAATATCTTTTGCTTAATCAACACTTGTGTCTCTGCCAAAGAAGCATGGGATATTCTTGCTGTTGCACATGAAGGAACCTCCAAAGTAAAAATGTCTACACTGCAGCTTCTAACAACAAAATTCAAATCTCTAAAGATGCTGAAAGAAGAAACTATAACTAAGTTTAATGTATGACTCTTGATATTGCAAATGAATAGTTTGCTCTCGTTGAGAAGATATCAGAAGAAAAGTTGGTGCGTAAGGTTCTTCGATCCCTTCCTAAAAGATTTGATATGAAAGTTATAGCTATTGAAGAGGCTCATCACATTGCCACCATGAAAGTTGATGAGCTTTTTGGCTTTATGTGTACTTTCAAAATGTCGTTTGATGACAAGTCTGATAAGAAATCTAAGAGTATTGCATTACAGTCGACTATTGAAAATGATGCTCCTATTGTCAAAATTAAGGAATCTGATCAGAACCTCGCTCAATCGATATCTCTTCTGGCCAAGAAGTTTGGAAAGGTCCTCAGGCGATGGGACAAACGTGGAGGATCTCGGGGTAATCATGTGTCTCCCAATGTCCAAGACAACAATAGTCCAAATAATCACTCCAACCAAAAGTCTTCAAGATTATTTGAAAGAAAACTTGAATATGTAAGAGGATTGGGAAGCAATCAAGGATTGAGAGACAAAAATTCAGATGTAGAGAATGTGAGGGATTTGACCATTACCAAGCTGAATGTCCAAACTTTCTGAAAAGACAGAACAAGAGTTACTCTGTGACATTATTCGATGATGACCATGAATCAAGTAGTGACTCTGATGATGAAATTCGTGCTTTAATGAGATGTTTATCTCTTGAGAGTTCCCAAGTGACATCCCCTTCGGATATCGTGATTGCTATGGTTATTGAGAAGACTTAAGAGAATAAGTCCAGTCATGACGAACTTTCTTTCGAAGACATTATCAGCATTGGATTGAAGACGCCCAAGCCATTATTGTTCAGAAAAAGAGAATTGAAAAGTTGATAGAAGACAACCACTGTCTTTTGAGCACTATTTTTGAATTGAAGAAGGAATTGAAATCCTCTAAAGCTGAGCTTGAAGTAATGACCAAGTCAGTTCGTATGTTGAATTTTAGTACTAATGATCTTAACAAGATTTTGTCTTGTGATAAGCAAGTTTCAAAATTCTTGTGGTAAGAATTTTAGTACTTTGATTCCATGATCCAGAAATCACTCAACGCAACCTTTCAATCAAATCCTTCTAATTCAAACTTCATCTACGGCTTATGTAAACTAACACTTGAGCATAAACCGTTCTTGTTCTACTTCTTCCCTTTAAAGGCTCTTCAACGCATTGGTCTTAGAAAAAGGGTGTGACATTGCGTTAGCTGTCAACCTTGCGTTCAAGGAGCAAGGTTGCGTCCAACTGAGCTGCCATAAAAGAAAAACAGCTACCTTGCAGTTTAGAGTCTCAAGCCTTGCGTTCAATGAGCTTCGGAAAGCAGAATTTGAAGGAAAAAAAGGGAATTCCTTAAGCGTTTTTGTGAGTGACACTTTCCTTTATTTGAAAATGATTTTTTGGAAATGTTGTTCTGGTTTTCTTATAAGCTGTGGTTTGATGCTAAAGTTATTATTTATGAAAGATTTCTCTATGAACTGATTATGGAAAAATTGGTTTCTAAAATGTTTTAAAGGAGGCTCAATGCTGAAGGACGTGAGAAGGCATGTGAGTTTTCCCAGGACCCAGTACTGAGGGACATAGAGAAGGTACTTAGGCAACTCGCCATATTCCTATGTGCACATTGAGGTTGTGATGTTGGGTGTGTTGAGTCTTGCTCCACCCCAACTAAGGTTGTTTTCAGCGTTGAGTGCATTGAGTTTTGCTCCGCCTCAACTAAGGTTGTTTTCGACGTTTAGTGCGTTGGGTTTTGCTCCGCCTTAA

mRNA sequence

ATGGAGTCTATCAGGGAAGGTGGATCAACAACTTGTCCTTCTGTACTTGATGGTTCAAACTATTCGTATTGGAAGGCTAGGATGACAACCTTCTTAAAATCTATCGACAACAAGACCTGGAAAGTCGTCATTTTCGGATGGACTCCTCCTCAAGTCACTGATACAGATGGAAATGTGAGTCTTAAGCTTGAGAAAGACTTTACGGAAGATGAAGATGAGGTGTTAGGGAATTGTCAGGCTCTTAATGCCATCTTTAACAGAGTAGATAAGAATATCTTTTGCTTAATCAACACTTGTGTCTCTGCCAAAGAAGCATGGGATATTCTTGCTGTTGCACATGAAGGAACCTCCAAATTTGCTCTCGTTGAGAAGATATCAGAAGAAAAGTTGGTGCGTAAGGTTCTTCGATCCCTTCCTAAAAGATTTGATATGAAAGTTATAGCTATTGAAGAGGCTCATCACATTGCCACCATGAAAGTTGATGAGCTTTTTGGCTTTATGTGTACTTTCAAAATGTCGTTTGATGACAAGTCTGATAAGAAATCTAAGAGTATTGCATTACAGTCGACTATTGAAAATGATGCTCCTATTGTCAAAATTAAGGAATCTGATCAGAACCTCGCTCAATCGATATCTCTTCTGGCCAAGAAGTTTGGAAAGGTCCTCAGGCGATGGGACAAACGTGGAGGATCTCGGGGTAATCATGTGTCTCCCAATGTCCAAGACAACAATAGTCCAAATAATCACTCCAACCAAAAGATTGGGAAGCAATCAAGGATTGAGAGACAAAAATTCAGATGTAGAGAATGTGAGGGATTTGACCATTACCAAGCTGAATGTCCAAACTTTCTGAAAAGACAGAACAAGAGTTACTCTGTGACATTATTCGATGATGACCATGAATCAAGTAGTGACTCTGATGATGAAATTCGTGCTTTAATGAGATGTTTATCTCTTGAGAGTTCCCAAGTGACATCCCCTTCGGATATCGTGATTGCTATGCATTGGATTGAAGACGCCCAAGCCATTATTGTTCAGAAAAAGAGAATTGAAAAGTTGATAGAAGACAACCACTGTCTTTTGAGCACTATTTTTGAATTGAAGAAGGAATTGAAATCCTCTAAAGCTGAGCTTGAAGTAATGACCAAGTCAGTTCAAAAAGGGTGTGACATTGCGTTAGCTGTCAACCTTGCGTTCAAGGAGCAAGGTTGCGTCCAACTGAGCTGCCATAAAAGAAAAACAGCTACCTTGCAGTTTAGAGTCTCAAGCCTTGCGTTCAATGAGCTTCGGAAAGCAGAATTTGAAGGAAAAAAAGGGAATTCCTTAAGCGTTTTTGTGAGACGTGAGAAGGCATGTGAGTTTTCCCAGGACCCAGTACTGAGGGACATAGAGAAGGTTGTGATGTTGGGTGTGTTGAGTCTTGCTCCACCCCAACTAAGGTTGTTTTCAGCGTTGAGTGCATTGAGTTTTGCTCCGCCTCAACTAAGGTTGTTTTCGACGTTTAGTGCGTTGGGTTTTGCTCCGCCTTAA

Coding sequence (CDS)

ATGGAGTCTATCAGGGAAGGTGGATCAACAACTTGTCCTTCTGTACTTGATGGTTCAAACTATTCGTATTGGAAGGCTAGGATGACAACCTTCTTAAAATCTATCGACAACAAGACCTGGAAAGTCGTCATTTTCGGATGGACTCCTCCTCAAGTCACTGATACAGATGGAAATGTGAGTCTTAAGCTTGAGAAAGACTTTACGGAAGATGAAGATGAGGTGTTAGGGAATTGTCAGGCTCTTAATGCCATCTTTAACAGAGTAGATAAGAATATCTTTTGCTTAATCAACACTTGTGTCTCTGCCAAAGAAGCATGGGATATTCTTGCTGTTGCACATGAAGGAACCTCCAAATTTGCTCTCGTTGAGAAGATATCAGAAGAAAAGTTGGTGCGTAAGGTTCTTCGATCCCTTCCTAAAAGATTTGATATGAAAGTTATAGCTATTGAAGAGGCTCATCACATTGCCACCATGAAAGTTGATGAGCTTTTTGGCTTTATGTGTACTTTCAAAATGTCGTTTGATGACAAGTCTGATAAGAAATCTAAGAGTATTGCATTACAGTCGACTATTGAAAATGATGCTCCTATTGTCAAAATTAAGGAATCTGATCAGAACCTCGCTCAATCGATATCTCTTCTGGCCAAGAAGTTTGGAAAGGTCCTCAGGCGATGGGACAAACGTGGAGGATCTCGGGGTAATCATGTGTCTCCCAATGTCCAAGACAACAATAGTCCAAATAATCACTCCAACCAAAAGATTGGGAAGCAATCAAGGATTGAGAGACAAAAATTCAGATGTAGAGAATGTGAGGGATTTGACCATTACCAAGCTGAATGTCCAAACTTTCTGAAAAGACAGAACAAGAGTTACTCTGTGACATTATTCGATGATGACCATGAATCAAGTAGTGACTCTGATGATGAAATTCGTGCTTTAATGAGATGTTTATCTCTTGAGAGTTCCCAAGTGACATCCCCTTCGGATATCGTGATTGCTATGCATTGGATTGAAGACGCCCAAGCCATTATTGTTCAGAAAAAGAGAATTGAAAAGTTGATAGAAGACAACCACTGTCTTTTGAGCACTATTTTTGAATTGAAGAAGGAATTGAAATCCTCTAAAGCTGAGCTTGAAGTAATGACCAAGTCAGTTCAAAAAGGGTGTGACATTGCGTTAGCTGTCAACCTTGCGTTCAAGGAGCAAGGTTGCGTCCAACTGAGCTGCCATAAAAGAAAAACAGCTACCTTGCAGTTTAGAGTCTCAAGCCTTGCGTTCAATGAGCTTCGGAAAGCAGAATTTGAAGGAAAAAAAGGGAATTCCTTAAGCGTTTTTGTGAGACGTGAGAAGGCATGTGAGTTTTCCCAGGACCCAGTACTGAGGGACATAGAGAAGGTTGTGATGTTGGGTGTGTTGAGTCTTGCTCCACCCCAACTAAGGTTGTTTTCAGCGTTGAGTGCATTGAGTTTTGCTCCGCCTCAACTAAGGTTGTTTTCGACGTTTAGTGCGTTGGGTTTTGCTCCGCCTTAA

Protein sequence

MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVSLKLEKDFTEDEDEVLGNCQALNAIFNRVDKNIFCLINTCVSAKEAWDILAVAHEGTSKFALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSRIERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDIVIAMHWIEDAQAIIVQKKRIEKLIEDNHCLLSTIFELKKELKSSKAELEVMTKSVQKGCDIALAVNLAFKEQGCVQLSCHKRKTATLQFRVSSLAFNELRKAEFEGKKGNSLSVFVRREKACEFSQDPVLRDIEKVVMLGVLSLAPPQLRLFSALSALSFAPPQLRLFSTFSALGFAPP
Homology
BLAST of ClCG11G009740 vs. NCBI nr
Match: KAA0032410.1 (gag-proteinase polyprotein [Cucumis melo var. makuwa] >TYK21348.1 gag-proteinase polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 280.4 bits (716), Expect = 3.1e-71
Identity = 168/419 (40.10%), Postives = 250/419 (59.67%), Query Frame = 0

Query: 1   MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS 60
           ME IREG ST+ P +LDG NYSYWK+RM  F+K++D + W+ ++ G+ PP +   DG   
Sbjct: 1   MEIIREGPSTSRPLILDGKNYSYWKSRMIFFIKTLDGRAWRALVAGYEPPMII-VDGVSV 60

Query: 61  LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTCVSAKEAWDILAVAHEGTSKF 120
            K E D+T+ E++  +GN +A+NAIFN VD N+F LIN+C +AKEAW +L V +EGTSK+
Sbjct: 61  PKPEVDWTDAEEQASIGNARAINAIFNDVDLNVFKLINSCSTAKEAWRLLEVTYEGTSKY 120

Query: 121 ------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMKVDELFGFM 180
                        L EKI + K+V+KVL+SLP++F+M V AIEEAH I T+++DELFG +
Sbjct: 121 NERVLEDANESLLLDEKIPDSKIVQKVLQSLPRKFEMNVTAIEEAHDITTLELDELFGSL 180

Query: 181 CTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDK 240
            T +M+  DK +KK K IA +S  + +  IV   + + N+ +SI+LL K+F KV+R++  
Sbjct: 181 LTLEMAISDKENKKGKGIAFKSIYQVET-IVNQSDDEANMDESIALLKKQFSKVVRKFKN 240

Query: 241 RGGSRGNHVSPN----------VQDNNSPNNHSNQKIGKQSRIERQKFRCRECEGFDHYQ 300
                 N  +PN           +  N  +N      GK+   E + F+CREC G  HYQ
Sbjct: 241 MNTIGSNAQNPNQYRRKDGENTTRRYNKVSNRRGGDYGKKKGGEGRFFKCRECRGVGHYQ 300

Query: 301 AECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLS-------LESSQVTSPSDI 360
            ECP FL+RQ KS+  TL D+D + S D D  + A   C++        E S+     ++
Sbjct: 301 IECPTFLRRQKKSFRATLSDEDTDDSED-DSGMNAFTACITKIDLGDESECSKENCDEEL 360

Query: 361 V---IAMHWIEDAQAIIVQKKRIEKLIEDNHCLLSTIFELKKELKSSKAELEVMTKSVQ 387
               + + W ED++A  +QK+ I+ L+E+N  L+S I  LK +LK  + + +   K V+
Sbjct: 361 TFEELKVLWKEDSEAKAIQKEIIQDLMEENERLMSVISSLKLKLKEVQNDYDQTIKFVK 416

BLAST of ClCG11G009740 vs. NCBI nr
Match: AAO73521.1 (gag-pol polyprotein [Glycine max])

HSP 1 Score: 275.4 bits (703), Expect = 9.9e-70
Identity = 182/459 (39.65%), Postives = 253/459 (55.12%), Query Frame = 0

Query: 1   MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS 60
           M   +EGG    P +LDGSNY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +
Sbjct: 1   MNMEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61  --LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTCVSAKEAWDILAVAHEGTS 120
             LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LINTC  AK+AW+IL + HEGTS
Sbjct: 61  DELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTS 120

Query: 121 KF--------------------------------------ALVEKISEEKLVRKVLRSLP 180
           K                                       AL E+I++EKLVRK+LRSLP
Sbjct: 121 KVKISRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLP 180

Query: 181 KRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVK 240
           KRFDMKV AIEEA  I  M+VDEL G + TF++   D+++KKSK++A  S  E +     
Sbjct: 181 KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYD 240

Query: 241 IKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR 300
           +  +D+ L  ++ LL K+F KVL R DKR      ++  +++  +     S+ K      
Sbjct: 241 L-NTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKKSDVKPSHSKG 300

Query: 301 IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSL 360
           I+     C  CEG+ H  AECP  LK+  K  SV   D + E  SDSD ++ AL      
Sbjct: 301 IQ-----CHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRDVNALTGI--F 360

Query: 361 ESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIEKLIED----NHCLLSTIFEL 404
           E+++ +S +D  I    +          ++ I+ Q+ +++K+I D           I EL
Sbjct: 361 ETAEDSSDTDSEITFDELATSYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEEISEL 420

BLAST of ClCG11G009740 vs. NCBI nr
Match: AAO73527.1 (gag-pol polyprotein [Glycine max])

HSP 1 Score: 275.0 bits (702), Expect = 1.3e-69
Identity = 181/459 (39.43%), Postives = 253/459 (55.12%), Query Frame = 0

Query: 1   MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS 60
           M   +EGG    P +LDGSNY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +
Sbjct: 1   MNMEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61  --LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTCVSAKEAWDILAVAHEGTS 120
             LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LINTC  AK+AW+IL + HEGTS
Sbjct: 61  DELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTS 120

Query: 121 KF--------------------------------------ALVEKISEEKLVRKVLRSLP 180
           K                                       AL E+I++EKLVRK+LRSLP
Sbjct: 121 KVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLP 180

Query: 181 KRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVK 240
           KRFDMKV AIEEA  I  M+VDEL G + TF++   D+++KKSK++A  S  E +     
Sbjct: 181 KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYD 240

Query: 241 IKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR 300
           + ++D+ L  ++ LL K+F KVL R DKR      ++  +++  +     S+ K      
Sbjct: 241 L-DTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKG 300

Query: 301 IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSL 360
           I+     C  CEG+ H  AECP  LK+  K  SV   D + E  SDSD ++ AL      
Sbjct: 301 IQ-----CHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRDVNALTGI--F 360

Query: 361 ESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIEKLIED----NHCLLSTIFEL 404
           E+++ +S +D  I    +          ++ I+ Q+ +++K+I D           I EL
Sbjct: 361 ETAEDSSDTDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHEEEISEL 420

BLAST of ClCG11G009740 vs. NCBI nr
Match: AAO73529.1 (gag-pol polyprotein [Glycine max])

HSP 1 Score: 273.5 bits (698), Expect = 3.7e-69
Identity = 181/447 (40.49%), Postives = 258/447 (57.72%), Query Frame = 0

Query: 1   MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS 60
           M   +EGG    P +LDG+NY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +
Sbjct: 1   MNMEKEGGPVNRPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61  --LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTCVSAKEAWDILAVAHEGTS 120
             LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LINTC  AK+AW+IL   HEGTS
Sbjct: 61  NELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTS 120

Query: 121 KF--------------------------------------ALVEKISEEKLVRKVLRSLP 180
           K                                       AL E++++EKLVRK+LRSLP
Sbjct: 121 KVKMSRLQLLATKFENLKMKEEECIHDFHMTILEIANACTALGERMTDEKLVRKILRSLP 180

Query: 181 KRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVK 240
           KRFDMKV AIEEA  I  M+VDEL G + TF++   D+++KKSK++A  S  E +     
Sbjct: 181 KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRTEKKSKNLAFVSNDEGEEDEYD 240

Query: 241 IKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR 300
           + ++D+ L  ++  L K+F KVL R D+R      ++S +++  +     S++K      
Sbjct: 241 L-DTDEGLTNAVVFLGKQFNKVLNRMDRRQKPHVRNISLDIRKGSEYQRKSDEKPSHSKG 300

Query: 301 IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDD-DHESSSDSDDEIRALM-RCL 360
           I+     CR CEG+ H +AECP  LK+Q K  SV   DD + E  SDSD ++ AL  R  
Sbjct: 301 IQ-----CRGCEGYGHIKAECPTHLKKQRKGLSVCRSDDTESEQESDSDRDVNALTGRFE 360

Query: 361 SLESSQVTSPSDIV---IAMHWIE---DAQAIIVQKKRIEKLI----EDNHCLLSTIFEL 392
           S E S  T  S+I    +A+ + E    ++ I+ Q+ +++K+I     +       I +L
Sbjct: 361 SAEDSSDTD-SEITFDELAIFYRELCIKSEKILQQEAQLKKVIANLEAEKEAHEEEISKL 420

BLAST of ClCG11G009740 vs. NCBI nr
Match: AAO73523.1 (gag-pol polyprotein [Glycine max])

HSP 1 Score: 270.4 bits (690), Expect = 3.2e-68
Identity = 181/459 (39.43%), Postives = 253/459 (55.12%), Query Frame = 0

Query: 1   MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS 60
           M   +EGG    P +LDGSNY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +
Sbjct: 1   MNMEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61  --LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTCVSAKEAWDILAVAHEGTS 120
             LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LINTC  AK+A +IL   HEGTS
Sbjct: 61  DELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDACEILKSTHEGTS 120

Query: 121 KF--------------------------------------ALVEKISEEKLVRKVLRSLP 180
           K                                       AL E+I++EKLVRK+LRSLP
Sbjct: 121 KVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLP 180

Query: 181 KRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVK 240
           KRFDMKV AIEEA  I  M+VDEL G + TF++   D+++KKSK++A  S  E +     
Sbjct: 181 KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYD 240

Query: 241 IKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR 300
           + ++D+ L  ++ LL K+F KVL R DKR      ++  +++  +     S+ K      
Sbjct: 241 L-DTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKG 300

Query: 301 IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSL 360
           I+     C  CEG+ H  AECP  LK+  K  SV   D + E  SDSD ++ AL+     
Sbjct: 301 IQ-----CHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRDVNALIGI--F 360

Query: 361 ESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIEKLIED----NHCLLSTIFEL 404
           E+++ +S +D  I    +          ++ I+ Q+ +++K+I D           I EL
Sbjct: 361 ETAEDSSDTDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEEISEL 420

BLAST of ClCG11G009740 vs. ExPASy TrEMBL
Match: A0A5D3DCW5 (Gag-proteinase polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1814G00160 PE=4 SV=1)

HSP 1 Score: 280.4 bits (716), Expect = 1.5e-71
Identity = 168/419 (40.10%), Postives = 250/419 (59.67%), Query Frame = 0

Query: 1   MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS 60
           ME IREG ST+ P +LDG NYSYWK+RM  F+K++D + W+ ++ G+ PP +   DG   
Sbjct: 1   MEIIREGPSTSRPLILDGKNYSYWKSRMIFFIKTLDGRAWRALVAGYEPPMII-VDGVSV 60

Query: 61  LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTCVSAKEAWDILAVAHEGTSKF 120
            K E D+T+ E++  +GN +A+NAIFN VD N+F LIN+C +AKEAW +L V +EGTSK+
Sbjct: 61  PKPEVDWTDAEEQASIGNARAINAIFNDVDLNVFKLINSCSTAKEAWRLLEVTYEGTSKY 120

Query: 121 ------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMKVDELFGFM 180
                        L EKI + K+V+KVL+SLP++F+M V AIEEAH I T+++DELFG +
Sbjct: 121 NERVLEDANESLLLDEKIPDSKIVQKVLQSLPRKFEMNVTAIEEAHDITTLELDELFGSL 180

Query: 181 CTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDK 240
            T +M+  DK +KK K IA +S  + +  IV   + + N+ +SI+LL K+F KV+R++  
Sbjct: 181 LTLEMAISDKENKKGKGIAFKSIYQVET-IVNQSDDEANMDESIALLKKQFSKVVRKFKN 240

Query: 241 RGGSRGNHVSPN----------VQDNNSPNNHSNQKIGKQSRIERQKFRCRECEGFDHYQ 300
                 N  +PN           +  N  +N      GK+   E + F+CREC G  HYQ
Sbjct: 241 MNTIGSNAQNPNQYRRKDGENTTRRYNKVSNRRGGDYGKKKGGEGRFFKCRECRGVGHYQ 300

Query: 301 AECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLS-------LESSQVTSPSDI 360
            ECP FL+RQ KS+  TL D+D + S D D  + A   C++        E S+     ++
Sbjct: 301 IECPTFLRRQKKSFRATLSDEDTDDSED-DSGMNAFTACITKIDLGDESECSKENCDEEL 360

Query: 361 V---IAMHWIEDAQAIIVQKKRIEKLIEDNHCLLSTIFELKKELKSSKAELEVMTKSVQ 387
               + + W ED++A  +QK+ I+ L+E+N  L+S I  LK +LK  + + +   K V+
Sbjct: 361 TFEELKVLWKEDSEAKAIQKEIIQDLMEENERLMSVISSLKLKLKEVQNDYDQTIKFVK 416

BLAST of ClCG11G009740 vs. ExPASy TrEMBL
Match: Q84VI4 (Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1)

HSP 1 Score: 275.4 bits (703), Expect = 4.8e-70
Identity = 182/459 (39.65%), Postives = 253/459 (55.12%), Query Frame = 0

Query: 1   MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS 60
           M   +EGG    P +LDGSNY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +
Sbjct: 1   MNMEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61  --LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTCVSAKEAWDILAVAHEGTS 120
             LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LINTC  AK+AW+IL + HEGTS
Sbjct: 61  DELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTS 120

Query: 121 KF--------------------------------------ALVEKISEEKLVRKVLRSLP 180
           K                                       AL E+I++EKLVRK+LRSLP
Sbjct: 121 KVKISRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLP 180

Query: 181 KRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVK 240
           KRFDMKV AIEEA  I  M+VDEL G + TF++   D+++KKSK++A  S  E +     
Sbjct: 181 KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYD 240

Query: 241 IKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR 300
           +  +D+ L  ++ LL K+F KVL R DKR      ++  +++  +     S+ K      
Sbjct: 241 L-NTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKKSDVKPSHSKG 300

Query: 301 IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSL 360
           I+     C  CEG+ H  AECP  LK+  K  SV   D + E  SDSD ++ AL      
Sbjct: 301 IQ-----CHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRDVNALTGI--F 360

Query: 361 ESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIEKLIED----NHCLLSTIFEL 404
           E+++ +S +D  I    +          ++ I+ Q+ +++K+I D           I EL
Sbjct: 361 ETAEDSSDTDSEITFDELATSYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEEISEL 420

BLAST of ClCG11G009740 vs. ExPASy TrEMBL
Match: Q84VH8 (Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1)

HSP 1 Score: 275.0 bits (702), Expect = 6.2e-70
Identity = 181/459 (39.43%), Postives = 253/459 (55.12%), Query Frame = 0

Query: 1   MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS 60
           M   +EGG    P +LDGSNY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +
Sbjct: 1   MNMEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61  --LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTCVSAKEAWDILAVAHEGTS 120
             LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LINTC  AK+AW+IL + HEGTS
Sbjct: 61  DELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTS 120

Query: 121 KF--------------------------------------ALVEKISEEKLVRKVLRSLP 180
           K                                       AL E+I++EKLVRK+LRSLP
Sbjct: 121 KVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLP 180

Query: 181 KRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVK 240
           KRFDMKV AIEEA  I  M+VDEL G + TF++   D+++KKSK++A  S  E +     
Sbjct: 181 KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYD 240

Query: 241 IKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR 300
           + ++D+ L  ++ LL K+F KVL R DKR      ++  +++  +     S+ K      
Sbjct: 241 L-DTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKG 300

Query: 301 IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSL 360
           I+     C  CEG+ H  AECP  LK+  K  SV   D + E  SDSD ++ AL      
Sbjct: 301 IQ-----CHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRDVNALTGI--F 360

Query: 361 ESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIEKLIED----NHCLLSTIFEL 404
           E+++ +S +D  I    +          ++ I+ Q+ +++K+I D           I EL
Sbjct: 361 ETAEDSSDTDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHEEEISEL 420

BLAST of ClCG11G009740 vs. ExPASy TrEMBL
Match: Q84VH6 (Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1)

HSP 1 Score: 273.5 bits (698), Expect = 1.8e-69
Identity = 181/447 (40.49%), Postives = 258/447 (57.72%), Query Frame = 0

Query: 1   MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS 60
           M   +EGG    P +LDG+NY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +
Sbjct: 1   MNMEKEGGPVNRPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61  --LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTCVSAKEAWDILAVAHEGTS 120
             LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LINTC  AK+AW+IL   HEGTS
Sbjct: 61  NELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTS 120

Query: 121 KF--------------------------------------ALVEKISEEKLVRKVLRSLP 180
           K                                       AL E++++EKLVRK+LRSLP
Sbjct: 121 KVKMSRLQLLATKFENLKMKEEECIHDFHMTILEIANACTALGERMTDEKLVRKILRSLP 180

Query: 181 KRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVK 240
           KRFDMKV AIEEA  I  M+VDEL G + TF++   D+++KKSK++A  S  E +     
Sbjct: 181 KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRTEKKSKNLAFVSNDEGEEDEYD 240

Query: 241 IKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR 300
           + ++D+ L  ++  L K+F KVL R D+R      ++S +++  +     S++K      
Sbjct: 241 L-DTDEGLTNAVVFLGKQFNKVLNRMDRRQKPHVRNISLDIRKGSEYQRKSDEKPSHSKG 300

Query: 301 IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDD-DHESSSDSDDEIRALM-RCL 360
           I+     CR CEG+ H +AECP  LK+Q K  SV   DD + E  SDSD ++ AL  R  
Sbjct: 301 IQ-----CRGCEGYGHIKAECPTHLKKQRKGLSVCRSDDTESEQESDSDRDVNALTGRFE 360

Query: 361 SLESSQVTSPSDIV---IAMHWIE---DAQAIIVQKKRIEKLI----EDNHCLLSTIFEL 392
           S E S  T  S+I    +A+ + E    ++ I+ Q+ +++K+I     +       I +L
Sbjct: 361 SAEDSSDTD-SEITFDELAIFYRELCIKSEKILQQEAQLKKVIANLEAEKEAHEEEISKL 420

BLAST of ClCG11G009740 vs. ExPASy TrEMBL
Match: Q84VI2 (Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1)

HSP 1 Score: 270.4 bits (690), Expect = 1.5e-68
Identity = 181/459 (39.43%), Postives = 253/459 (55.12%), Query Frame = 0

Query: 1   MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS 60
           M   +EGG    P +LDGSNY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +
Sbjct: 1   MNMEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61  --LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTCVSAKEAWDILAVAHEGTS 120
             LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LINTC  AK+A +IL   HEGTS
Sbjct: 61  DELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDACEILKSTHEGTS 120

Query: 121 KF--------------------------------------ALVEKISEEKLVRKVLRSLP 180
           K                                       AL E+I++EKLVRK+LRSLP
Sbjct: 121 KVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLP 180

Query: 181 KRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVK 240
           KRFDMKV AIEEA  I  M+VDEL G + TF++   D+++KKSK++A  S  E +     
Sbjct: 181 KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYD 240

Query: 241 IKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR 300
           + ++D+ L  ++ LL K+F KVL R DKR      ++  +++  +     S+ K      
Sbjct: 241 L-DTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKG 300

Query: 301 IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSL 360
           I+     C  CEG+ H  AECP  LK+  K  SV   D + E  SDSD ++ AL+     
Sbjct: 301 IQ-----CHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRDVNALIGI--F 360

Query: 361 ESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIEKLIED----NHCLLSTIFEL 404
           E+++ +S +D  I    +          ++ I+ Q+ +++K+I D           I EL
Sbjct: 361 ETAEDSSDTDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEEISEL 420

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA0032410.13.1e-7140.10gag-proteinase polyprotein [Cucumis melo var. makuwa] >TYK21348.1 gag-proteinase... [more]
AAO73521.19.9e-7039.65gag-pol polyprotein [Glycine max][more]
AAO73527.11.3e-6939.43gag-pol polyprotein [Glycine max][more]
AAO73529.13.7e-6940.49gag-pol polyprotein [Glycine max][more]
AAO73523.13.2e-6839.43gag-pol polyprotein [Glycine max][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5D3DCW51.5e-7140.10Gag-proteinase polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff... [more]
Q84VI44.8e-7039.65Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1[more]
Q84VH86.2e-7039.43Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1[more]
Q84VH61.8e-6940.49Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1[more]
Q84VI21.5e-6839.43Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1[more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 364..384
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 228..260
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 234..254
NoneNo IPR availablePANTHERPTHR34676FAMILY NOT NAMEDcoord: 5..334
NoneNo IPR availablePANTHERPTHR34676:SF8MYOSIN-11-LIKEcoord: 5..334
IPR036875Zinc finger, CCHC-type superfamilySUPERFAMILY57756Retrovirus zinc finger-like domainscoord: 248..290

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG11G009740.1ClCG11G009740.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding