ClCG01G011440 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG01G011440
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Gag-pol polyprotein
Location: CG_Chr01: 18789907 .. 18795509 (-)
RNA-Seq Expression: ClCG01G011440
Synteny: ClCG01G011440
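As a quick sanity check on the Location field, the length of the gene span can be computed from its coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive at both ends (the usual GFF-style convention; the record itself does not state this). The function name is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_length = span_length(18789907, 18795509)
print(gene_length)  # 5603
```

The (-) strand flag does not change the span length; it only indicates that the gene is read on the reverse strand.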
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGGACAATCATCGTCTCCTAAGTACTATCTTTAAGGTGAAGAAGGAACTTAGAATAATAAAGGCCGAGCTCAAGTTAATGAGCAAGTTTGTTCGTATGATGAACTCAAGCACAAATGATTTGAACAAGGTCCTGTCCTCTGAAAAACAAGCTTCTGACAAGAGAGGAGTTGAGTTTTCTCAATCAAAATCTAATGAGCTCAAAGGAGAGTCTTCCCTATCAAAGGTTTTCGTTCATGCTCAACATGTTCAAACAACTCCTTATGACTACCAGCAGAAAGGTGTGTATGTTGAACCTACTCAACACAAATTTCAGAAGAAGTGGATTGGTCATTTTTGTGGAAGACTCGGTCATATTCGTCCTTACTGCTTTCGGCTGTATGGAAGTCGTGCTCATCAGAGATCTCGTAGGCTGCTTTTAAAGCTCATGGTTTGCTCAGCATGTTAAAAACTTGAGAAGCAACAAAATGGAGTGGAGAGTTAAAAATAGTGGTGAATACTTCAATAGAAAATGTCAGGTTGCTGTTACAACCTTACACTCGTCCACTCAGAAGGATTGGTACTTCGATAGCGGGTGCTCACGACACATGACAGGTGACAAATCATTTGTACTAGACATTCAACCATGTAATTCCGGTCATGTAATGTTTGGTGATGGTGCTACGGGAAAAGTGATCGGAAAGGGCAGATTGAACTATCTTGGTCTGCCCATTCTCAAGGAAGTTCTGGTGGAAGGATTAACAACTAACTTGATCAGCATAAGTCAATAGTGTGATCAAGGATTCTCTGTCAGCTTCTCTAAAAACAAGTGTACGGTGACCAACTCCAACGATACCTTTGTGATGGCAGGTATGCGCTCTTCTAATGATTGTTACTTATGGAACCTTGACCTTTCATCTTCTATGTGTAATTCTGTCAGAACAAATGAACCATCCCTCTGGTACAAGAGACTTGAGCACATAAATCTTTGTTCATTCAAGAAAGCTATTGCTGAAGAAGCTTTGTTGGGCATTCCTCAACTTGCAGGAGGCTGTGATGGTGTGTGTGGGGATTGTCAGGTAGGAAAGCAAGTCAACGCTTCACACAAAAGAGTTTCTGTATGCACAACAAATAGAGTATTGGAACTGTTGCATGTTGACTTGATGGGGCCGATGCAAGTTGAGAGTCTTGAAGGAACGAGATATGCTTTTGTTTGTGTTAATGACTTTTCCCGATATACATGGGTAAAGTTTATTTGTGAAAAATCAAATATGATTAGTGTTTGTCAGACTTTATGTTGACAACTTTAAAGAGAACAAAGGATGTCGATCATTCGTATACGAAGTGATCATGGTAAGGAGTTTGAGAATAGTCTGTTTGAAAAGTTTTGCAATTCAGAAGGAATTATGCATGAGTTCTCTTCTTCTATTACTCCTCAATAGAATGGAGTGGTTGAGAGGAAGAATCGTACACTGCAGGAAATGGCTAGAGCCATGTTGCATGGAAAGAATTTGCCTTCGTACTTTTGTGTGGAAGCACTAAACATAGCCTGTCATATACATAATCAAGTTTCTCTACGACCAGGTACCACTAAGACAAACTATGAAACATGGAGAATGAGGAAACCTAATGTCAAATACTTTCATGTGTTTGGAAGTGTGTATCACATCCTTGTTGATAGAGAATATCGAAAGAAGTGGCACTCTAAGTCTGATAGTGGATTTTTTTTTGGGATACTCCTTTAACAATCGGGCGTATAGAGTATTCAACAATCATACTCGTTGTATCATGGAGTCCATCAATGTTGTTATAGATGATTAAGACACTGATTGAGTCCTAGATGACGAAAGAGATGATCTTGTATCCCAGATGTTGCTCATGATGTTGCTGATAAGAAGCACGACATTACTCCCAATATTGACAATGATGAGAAAAGTGATTCTGATCTTGAAGAGTCTGTTTTCAACACTGTTAAGTGTTTCAGCAGGGTGAAAAAGAATCATCCCTCTGAAAATATTA
TAGGCGATCTGAGTTCTGGAGTAACTACTCGGAGGAAAGACAAAGTTGATTATCTGAACCTGATTGGGAATGGTTGCTTTATTTCGTCTATAGAACCCAAAAATATCAACGAAGCATTAAAGGACAAATTTTGGATGAATGCTATGCAGGAAAAGTTAGGACAGTTTGAGCGAAATCAAGTGTGGAAGCTTGTTCCACGTCCCGAGTCTGCTAATGTTATTGGTACCAAATGGGTGTTAAAAAACAAATCAAATGAGGATGGTGTTGTTATTCGTAACATAGCACGCATGGTGGCTCAAGGCTACTCACAAGTTGAAGGTGTTGATTTTCATGAGACGTTCGCTCTAGTTTCTAGATTAGAAGCCATACACCTTCTCTTTGGTGTTGCCTGTCTGCTCAAATTCAAGCTCTATTAGATGGATGTGAAAAGTGCTTTTCTGAATGGTTTTTTAAATGAAGATGTTTATGTAGAGCAGCCTAAAGGGTTCATTGATCCCTCATGCCCTCAACATGTGTATAAGCTTCAGAAGACTCTTTATGGTCTGAAGCAAGCTCCCAGGGCCTTGTATGATCGTCTTACAGAATTTCTTATTCATAAAGGATATTCTAGAGGAGGATCGGACAAAACTCTTTTTATTAAACAATCAAGAGAAGGATTTATCATTGCTTAGGTTATGTGGATGACATTGTGTTTGGAGAATCCTCTCAGACCTTAGTAAACCATTTTGTTGATTAGATGAAGAGTGAATTCGAAATGAGTATGGTCAGTGAGTTAACTTACTTTCTCGAGTTTCAGGTCAAACAATGACCTAATGTTATATTTATATCACAAGAAAAATATGTAAAGAACATGTTAAAGAAGTTTGGAATGGAGAAAGCCAACCCTAAGAGAACTCTAGCTCCCACCCATGCCAAGCCATCAAGGGACTTAGAAGGTGAAAATGTTGATGAAAGATTGTATTGAAGCATGATAGGCAATCTGTTGTATTTAATTGCTAGTCGGCCTGATATAAGTTATGTTGTGGGTGTTTGTGCTCGTTTTCAGTCTAATCCCAAAGTCAGTCATATGCCAAGTGTGAAAAGAATTCTCAAGTATATTTGTTAGGACCAGTGACTATGGGTTGTTGTACTCCTCAAACACGAGAGAGATCTTAGTCGGATTTTGTGATGCTGACTGAGCTTGCAGCTCCGATGACAGAAAAAGCACCTCAGGTGGTTGTTTCTTTCTTGGAAACAATCTAATATCCTGGTTCAGTAAGAAACGGAATTATGTTTCTTTGTCTACAATAAATACATAACAGTGGCAAGTAGTTGTTCATAGTTTCTATGGATGAAACAAATGCTGCAGGAGTACAATGTTACACAAGATGTCATGACCTTGTATTACCACAATATGAGCACAATTAATATTTTAGGAATGTGGCGCACATAGTAGGACCAAACACATTCACCTCAGACACCATTTCATTAGGGACTTAGTCGAGACTAAAGTCATTACTCTTGAACATGTTCGAAGTGACAATCAACTGGCAGATATTTTTACAAAACCTCTTGATGCAAGCTGTTTTGAGTCATTAAGGGATGCTCTGGGATTATGCTGAATTTCGTTATAGCAATTAATTCCTATTTTGGAAGGGCTGGTTTATTGGGCCATGAAGTGCATGCCACAGCCCAAGGAAAGATTTCGAATCTTTGCATTTAATGGTTTATATCTTCCCTTTCTCATGATGGCTAGTTTTTCAAATGCTCAAACGGGTTGCAACGTTTTTCAGTTGTCTTCTCTTCACCTTCTTGTGCCCTATTTCGCTAAAAGCACTTCTTAGAATGCAGTGCCTTCCCGAGAACAACGTATGCTTCAGTGCAGTGCCATCATAGCTTTCCATGATGGTCAGCACCGCGGCGCCAACTTTCTTAATTTGAAGATTTGGTATTTTTGGAAAAAATTTTGGACGGCCAACTCTATAAATAAGAGGGTTTCTAGCAGCTCTAATGACCAAGTT
ACACACCAAGAGACACAAGAAGTTCATTAGAGAGTTTAAAGAGTGAGTTAAATGCAAGGAAGAGGAAAAGAGTTCGAAAGAGAGTTGTGGTGCCATTCGAGTTCTGACGTGAAGGTCAAAGAGATCGTGACAGTGCTTCCTCACTCAAAAGCCTCTTTTCAGTAGAGTTTTTCTTTCTTTATACTCTTTCTTTGTTTATATTGTTTGAACATGTGCTTCAAGCTATGAGTGGCTAAACTCTTTAGTTTTAGGGTTTTGATGAATTGTTTATAAAGTTATGTTGATATTGTAGACTCTTCATGGTTTTGTTGTAATGATTATGCTTTTCTTTGTTATTGTGCTTAATTCTTGTTGAATGGCCCATATTTGAATGATAGGTTAGATGTATGATTGAAAGATTGTATGTTTAACTAGATCTATAACAAAGAATCGTTGCTAATCTCTCATGAAAATAGTTAGGTTAACAATGAAAGTTGCTAAAGATTATACTTAATGCCATGTAATTTGTATTGATTTCAAGAGATTAGTAATCAGTTGAGTTGCATGAGAAAAGGTTGTTTTTTAGAGAAAATAACTTAGATTTATGTTTCTAGACTTAAGCATAAGAATTGCAATAAAATCTATGAACCCTAGAATATTGCATGATGAGGTAAACAAGGATTCATTACCCTAGAACTCTCTTGGACTCTTGTGTTTCTTATCTATTTACTTTCCTTTTTAGTTTTTCCAAATTCACAACTTAAATTCCAATTCTCAAATAAAATTGATTTGATTGTAGAATAGGTAAGTAATTTAACCTTCACAATTGGATCCTCAGTGTAACGATATTCGGACTTTACCGTCTTATATTATAACTTGACCCGTACACTTGCAGTTAACAAGGAAAGTGCACATCATAAACCCTCTTGCGCTGATTTTGTGGCTTGCTAATAGGGAGTCTACTATCCCTTTTGTGCCCCAAGGTTCTAGTTTATCTTCTTGTGTTGATAAGGATCCTACTTCTGATTTGTCTAAAAGCTTAGTTGTGTATACTGATTTTTCTCATGATGCTTCTCTTCTGAATGATCCTTCTATAAGTGGTATGATTGATGGTGTGTCGTCTATTTCTGATATTCCTTATACTTCCTTGTATGTTGAGTCTCCTGTTTTTGAAAAGGAGTCTAATGTTGTGCATGATTTCGTTGCTGTTGGTGATAATAAGAACATTGATGTTGAGCGTCCATCTGCTGTTGAAATACTTGCTGATGTTGCTATTGATGTTGGTGACATTTTTGTTGATACTAATCCTGTCTCTCTTGGTCATCCAAGTGTATCATCTTTTGGGAATGATGCCCTTGATCATTCTCCCCATTCTCTCGCTGATGAAACCCAGTCTGTTTCTATTGATGCTTCTCCGAGTAGTTTCGATGATGATGATGTGTCGATTGCCTCTCTTGAAAAATCTTGTGCTCCCAAGTTCTCTCATTCTGCTCCTACGGATGTTAGGCCTTCCTCCCATACTCACTCTAGCCCGTCTAAGGTTGCCAAGCAACCAAGTGGGTCCGATTCAGACTATCATCTTCAGTCTAGCTCTAAGGATGATATTGCATTGCCTTATCCCTGA

mRNA sequence

ATGGAGGACAATCATCGTCTCCTAAGTACTATCTTTAAGGTGAAGAAGGAACTTAGAATAATAAAGGCCGAGCTCAAGTTAATGAGCAAGTTTGTTCGTATGATGAACTCAAGCACAAATGATTTGAACAAGGTCCTGTCCTCTGAAAAACAAGCTTCTGACAAGAGAGGAGTTGAGTTTTCTCAATCAAAATCTAATGAGCTCAAAGGAGAGTCTTCCCTATCAAAGGTTTTCGTTCATGCTCAACATGTTCAAACAACTCCTTATGACTACCAGCAGAAAGGTGTGTATGTTGAACCTACTCAACACAAATTTCAGAAGAAGTGGATTGGTCATTTTTGTGGAAGACTCGGTCATATTCGTCCTTACTGCTTTCGGCTGTATGGAAGTCGTGCTCATCAGAGATCTCGTAGGCTGCTTTTAAAGCTCATGGATCCTACTTCTGATTTGTCTAAAAGCTTAGTTGTGTATACTGATTTTTCTCATGATGCTTCTCTTCTGAATGATCCTTCTATAAGTGGTATGATTGATGGTGTGTCGTCTATTTCTGATATTCCTTATACTTCCTTGTATGTTGAGTCTCCTGTTTTTGAAAAGGAGTCTAATGTTGTGCATGATTTCGTTGCTGTTGGTGATAATAAGAACATTGATGTTGAGCGTCCATCTGCTGTTGAAATACTTGCTGATGTTGCTATTGATGTTGGTGACATTTTTGTTGATACTAATCCTGTCTCTCTTGGTCATCCAAGTGTATCATCTTTTGGGAATGATGCCCTTGATCATTCTCCCCATTCTCTCGCTGATGAAACCCAGTCTGTTTCTATTGATGCTTCTCCGAGTAGTTTCGATGATGATGATGTGTCGATTGCCTCTCTTGAAAAATCTTGTGCTCCCAAGTTCTCTCATTCTGCTCCTACGGATGTTAGGCCTTCCTCCCATACTCACTCTAGCCCGTCTAAGGTTGCCAAGCAACCAAGTGGGTCCGATTCAGACTATCATCTTCAGTCTAGCTCTAAGGATGATATTGCATTGCCTTATCCCTGA

Coding sequence (CDS)

ATGGAGGACAATCATCGTCTCCTAAGTACTATCTTTAAGGTGAAGAAGGAACTTAGAATAATAAAGGCCGAGCTCAAGTTAATGAGCAAGTTTGTTCGTATGATGAACTCAAGCACAAATGATTTGAACAAGGTCCTGTCCTCTGAAAAACAAGCTTCTGACAAGAGAGGAGTTGAGTTTTCTCAATCAAAATCTAATGAGCTCAAAGGAGAGTCTTCCCTATCAAAGGTTTTCGTTCATGCTCAACATGTTCAAACAACTCCTTATGACTACCAGCAGAAAGGTGTGTATGTTGAACCTACTCAACACAAATTTCAGAAGAAGTGGATTGGTCATTTTTGTGGAAGACTCGGTCATATTCGTCCTTACTGCTTTCGGCTGTATGGAAGTCGTGCTCATCAGAGATCTCGTAGGCTGCTTTTAAAGCTCATGGATCCTACTTCTGATTTGTCTAAAAGCTTAGTTGTGTATACTGATTTTTCTCATGATGCTTCTCTTCTGAATGATCCTTCTATAAGTGGTATGATTGATGGTGTGTCGTCTATTTCTGATATTCCTTATACTTCCTTGTATGTTGAGTCTCCTGTTTTTGAAAAGGAGTCTAATGTTGTGCATGATTTCGTTGCTGTTGGTGATAATAAGAACATTGATGTTGAGCGTCCATCTGCTGTTGAAATACTTGCTGATGTTGCTATTGATGTTGGTGACATTTTTGTTGATACTAATCCTGTCTCTCTTGGTCATCCAAGTGTATCATCTTTTGGGAATGATGCCCTTGATCATTCTCCCCATTCTCTCGCTGATGAAACCCAGTCTGTTTCTATTGATGCTTCTCCGAGTAGTTTCGATGATGATGATGTGTCGATTGCCTCTCTTGAAAAATCTTGTGCTCCCAAGTTCTCTCATTCTGCTCCTACGGATGTTAGGCCTTCCTCCCATACTCACTCTAGCCCGTCTAAGGTTGCCAAGCAACCAAGTGGGTCCGATTCAGACTATCATCTTCAGTCTAGCTCTAAGGATGATATTGCATTGCCTTATCCCTGA

Protein sequence

MEDNHRLLSTIFKVKKELRIIKAELKLMSKFVRMMNSSTNDLNKVLSSEKQASDKRGVEFSQSKSNELKGESSLSKVFVHAQHVQTTPYDYQQKGVYVEPTQHKFQKKWIGHFCGRLGHIRPYCFRLYGSRAHQRSRRLLLKLMDPTSDLSKSLVVYTDFSHDASLLNDPSISGMIDGVSSISDIPYTSLYVESPVFEKESNVVHDFVAVGDNKNIDVERPSAVEILADVAIDVGDIFVDTNPVSLGHPSVSSFGNDALDHSPHSLADETQSVSIDASPSSFDDDDVSIASLEKSCAPKFSHSAPTDVRPSSHTHSSPSKVAKQPSGSDSDYHLQSSSKDDIALPYP
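The relationship between the coding sequence and the protein sequence above can be illustrated with a small translation routine. This is a sketch, assuming the standard nuclear genetic code applies; the names `CODON_TABLE` and `translate` are hypothetical helpers and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGAGGACAATCATCGTCTC"))  # MEDNHRL
```

Applied to the last four codons of the CDS (`CCTTATCCCTGA`), the same function yields `PYP`, which matches the end of the protein sequence; the trailing `TGA` is the stop codon.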
Homology
BLAST of ClCG01G011440 vs. NCBI nr
Match: KAA0043382.1 (gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 75.1 bits (183), Expect = 1.3e-09
Identity = 48/135 (35.56%), Positives = 78/135 (57.78%), Query Frame = 0

Query: 1   MEDNHRLLSTIFKVKKELRIIKAELKLMSKFVRMMNSSTNDLNKVLSSEKQASDKRGVEF 60
           ME+N   LS+I  +K EL+  + + + +SK V+MM   T+ L+ +L   K+ +DKRG+ F
Sbjct: 174 MEENQSFLSSIVTLKAELKEARNQFEELSKSVKMMTGGTHKLDDLLGQGKRCNDKRGLGF 233

Query: 61  SQSKSNELKGESSLSKVFVHAQHVQTTPYDYQQKGVYVEPTQH--------KFQKKWIGH 120
           S     E+  + +   VFVH    ++  YD Q+K    + T+           +K+WI +
Sbjct: 234 S-----EIGYDRTKKIVFVH----ESNSYDDQRKITKEKRTEDTTPSIKSLNRRKRWICY 293

Query: 121 FCGRLGHIRPYCFRL 128
           FCG++GHIRPYC++L
Sbjct: 294 FCGKIGHIRPYCYQL 299
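The percentages reported for each HSP are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts from the HSP above (the helper name `percent` is illustrative):

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of aligned positions, as a percentage rounded to two decimals."""
    return round(100.0 * matches / alignment_length, 2)


print(percent(48, 135))  # 35.56 (identity)
print(percent(78, 135))  # 57.78 (positives)
```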

BLAST of ClCG01G011440 vs. NCBI nr
Match: KAA0061122.1 (gag-pol polyprotein [Cucumis melo var. makuwa] >TYK03793.1 gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 66.6 bits (161), Expect = 4.7e-07
Identity = 47/143 (32.87%), Positives = 79/143 (55.24%), Query Frame = 0

Query: 1   MEDNHRLLSTIFKVKKELRIIKAELKLMSKFVRMMNSSTNDLNKVLSSEKQASDKRGVEF 60
           ME+N   LS+I  +K EL+  K + + ++K+V+M+ + T  L+ ++   K+  DKRG+ F
Sbjct: 199 MEENQSFLSSIVTLKTELKEAKNQFEELTKYVKMLTNGTKKLDDLIGQGKRYDDKRGLSF 258

Query: 61  SQS--KSNELKGESSLSKVFVHAQHVQTTPYDYQQ----KGVYVEPTQHKF-QKKWIGHF 120
           S+     NE+K       +FV  +  Q    +  +    K V V P + +F +K+ + HF
Sbjct: 259 SEKGITGNEVK------TIFVRERSTQNNEAENGKVKFPKNV-VPPVRFQFRRKRRVCHF 318

Query: 121 CGRLGHIRPYCFRLYGSRAHQRS 137
           CG+ GHIRPY F+L     H+ +
Sbjct: 319 CGKDGHIRPYYFQLQSLMFHEHA 334

BLAST of ClCG01G011440 vs. NCBI nr
Match: KAA0046617.1 (gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 64.7 bits (156), Expect = 1.8e-06
Identity = 42/138 (30.43%), Positives = 65/138 (47.10%), Query Frame = 0

Query: 1   MEDNHRLLSTIFKVKKELRIIKAELKLMSKFVRMMNSSTNDLNKVLSSEKQASDKRGVEF 60
           MEDNHR LS+I   + EL+    E + +SK+V+M+ S T +L  +L+  K +S+K  + F
Sbjct: 59  MEDNHRNLSSIATHRAELKEAHHEFESLSKYVKMLTSGTQNLENILNDGKSSSNKMKLGF 118

Query: 61  SQSKSNELKGESSLSKVFVHAQHVQTTPYDYQQKGVYVEPTQHKFQKKWIGHFCGRLGHI 120
           S+ K                                          KKW+ H+CGR GH+
Sbjct: 119 SEVK------------------------------------------KKWVCHYCGRPGHL 154

Query: 121 RPYCFRLYGSRAHQRSRR 139
           RP+C+ L+G   + +S R
Sbjct: 179 RPFCYCLHGFPLYGKSAR 154

BLAST of ClCG01G011440 vs. NCBI nr
Match: MCH95552.1 (gag-pol polyprotein [Trifolium medium])

HSP 1 Score: 64.7 bits (156), Expect = 1.8e-06
Identity = 46/138 (33.33%), Positives = 77/138 (55.80%), Query Frame = 0

Query: 3   DNHRLLSTIFKVKKELRIIKAELKLMSKFVRMMNSSTNDLNKVLSSEKQASDKRGVEFSQ 62
           +   LLSTI  +  E+ ++ ++++ M K VRMMN+ T+ L ++L   +++  K+G++F  
Sbjct: 52  EKSELLSTISGLNDEVTLLNSKIEHMRKQVRMMNNVTDMLEEILEVGQKSGKKKGIDFDY 111

Query: 63  SKSNELKGESSLSKVFVHAQHVQTTPYD-YQQKGVYVEPTQHKFQK------KWIGHFCG 122
              N  K +   +K FV ++      YD    K ++  P +H+  K       WI H+ G
Sbjct: 112 QPMNTQKQKP--AKDFVPSE----GKYDPTMSKLMFQHPKRHQGTKTKTKPQPWICHYYG 171

Query: 123 RLGHIRPYCFRLYGSRAH 134
           R G+IRP+CF+LYG  AH
Sbjct: 172 RKGNIRPFCFKLYGFIAH 183

BLAST of ClCG01G011440 vs. NCBI nr
Match: PNX91973.1 (gag-protease polyprotein [Trifolium pratense])

HSP 1 Score: 64.7 bits (156), Expect = 1.8e-06
Identity = 44/130 (33.85%), Positives = 78/130 (60.00%), Query Frame = 0

Query: 8   LSTIFKVKKELRIIKAELKLMSKFVRMMNSSTNDLNKVLSSEKQASDKRGVEFSQSKSNE 67
           L  I ++  E+ ++ ++L+ MSK VRM+NS T+ L ++L   ++  + +G+ F+    N+
Sbjct: 379 LKKISELNDEVILLNSKLEHMSKQVRMLNSGTDTLEEILEVGQKPGNPKGIGFNYDSMNK 438

Query: 68  LKGESSLSKVFVHAQH-------VQTTPYDYQQKGVYVEPTQHKFQKK-WIGHFCGRLGH 127
            K +SS++K FV ++         Q  P+  + +G     T+ K + K WI H+CG+ GH
Sbjct: 439 -KSQSSVTK-FVSSKEKYDPTMSEQMLPHPKRHQG-----TKSKGKSKPWICHYCGKKGH 498

Query: 128 IRPYCFRLYG 130
           I+P+CF+LYG
Sbjct: 499 IKPFCFKLYG 501

BLAST of ClCG01G011440 vs. ExPASy TrEMBL
Match: A0A5A7TJC1 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold588G00490 PE=4 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 6.5e-10
Identity = 48/135 (35.56%), Positives = 78/135 (57.78%), Query Frame = 0

Query: 1   MEDNHRLLSTIFKVKKELRIIKAELKLMSKFVRMMNSSTNDLNKVLSSEKQASDKRGVEF 60
           ME+N   LS+I  +K EL+  + + + +SK V+MM   T+ L+ +L   K+ +DKRG+ F
Sbjct: 174 MEENQSFLSSIVTLKAELKEARNQFEELSKSVKMMTGGTHKLDDLLGQGKRCNDKRGLGF 233

Query: 61  SQSKSNELKGESSLSKVFVHAQHVQTTPYDYQQKGVYVEPTQH--------KFQKKWIGH 120
           S     E+  + +   VFVH    ++  YD Q+K    + T+           +K+WI +
Sbjct: 234 S-----EIGYDRTKKIVFVH----ESNSYDDQRKITKEKRTEDTTPSIKSLNRRKRWICY 293

Query: 121 FCGRLGHIRPYCFRL 128
           FCG++GHIRPYC++L
Sbjct: 294 FCGKIGHIRPYCYQL 299

BLAST of ClCG01G011440 vs. ExPASy TrEMBL
Match: A0A5D3BVP0 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold863G001810 PE=4 SV=1)

HSP 1 Score: 66.6 bits (161), Expect = 2.3e-07
Identity = 47/143 (32.87%), Positives = 79/143 (55.24%), Query Frame = 0

Query: 1   MEDNHRLLSTIFKVKKELRIIKAELKLMSKFVRMMNSSTNDLNKVLSSEKQASDKRGVEF 60
           ME+N   LS+I  +K EL+  K + + ++K+V+M+ + T  L+ ++   K+  DKRG+ F
Sbjct: 199 MEENQSFLSSIVTLKTELKEAKNQFEELTKYVKMLTNGTKKLDDLIGQGKRYDDKRGLSF 258

Query: 61  SQS--KSNELKGESSLSKVFVHAQHVQTTPYDYQQ----KGVYVEPTQHKF-QKKWIGHF 120
           S+     NE+K       +FV  +  Q    +  +    K V V P + +F +K+ + HF
Sbjct: 259 SEKGITGNEVK------TIFVRERSTQNNEAENGKVKFPKNV-VPPVRFQFRRKRRVCHF 318

Query: 121 CGRLGHIRPYCFRLYGSRAHQRS 137
           CG+ GHIRPY F+L     H+ +
Sbjct: 319 CGKDGHIRPYYFQLQSLMFHEHA 334

BLAST of ClCG01G011440 vs. ExPASy TrEMBL
Match: A0A2K3MMD3 (Gag-protease polyprotein OS=Trifolium pratense OX=57577 GN=L195_g015102 PE=4 SV=1)

HSP 1 Score: 64.7 bits (156), Expect = 8.7e-07
Identity = 44/130 (33.85%), Positives = 78/130 (60.00%), Query Frame = 0

Query: 8   LSTIFKVKKELRIIKAELKLMSKFVRMMNSSTNDLNKVLSSEKQASDKRGVEFSQSKSNE 67
           L  I ++  E+ ++ ++L+ MSK VRM+NS T+ L ++L   ++  + +G+ F+    N+
Sbjct: 379 LKKISELNDEVILLNSKLEHMSKQVRMLNSGTDTLEEILEVGQKPGNPKGIGFNYDSMNK 438

Query: 68  LKGESSLSKVFVHAQH-------VQTTPYDYQQKGVYVEPTQHKFQKK-WIGHFCGRLGH 127
            K +SS++K FV ++         Q  P+  + +G     T+ K + K WI H+CG+ GH
Sbjct: 439 -KSQSSVTK-FVSSKEKYDPTMSEQMLPHPKRHQG-----TKSKGKSKPWICHYCGKKGH 498

Query: 128 IRPYCFRLYG 130
           I+P+CF+LYG
Sbjct: 499 IKPFCFKLYG 501

BLAST of ClCG01G011440 vs. ExPASy TrEMBL
Match: A0A392N7J1 (Gag-pol polyprotein (Fragment) OS=Trifolium medium OX=97028 GN=A2U01_0016531 PE=4 SV=1)

HSP 1 Score: 64.7 bits (156), Expect = 8.7e-07
Identity = 46/138 (33.33%), Positives = 77/138 (55.80%), Query Frame = 0

Query: 3   DNHRLLSTIFKVKKELRIIKAELKLMSKFVRMMNSSTNDLNKVLSSEKQASDKRGVEFSQ 62
           +   LLSTI  +  E+ ++ ++++ M K VRMMN+ T+ L ++L   +++  K+G++F  
Sbjct: 52  EKSELLSTISGLNDEVTLLNSKIEHMRKQVRMMNNVTDMLEEILEVGQKSGKKKGIDFDY 111

Query: 63  SKSNELKGESSLSKVFVHAQHVQTTPYD-YQQKGVYVEPTQHKFQK------KWIGHFCG 122
              N  K +   +K FV ++      YD    K ++  P +H+  K       WI H+ G
Sbjct: 112 QPMNTQKQKP--AKDFVPSE----GKYDPTMSKLMFQHPKRHQGTKTKTKPQPWICHYYG 171

Query: 123 RLGHIRPYCFRLYGSRAH 134
           R G+IRP+CF+LYG  AH
Sbjct: 172 RKGNIRPFCFKLYGFIAH 183

BLAST of ClCG01G011440 vs. ExPASy TrEMBL
Match: A0A5A7TXF9 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold114G001660 PE=4 SV=1)

HSP 1 Score: 64.7 bits (156), Expect = 8.7e-07
Identity = 42/138 (30.43%), Positives = 65/138 (47.10%), Query Frame = 0

Query: 1   MEDNHRLLSTIFKVKKELRIIKAELKLMSKFVRMMNSSTNDLNKVLSSEKQASDKRGVEF 60
           MEDNHR LS+I   + EL+    E + +SK+V+M+ S T +L  +L+  K +S+K  + F
Sbjct: 59  MEDNHRNLSSIATHRAELKEAHHEFESLSKYVKMLTSGTQNLENILNDGKSSSNKMKLGF 118

Query: 61  SQSKSNELKGESSLSKVFVHAQHVQTTPYDYQQKGVYVEPTQHKFQKKWIGHFCGRLGHI 120
           S+ K                                          KKW+ H+CGR GH+
Sbjct: 119 SEVK------------------------------------------KKWVCHYCGRPGHL 154

Query: 121 RPYCFRLYGSRAHQRSRR 139
           RP+C+ L+G   + +S R
Sbjct: 179 RPFCYCLHGFPLYGKSAR 154

The following BLAST results are available for this feature:
vs. NCBI nr
Match Name    E-value   Identity (%)  Description
KAA0043382.1  1.3e-09   35.56         gag-pol polyprotein [Cucumis melo var. makuwa]
KAA0061122.1  4.7e-07   32.87         gag-pol polyprotein [Cucumis melo var. makuwa] >TYK03793.1 gag-pol polyprotein [...
KAA0046617.1  1.8e-06   30.43         gag-pol polyprotein [Cucumis melo var. makuwa]
MCH95552.1    1.8e-06   33.33         gag-pol polyprotein [Trifolium medium]
PNX91973.1    1.8e-06   33.85         gag-protease polyprotein [Trifolium pratense]

vs. ExPASy TrEMBL
Match Name    E-value   Identity (%)  Description
A0A5A7TJC1    6.5e-10   35.56         Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold588G...
A0A5D3BVP0    2.3e-07   32.87         Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold863G...
A0A2K3MMD3    8.7e-07   33.85         Gag-protease polyprotein OS=Trifolium pratense OX=57577 GN=L195_g015102 PE=4 SV=...
A0A392N7J1    8.7e-07   33.33         Gag-pol polyprotein (Fragment) OS=Trifolium medium OX=97028 GN=A2U01_0016531 PE=...
A0A5A7TXF9    8.7e-07   30.43         Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold114G...
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 257..281
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 298..347
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 305..333
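Two of the predicted disorder regions listed above overlap (298..347 and 305..333), so summing the three ranges directly would double-count residues. The sketch below merges overlapping ranges before counting. It assumes the coordinates are 1-based, inclusive residue positions, and it also merges directly adjacent ranges, which is a design choice of this example rather than anything stated in the record. The function name is illustrative.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous range: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(r) for r in merged]


# Disorder predictions from the InterPro annotations of this record.
regions = [(257, 281), (298, 347), (305, 333)]
merged = merge_intervals(regions)
print(merged)  # [(257, 281), (298, 347)]

# Number of distinct residues covered by at least one prediction.
print(sum(end - start + 1 for start, end in merged))  # 75
```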

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG01G011440.2  ClCG01G011440.2  mRNA