Clc11G09160 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc11G09160
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: Gag-pol polyprotein
Location: ClcChr11: 11472008 .. 11478328 (-)
RNA-Seq Expression: Clc11G09160
Synteny: Clc11G09160
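The Location field packs the chromosome, the start and end coordinates, and the strand into one string. The sketch below shows one way to split it apart and derive the span of the gene. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed feature location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings shaped like "ClcChr11: 11472008 .. 11478328 (-)".
_LOC_RE = re.compile(r"^\s*(\S+?):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a location string into chromosome, start, end and strand."""
    match = _LOC_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("ClcChr11: 11472008 .. 11478328 (-)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 6321 bp on the minus strand of ClcChr11.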
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCACACCAGAGTTTGGGTACTCCAGTCTTCTCATATTCATATGTTTCTACCATGTCCAATACCGGGGAGAACACTAGATTTACCTTTGGAACCTGGAAAGAAGGTTGGCGTGTACATAGAGATCAAAGTTCCATTAGCTTCCAGAGGTCGACCTTGCCCCGTGCTACGATATTCAACTCGAAAAGTATCCTTGAAATCTACCAATATATTCCTTCACATTCGAATTCATGTCTTACGATTTTGTTTTCCTTCTTTATTGTTTGCACCCATTTTTGGAAACTAATGTATTTGCCTTGTGATGATAGTGATCCCATCTCAAACATGCACTCTCCTTTGATGGCATTGTAACAATTGTATACTTTCATTTAAAATAGAGGAGACTGGTAAATTTTTTTAAGGGTTTAAATTGTAATTTTGCCCAATTTTCCCCTTCTGATGTTGTTTTGGGCAAATTTAGCTCTACAAGGATAATGTAGAGATTATGATTTGTTATACATGTTGGTTTGCTATGATTCTCCATACCCTGTGGATTATAGCATCAAGCAACAGGCAATTTTTTTTGTCAGCTCACATTAAATGATTACTCCCAGATGACAATTTTATTCATAAAGCCAATACAAAGAGGGAAAAAGGTTAGATTCACAATTCAAATCTCACATCTTTCTATGTTGCATAAATTCGAGGTTATAGCTATACTTGAGTGATCCATTAAGAGGAAAAATGGGAATTGGTTAGGATGGTTGTGGAGGTTGTTTAAATATCTGTTTCAATTCCTTATGAATTATTTTGGCATTTAATGTAATAAGTAAACATCAGTATTTAATCTAAACTAACCAAACGGTTGGTAGAGAACATAATTTCTCAAAAGACAAAAACAATAATCCCTTCCATCTTAATAATAATAATAAGAATAATAATAAAGCAATGAAACTGATGGGTGTTAGCCTCAATTTATTTACGATTAAATACACTTTTGGTAAGAAAATGAGGTGGCAATGGTGATGTGGCAAATTCTAGGTGATGTGGCAAAATTGATTTATTTTATCTCATTTGATTTAATTTTATTAATATTATTATTATTATTATTTTATTCTTTTTCTCTTTTCCCTACTTTCTCTGTCCCCTACTCCTTTTTTGAATCTCGGAAGGTGTTACTGCCGCCGATCACCAATTTTCTTCATCTCCTTCTCGATAAGTACTTTCTCCTGTGTCCAGATATTCCTACTAGGTGGGAACATAAGTAATTACAATGAAAAGTACTAACCAACAATTCCCAGAATTGAACTCAATGTTTTGAAACAGTATTTTTATTTATTTATTTATTTATTTTTATTTTTAGTTTGCTTCAATCTAATCTCTGGTGTAGTTTATTTGAAAGAAGTCAAGTAATAAATCATAGACTTTGTATTGTCTTAGCATATTTTCTTCTTCCAAATTGGCATCTTTTCCCTCTCTTCATCGATTAGGTGTTTTCTTTTTGTTCTCATTATACGGTTTCTTGAAAAACAAATTCGATGATCTGGTGAATCTAGAAAGATGATTTCAATTTGTCGAGGGCTACCTCACGACAATGGATTCTAGGGATTTTGATTTCTAGCCCAAAATTTAAATTCACTCTAAAGTTGGCCTTTTGCTTACGTCGACTGCATGTGAAAATACCATTTTGTCCCAGAAAATGCACGAGAGGGTTGCAGGCAATCACGTTTTCTCGTTCTTTTCTCTCGCTCCACTTTCTTCCTCTAGTTAAAAAATTATCGTTCCATCGTCCATTAGACAAATCGGAGACATTCACAATTTAACTTATATTTAACTCTCATTATGATGTCGAACTTTGAATAAATGACATTTGTAAGTTCATATACTGTAGCATTCAATTGACTATAATTTAATGTGCTTGGTACCATGATATCCTTCAACCGACCTCCCTTTTAGCATTCTTCTTTGTCATCCCATGTGCCATCATACTAAATAAAAAGTTGTTTCTCATCCATCCTTTAG
AAAAAAAAAAGATGAAGAAAGAAAATTACAATTCTATAAAATTCCTGCTCTCGTTTCTTGCTCTCCCTTGCATCTCTTGTTTTCATGATTCTTCCATCCATTTCTTCCTACACATACTCGCTCCTATAGAATCTCAATTCTTCCTCTAGCTTATACCTTTGTCACTCTAATACACTCATTTTCTCTCTCTCATTACGTGATAAACAATAAACAAATAAAAGGATTAACATTTATTTACTTGACGTGTGTAATTGAATGTATATGATGATTCAACAACCTTTACAATGTTAGAATGTTGGGTACTAGCACTTTGCAGCTGAATAGAAAATGGGTTGTCCACAACCAATGAATTCGGTCTAATGGGTTTCTTCCATATTTTGTGCTTTATTTAAAATAGTGTAGACTTAGCTTGGGAAACACTATTCCTTAATTTGATTTGAAAGAATCATAGTATTTTTTTTTTTTTTTTTTTTTTGCATTTTTAAGCAAATTCATACAGAGATCATTAAATTAATCGCAGAAAACAATTGGCGGGAAAGTGTCTAATGTGGAATGAGAATTTTTTAATGAGAGGAAGAAATGAGAGGAAGTAACGAGAAAATGTGCGGTTGTCTGCAACCTTCACGCGTTTCCAGGACAAAAATGACATTTTCTCACGTAGTTGACACAAGCAGAAGTCCAACTCGAGTGAATTTAAGTTTTGGGCCAAAAATCAGTTTGGTCGGCCAACCGTACAAAAATCCCATGGATTGTAGGTTGAGATACATAATATTTGGGGGTGTAAAAGGAAAAAAAAAGATAAAATAAAATAAAAACAAATCAAAATTACCACATCACTTTTGCCACCTAAATTTAAGGGTCAACTTGAACGTAAAAATTATTTTGAACTATTTTTCAAATCTCAATAACTAAAAAACTTAGGGCAACCAAACCTAAAGAATCAAAATGTACTTTACCCATTTGTTTACGAAGAAGCAGTGGGGAGGATTCAAGAACACCCCTACCTCTCTCTCCAAGATCTCATCAGGTACAAGAAAATTCTGCAAAAACTGTCCATCCCCTCCCCAGCCTCTCGCCCGATATTCATTAGGGCCCACCCAGCCTACACATTTTACTTTACTAACATACACTTTTAGTAATTGGGATCTATAAAAAAAACAAACTTGGGTTCAGGCATCTTCTGCTTAGAGGATTTAATGTGAGAAACCCCTCTAACTATATTGCATGCTTTTGATGGAATGTTTTTATTTTGACCTTCCTTCCAATCTTATCTATGCAAAATACATCATCCTCAACCATAAACAATATCTCTCCCTCTTTAGGAGCCATTTGGTTCCACCTTTCAAAGATATACGAGTAACTATGTTTGGTTGTACATGAAATCAACCACTATAATTTTATATAATTGCAATAGATTAATTAATTAAAAAAATAAATTTCATTCAATAGGTTGATAAAAGCTAATTAATCTAATAATAATATTTTTAATTATAAATATTTAAAAATTGAGAAATTTATCATTAAAATTAGCTAATTAATATATTTAATCAACGTCATATTAATTATAATCTAATACTTAATCATTTTTATAATTAATTAAGTTCAACATTGGTAATCTTAATTTTAATTAAATAGTATAAGAAAATAAATTTTAGGGTTGTTTTCAAATATAGGAACATTAACTAAAATATTTACAAATATAACAAAATTTTACGGTCTACTATAGCACACTTAAAGCGTGAGCTGAATTAGCAAGAGTTGAGAAAGATTTGTATGCAAACTGTTCGAATGTTGAACTCTAGCACTAAGGTTATGGAAGATATTCTAACAAAAGGCAAGTCAGCTGGTGATAACACTGGTCTTGACTTTACTAAGTTTGAAAACGTGCATGTAAAACAAGCTTAACACAAGCGCAAACAAAAGATAGAATTTATCAGAGCTTAGAATCTAAGCTCTTATGTTACTCTAGGAGTCTCTTCTAATACTTTGGATATGAAAGT
CAATCAAACCTACAGAACAAGGTGGATATACTGTGGTAAATGGGGTCACAAACAACCTTTTTGTTTTCAGCTACTTGGATAGCCTCAAGTCTTCAAATTCTATTAGGGCTTTCATTTCTACAGAGTAGACATCTCTGTGAGCCCAAACTAAGACAAGAGTGGAGAGTCAAGTCTGCAAAGATCCAAGTGTAATGTTGTGTTGACATCTTTCAGATCCTTAGCCAAAGGAGATTGGGTCTTTGATAGTGGAAGCTCAAGGTATATGATAGGAGAAAAGAGTTACTTGTCAGATTTGAAATCTATTAGTGATAGAGACAACCGCAAGTGCACGAGTCAAGGTATAATATAAAACAATTAAGATCGAGTATCGTATCCACCGAGGACTGAATTCAATATAATTACCTATTATTCAAGAATTGAATCAATTTTATCAAAGGAATCGATAAAATTACTTTATTAAGTCATGCTTTAGGTCTTCATGGAATTATGTGCAAACAATATGTTTATGTTTAGAAACCAAAATCTAGATTATTTTCTCTAAAAAACAACTCTTTCTTATGCAACTTAATTGATAACTGATTTTTTGAAACTAAATTAAAAGGCATAGCATTAAAAATAGTCATGAACAACTTTCTATATTAACCTAGCTATTTTTGTTAAAGATTAATAAAGATTCAATGTCATAGATCCAATTAGATATACAATTTCTCAACCATACATTTAACTCAACAATTATTGATGATCAATCAATAAAGAATAAGACGCAATAGAATTGAAAAACATATCATTCAGTAACATCCATGAAAAACACTAAGACATTAACTTAAATTCTTAATTAAATCATCAAACACTCTAAAACTATGAGTTTAGCCACACATATCCATCAAATACATACTTTCATATGAGTTCTTAAGCATAAGAGTAAAAGAGAAAGATAAACTTAGGAAGAAAAACTCTTGAGTTGCTTCCTAACATTGAACTCCTTCAAATCTCCGCTTAGTCGGCACCTCGGGATCTCGAACGACCTTCAACTCTTTGCCAACACTTTCTTCTCATGTTGATAGTAAACTCACACTCTTAGACTCTCTTTTCCTCTTATATTTCATGCTTAGGGTTGCTGAGGACCTCCCAATTTATAGATTTTCACAAACTTGCAGCCAAATTTCTTTCCTAATTTAACAATTTTTCAATTTTGGAAAGTTGGTGCCGTGACACTTCAGGACAACGTTGCGATGCTGACCATCATAAAAAGTAACCCCCAAATGCCGTGTTGTAGCGCCTCCTCTACGCTGTGGAACTTCCTATATAGCGGTATGACACTCTTCCAAATCTCAAATTAGATGCTCTCAGGTCTACAGCACCAAGGCAACGCCACCTATAGAGTCACGACATTGCCCTATATTTGCACTTTTCCCTCTATTTGTCATCCAGTTTTGGCTCAGTTTGGTTCTTTTGGCCTTTAACACTTAATTTCTTCGTAAGTATTTGAAAACCACTAGCAACTAGCATCATATCTAATAGAATCATCCCTAAATTAAAGGCTAATAAAACAATGTTTAGGTGCTTTTCAGTTAGTTCAGGCAAAGTCACGTTTGGAGATGGTGCCACAGGAAAAATAATTGGGAAGGGCAAACCAATTACCATGGTCTCTCAGTGATGTTATGTTTGTTGAAGGTCTCACTGCTAACCTTATTAGCGTCAATCAGTTGTGCGATCAAGGTTTGAATGCCAACTTTACAAATGATCAATGTGTTGTCACAGATAATGACTAGGCTCATGTTATGACAGGTACTCGATCGTCTGATAATTGCTATTTATGGTCTTCTAACAATGCTTTACAAGTCCGTCATCTCTCTCATTAAGTGAAACATGTATGTGGCATAAATGCCTAGGACATGTGAGCATGAAAACTATTCAAAAAACAGTGGCCAAGGATGGCATATCAGGGCTCCCTCTGTTGCCTGCTAAAGGTAGAATTGTGTGTAGTGATTGTCAGGTAG
GCAACAAACTCATGCCCCACATAAACAGATTTCTCATATTGGTTCAAGTCGTATTTTGGAACTTCTTCATCTAGACCTTATGGGTCCCATGCAATTTAAGAGTTTGAGAGGAAAAAGGTATGTCATAGTGTGTATTGTAGAATGAAAGAATTGAACTCTTCAAGAGATGACACGAGCCATGATACATGCTAAACGGATGCCATTTCATTTTTGGGGTGAAACCATTAACACAACTTGTCATATCCATAACAGAGTTGTTCTGCGACTAGGCTCCTCCAGCACGAATTACGAAATATGGAGGCGGAGGAAACCAAATGTTAA

mRNA sequence

ATGCACACCAGAGTTTGGGTACTCCAGTCTTCTCATATTCATATGTTTCTACCATGTCCAATACCGGGGAGAACACTAGATTTACCTTTGGAACCTGGAAAGAAGGTTGGCGTGTACATAGAGATCAAAGTTCCATTAGCTTCCAGAGGTCGACCTTGCCCCGTGCTACGATATTCAACTCGAAAAGTGTTACTGCCGCCGATCACCAATTTTCTTCATCTCCTTCTCGATAAGTACTTTCTCCTGTGTCCAGATATTCCTACTAGATCCTTAGCCAAAGGAGATTGGGTCTTTGATAGTGGAAGCTCAAGGTATATGATAGGAGAAAAGAGTTACTTGTCAGATTTGAAATCTATTAGTGATAGAGACAACCGCAAGTGCACGAGTCAAGTTGGTGCCGTGACACTTCAGGACAACGTTGCGATGCTGACCATCATAAAAAGTAACCCCCAAATGCCGTGTTGTAGCGCCTCCTCTACGCTGTGGAACTTCCTATATAGCGGTCTCACTGCTAACCTTATTAGCGTCAATCAGTTGTGCGATCAAGGACATGTGAGCATGAAAACTATTCAAAAAACAGTGGCCAAGGATGGCATATCAGGGCTCCCTCTGTTGCCTGCTAAAGGTAGAATTGTGTGTAGTGATTGTCAGAGTTGTTCTGCGACTAGGCTCCTCCAGCACGAATTACGAAATATGGAGGCGGAGGAAACCAAATGTTAA

Coding sequence (CDS)

ATGCACACCAGAGTTTGGGTACTCCAGTCTTCTCATATTCATATGTTTCTACCATGTCCAATACCGGGGAGAACACTAGATTTACCTTTGGAACCTGGAAAGAAGGTTGGCGTGTACATAGAGATCAAAGTTCCATTAGCTTCCAGAGGTCGACCTTGCCCCGTGCTACGATATTCAACTCGAAAAGTGTTACTGCCGCCGATCACCAATTTTCTTCATCTCCTTCTCGATAAGTACTTTCTCCTGTGTCCAGATATTCCTACTAGATCCTTAGCCAAAGGAGATTGGGTCTTTGATAGTGGAAGCTCAAGGTATATGATAGGAGAAAAGAGTTACTTGTCAGATTTGAAATCTATTAGTGATAGAGACAACCGCAAGTGCACGAGTCAAGTTGGTGCCGTGACACTTCAGGACAACGTTGCGATGCTGACCATCATAAAAAGTAACCCCCAAATGCCGTGTTGTAGCGCCTCCTCTACGCTGTGGAACTTCCTATATAGCGGTCTCACTGCTAACCTTATTAGCGTCAATCAGTTGTGCGATCAAGGACATGTGAGCATGAAAACTATTCAAAAAACAGTGGCCAAGGATGGCATATCAGGGCTCCCTCTGTTGCCTGCTAAAGGTAGAATTGTGTGTAGTGATTGTCAGAGTTGTTCTGCGACTAGGCTCCTCCAGCACGAATTACGAAATATGGAGGCGGAGGAAACCAAATGTTAA

Protein sequence

MHTRVWVLQSSHIHMFLPCPIPGRTLDLPLEPGKKVGVYIEIKVPLASRGRPCPVLRYSTRKVLLPPITNFLHLLLDKYFLLCPDIPTRSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSNPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQGHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSDCQSCSATRLLQHELRNMEAEETKC
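
The protein sequence follows from the CDS by reading it three bases at a time. The sketch below illustrates that step with the standard genetic code; `translate` is a hypothetical helper written for this example, and only the first and last few codons of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 0.

    Codons containing characters other than A, C, G, T become 'X'.
    With to_stop=True, translation ends before the first stop codon.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGCACACCAGAGTTTGG"))  # MHTRVW
```

The last four codons of the CDS, ACC AAA TGT TAA, translate the same way to T, K, C and a stop, which agrees with the final residues of the protein sequence.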
Homology
BLAST of Clc11G09160 vs. NCBI nr
Match: GAU42103.1 (hypothetical protein TSUD_134870 [Trifolium subterraneum])

HSP 1 Score: 76.6 bits (187), Expect = 3.2e-10
Identity = 48/117 (41.03%), Positives = 69/117 (58.97%), Query Frame = 0

Query: 89  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKS 148
           R+ +K DW FDSG SR+M G+  +L D+KS S       T   GA      V  L I KS
Sbjct: 577 RASSKEDWYFDSGCSRHMTGDDRFLVDIKSYS---TSYVTFGDGAKGEIIGVGKL-INKS 636

Query: 149 NPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQGHVSMKTIQKTVAKDGISGLPLL 206
            P++            L  GLTANLIS++QLCDQGH+++++++K ++++ I GLP L
Sbjct: 637 LPKLDNV--------LLVKGLTANLISISQLCDQGHLNLRSMKKEISEEAIRGLPKL 681
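
The percentages in each HSP header are the match counts divided by the alignment length. As a small worked check, the sketch below recomputes them for the first two HSPs reported on this page; `percent` is a hypothetical helper name.

```python
def percent(count: int, alignment_length: int) -> str:
    """Return count / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * count / alignment_length:.2f}"


# HSP against GAU42103.1: 48 identical and 69 positive columns out of 117.
print(percent(48, 117), percent(69, 117))  # 41.03 58.97

# HSP against PNX99239.1: 44 identical and 55 positive columns out of 95.
print(percent(44, 95), percent(55, 95))    # 46.32 57.89
```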

BLAST of Clc11G09160 vs. NCBI nr
Match: PNX99239.1 (gag-pol polyprotein, partial [Trifolium pratense])

HSP 1 Score: 66.2 bits (160), Expect = 4.3e-07
Identity = 44/95 (46.32%), Positives = 55/95 (57.89%), Query Frame = 0

Query: 89  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKS 148
           R  +K DW FDSG S++MIGEK+YL ++KS S   N   T   GA      +  L    S
Sbjct: 151 RVSSKEDWYFDSGCSKHMIGEKTYLKEVKSYS---NSYVTFGDGAKGKIKGIGKL----S 210

Query: 149 NPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQG 184
            P +P  S  + L   L  GLTANLIS++QLCDQG
Sbjct: 211 GPDLP--SLDNVL---LVEGLTANLISISQLCDQG 233

BLAST of Clc11G09160 vs. NCBI nr
Match: MCH89489.1 (gag-pol polyprotein [Trifolium medium])

HSP 1 Score: 65.5 bits (158), Expect = 7.3e-07
Identity = 47/172 (27.33%), Positives = 76/172 (44.19%), Query Frame = 0

Query: 89  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKS 148
           R+ ++ DW FDSG SR+M G + +L D+KS S             VT  D       IK 
Sbjct: 177 RASSREDWYFDSGCSRHMTGVEKFLVDIKSYS----------TSFVTFGDGAK--GEIKG 236

Query: 149 NPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQ-------------------------- 208
             ++            L  GLTANLIS++QLCDQ                          
Sbjct: 237 VGKLXXXXXXKLDNVLLVKGLTANLISISQLCDQGMKVNFTKSECLVTNEKEDEVKLWHR 296

Query: 209 --GHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSDCQSCSATRLLQHELRNM 233
             GH++++ +++ + ++   GLP L  +   +C +CQ    TR+   +L+++
Sbjct: 297 KLGHLNLRGMKRAITEEATRGLPKLIIEEDNICGECQIGKQTRMSHQKLQHL 336

BLAST of Clc11G09160 vs. NCBI nr
Match: KAA0056418.1 (gag-pol polyprotein [Cucumis melo var. makuwa] >TYK29102.1 gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 62.0 bits (149), Expect = 8.0e-06
Identity = 50/195 (25.64%), Positives = 73/195 (37.44%), Query Frame = 0

Query: 96  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQM 155
           W FDSGSSR+M G +S+ ++L+        +C S  G VT  D      I K N     +
Sbjct: 247 WYFDSGSSRHMTGNRSFFTELE--------ECVS--GHVTFGDGAKGKIIAKGNINKSNL 306

Query: 156 PCCSASSTLWNFLYSGLTANLISVNQLCDQ------------------------------ 215
           PC +    +      GL ANLISV+QLCDQ                              
Sbjct: 307 PCLNKVRYV-----DGLKANLISVSQLCDQGYSVNFNNTGYVVTDKNNQVFMSRRREADN 366

Query: 216 ---------------------------GHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSD 231
                                      GH+S++++ K +  + + G+P L   G+  C D
Sbjct: 367 YYHWSSNGSNICHLTKVDQTWLWHTKLGHISLRSLDKIIKNEAVVGIPSLDINGKFFCGD 426

BLAST of Clc11G09160 vs. NCBI nr
Match: KAA0035514.1 (gag-pol polyprotein [Cucumis melo var. makuwa] >TYK31017.1 gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 62.0 bits (149), Expect = 8.0e-06
Identity = 46/143 (32.17%), Positives = 64/143 (44.76%), Query Frame = 0

Query: 96  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQM 155
           W FDSG SR+M G +S+ ++L+        +CTS  G VT  D      I K N     +
Sbjct: 214 WYFDSGCSRHMTGNQSFFTELE--------ECTS--GHVTFGDRAKGRIIAKGNIDKSNL 273

Query: 156 PCCSASSTLWNFLY-SGLTANLISVNQLCDQ-------------------------GHVS 210
           PC      L+   Y  GL ANLIS++QLCDQ                         GH+S
Sbjct: 274 PC------LYEVRYVDGLKANLISISQLCDQGYNVNFNNTGCVVTEKKSSVYEWKLGHIS 333

BLAST of Clc11G09160 vs. ExPASy TrEMBL
Match: A0A2Z6NJJ4 (Reverse transcriptase Ty1/copia-type domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_134870 PE=4 SV=1)

HSP 1 Score: 76.6 bits (187), Expect = 1.5e-10
Identity = 48/117 (41.03%), Positives = 69/117 (58.97%), Query Frame = 0

Query: 89  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKS 148
           R+ +K DW FDSG SR+M G+  +L D+KS S       T   GA      V  L I KS
Sbjct: 577 RASSKEDWYFDSGCSRHMTGDDRFLVDIKSYS---TSYVTFGDGAKGEIIGVGKL-INKS 636

Query: 149 NPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQGHVSMKTIQKTVAKDGISGLPLL 206
            P++            L  GLTANLIS++QLCDQGH+++++++K ++++ I GLP L
Sbjct: 637 LPKLDNV--------LLVKGLTANLISISQLCDQGHLNLRSMKKEISEEAIRGLPKL 681

BLAST of Clc11G09160 vs. ExPASy TrEMBL
Match: A0A2K3N8B2 (Gag-pol polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g022503 PE=4 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 2.1e-07
Identity = 44/95 (46.32%), Positives = 55/95 (57.89%), Query Frame = 0

Query: 89  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKS 148
           R  +K DW FDSG S++MIGEK+YL ++KS S   N   T   GA      +  L    S
Sbjct: 151 RVSSKEDWYFDSGCSKHMIGEKTYLKEVKSYS---NSYVTFGDGAKGKIKGIGKL----S 210

Query: 149 NPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQG 184
            P +P  S  + L   L  GLTANLIS++QLCDQG
Sbjct: 211 GPDLP--SLDNVL---LVEGLTANLISISQLCDQG 233

BLAST of Clc11G09160 vs. ExPASy TrEMBL
Match: A0A5D3E4L4 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G002660 PE=4 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 3.9e-06
Identity = 46/143 (32.17%), Positives = 64/143 (44.76%), Query Frame = 0

Query: 96  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQM 155
           W FDSG SR+M G +S+ ++L+        +CTS  G VT  D      I K N     +
Sbjct: 214 WYFDSGCSRHMTGNQSFFTELE--------ECTS--GHVTFGDRAKGRIIAKGNIDKSNL 273

Query: 156 PCCSASSTLWNFLY-SGLTANLISVNQLCDQ-------------------------GHVS 210
           PC      L+   Y  GL ANLIS++QLCDQ                         GH+S
Sbjct: 274 PC------LYEVRYVDGLKANLISISQLCDQGYNVNFNNTGCVVTEKKSSVYEWKLGHIS 333

BLAST of Clc11G09160 vs. ExPASy TrEMBL
Match: A0A5D3DZQ7 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold120G002370 PE=4 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 3.9e-06
Identity = 50/195 (25.64%), Positives = 73/195 (37.44%), Query Frame = 0

Query: 96  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQM 155
           W FDSGSSR+M G +S+ ++L+        +C S  G VT  D      I K N     +
Sbjct: 247 WYFDSGSSRHMTGNRSFFTELE--------ECVS--GHVTFGDGAKGKIIAKGNINKSNL 306

Query: 156 PCCSASSTLWNFLYSGLTANLISVNQLCDQ------------------------------ 215
           PC +    +      GL ANLISV+QLCDQ                              
Sbjct: 307 PCLNKVRYV-----DGLKANLISVSQLCDQGYSVNFNNTGYVVTDKNNQVFMSRRREADN 366

Query: 216 ---------------------------GHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSD 231
                                      GH+S++++ K +  + + G+P L   G+  C D
Sbjct: 367 YYHWSSNGSNICHLTKVDQTWLWHTKLGHISLRSLDKIIKNEAVVGIPSLDINGKFFCGD 426

BLAST of Clc11G09160 vs. ExPASy TrEMBL
Match: A0A5A7SMR2 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold653G00200 PE=4 SV=1)

HSP 1 Score: 59.7 bits (143), Expect = 1.9e-05
Identity = 47/195 (24.10%), Positives = 73/195 (37.44%), Query Frame = 0

Query: 96  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQM 155
           W FDSG SR+M G +S+ ++L+        +CTS    VT +D      I K N     +
Sbjct: 25  WYFDSGCSRHMTGNRSFFTELE--------ECTSV--HVTFEDGAKGRIIAKGNINKSNL 84

Query: 156 PCCSASSTLWNFLYSGLTANLISVNQLCDQ------------------------------ 215
           PC +    +      GL ANLIS++Q+CDQ                              
Sbjct: 85  PCLNEVRYM-----DGLKANLISISQICDQGYSVNFNNTGCVVTDKNNQVFMSGRRQTDN 144

Query: 216 ---------------------------GHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSD 231
                                      GH+SM+++ K +  + +  +P L   G+  C D
Sbjct: 145 CYHWSSNSSNICHLTKTDQTWLWHRKLGHISMRSLDKVIRNEAVVDIPSLDINGKFFCGD 204

The following BLAST results are available for this feature:

NCBI nr:
Match Name    E-value   Identity (%)  Description
GAU42103.1    3.2e-10   41.03         hypothetical protein TSUD_134870 [Trifolium subterraneum]
PNX99239.1    4.3e-07   46.32         gag-pol polyprotein, partial [Trifolium pratense]
MCH89489.1    7.3e-07   27.33         gag-pol polyprotein [Trifolium medium]
KAA0056418.1  8.0e-06   25.64         gag-pol polyprotein [Cucumis melo var. makuwa] >TYK29102.1 gag-pol polyprotein [...
KAA0035514.1  8.0e-06   32.17         gag-pol polyprotein [Cucumis melo var. makuwa] >TYK31017.1 gag-pol polyprotein [...

ExPASy TrEMBL:
Match Name    E-value   Identity (%)  Description
A0A2Z6NJJ4    1.5e-10   41.03         Reverse transcriptase Ty1/copia-type domain-containing protein OS=Trifolium subt...
A0A2K3N8B2    2.1e-07   46.32         Gag-pol polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g022503 PE...
A0A5D3E4L4    3.9e-06   32.17         Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G...
A0A5D3DZQ7    3.9e-06   25.64         Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold120G...
A0A5A7SMR2    1.9e-05   24.10         Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold653G...

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Clc11G09160.1   Clc11G09160.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0043170      macromolecule metabolic process
biological_process  GO:0006807      nitrogen compound metabolic process
biological_process  GO:0044238      primary metabolic process