Cla97C06G118067 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C06G118067
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Cla97Chr06: 11315319 .. 11318307 (+)
RNA-Seq Expression: Cla97C06G118067
Synteny: Cla97C06G118067
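
As a rough illustration of how the Location field can be read, the Python sketch below parses the location string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper name parse_location is hypothetical rather than part of any database API.

```python
import re

def parse_location(text):
    """Split 'chrom: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cla97Chr06: 11315319 .. 11318307 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 2,989 bp on the forward strand of Cla97Chr06.
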
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTTTCTCCTACAGTGGACCATTGGGCTTCAGTAGAACAAATTCTATGTTATTTGAAAGCTGCACCTGGATGTGGGATCTTATATAAAGATCATGACCATACAAGAGTTGAATGTTTTCCAGATGCTGATTGGGCTGGATCTCGAGAAGATAGGAGATCGACCTCTAGATATTGTGTCTTTGTAGGAGGAAACTTAGTATTGTGTAAGAGTAAGAAACAAAATGTAGTTTCACGTTCGAGTGTTGAGTCAAACTATAGGGCCATGGCACAATCTGTGTGCGAAATAGTGTGGATATATCAACTATTATCCAAGATGGGATTCAAAGTTACCATACTAGCTAAATTATGGTGTGATAATCTAGCTGCACTTCACATTTCATCTTACCAGGTGTTTCATGAACGGATGAACATATTGACGTGGATTGTCATTTCATTCGCGAGCAAATACAAGGGTTGGTGTCCTTAGGATATGTGAAGACTGGAGGACAATTGGGAGATATTCTTACCAAAGTTGTAAATGTAGCAAGAATAAGCTATCTATGCAACAAGCTGGACATGACTGATATATTTGCTCCAGCTTGAGGGGGAGTGTTATAATACATATTTTACATTTCATGTATAATCGTCCTTCATCGTAATTGTACATATCTAGTCTTTTCTTGTTCCCTATATATATGTAACTTTATTCTTAATAATATAGGGAGAATACATTCTAAATTACTGAGTACAAAGCACAAACCAATGAACCAAAGAAGATAAAAAGATGACTCTAATGTGGGAAGATGAAAACTCTCCAAATGAAGCATATTTCTTGCACTGCAAAGCTTGAAAATGGCTTGGGAAGATAGCACTACGTGGATGTGTTAAGCCTTGCAACTTCAAACCTATTATTCCACGATAAGGATTTTTCATGAAAGACACGTTGGTTCCCTTCGAACCAAATTTCAGTAAGCAATGTCTTCATCGCATTTGCCCAAATTAAGAAAGCTTGAGATTTTAACTTCGGACCTCCCAAAACCTGGACAACATTCTCCTTAAAACAGCCACTGAATGCCAAATGAACTGAAAAGTATCAAATAATTTCTGTCAACATGTTGAGGAGAAGCTACAATAAAAAAATAAATATCAAATAATTCATTATGCAATATTCTAGGTTTCATGGAATTAATTGCAAAGCATATGCCTGAGTCTAGAAAATAAATTTGGGTTATTTTCCATGAAAAAAATTATTTTCTCATGCAACTTAATTGATCACTAACTTCTTGAAACCGAATCAAATCCCATGACATTAAGATTAGTCTTTAACAACTCTCATTATTTACCTAACTATTTCCATGAGAGATTAATAATGATTCATTGTTATAAGTCTGGTTAAACATGCAGTCCATCAAATTGAATATCTAACCTAACAAATTCCCACCAATAAATAGTTCTATCATTCCTCCTCATTTAGCCACAGTTGATGATTGTGGTCTTGGTTTGGGCTGATTATTAGCTTGTTCTTGTTGATACTCTATTATGAAGATCATCACATGGAATACCAGCTGCCTCGGTGATAGTTCTAAACAATTGGCCCTTAAGTGTTTTTTGAAGTAACAGAACCCGAATTTGGTTTTAATCCAAGAATCTAAAAAGGATGAGTTTGATGCTGCATTTATTAAGTTGTTATGGAGCTCAAAGGATATTGGGTGGGCATTTGTGGAATCAATTGGCAAATCAGGTAGGATTTTAACTATGTGGGATGAAAGCAAGCTAATGGTAACTGAAGTATTAAAAGGTGGTCACTCCTCATCGGTCAAATGTATGACTACTTACAAAAAAATTTGTTGGATTACAAACGTGTACAACCCTAATGACTACAAAGAACGAAGATATATTTGGCAAGAGCTATCTTCTTTGGCAGATTATTGCATTGAACAATGGTGTCTTCGAGGAGATTTCAACATTACAAGATCACCTCAAGAACGATCTCCAATTGGCAGTGTCACACGAGGCATG
AGAAAATTCAACAAATTTATCATTGTCACTCAATTGATGGAAATCCCTTTATCAAATGGTAGATTCACTTGGTCAAGGGAAGGAAGCTCAATTTCAAGATCTCTTATTGGTAGATTCCTTGTTACAAAGGCTTGGGATGACATGTTTGAAAATTCTAGAGTTTCAAGGCAAGCTCGTACTTTTTCAAACCATTTTCCGTTACTACTTAAAGCTGGTTCGTTTTCTTGGTGTCATCCTCCTTTTCGTTTATGCAATAGTTGGCTTTTAGAGAAAGCTTGTTGTCAATTAATTGAACGATCCTTGTCAAATGGAAACTTCCAAGGTTAGGTAGGTTTCATTATTGATTCAAGGCTTAAAAAGGTCAAATCAGCAATTAAAACTTGGCACAAAGATTCAGAAGCTTCTAAGAAGAGAAGGGAGGAGGAATTACTAGAAGATATGCAGAAACTTGATATACAAGTCGATAATCAAGAGGCAGTTTCAGAAGAAATAAACTTGAGAACCTCTTAAAAGCTGGGCTCTTACCGATGTATCAAATGGAAGAAAGAAGACTTATACAACAAAGTAAACTAAATTGGCTGAGGTTGGGAGATGAAAATTCAAGCTTTTTCCACTGTTTTTTTACAGCAAAGAAGAGGGGAAATTTAGTTACGGAACTGAATAATGAGCAGGGTGTTCCTTCCAAAACATTCCGTGAAATCGAGAGTATTGTGTTGGGCTTCTACTCCTCTATTTATCTCGAATCCCCTAGATTAACATCCTTACCCCTCAACTTCTGCTGGACGAAGATTTTTAAGGAACAAAATGCATTATTGACTGCCAGTTGCATAGTAGAGGAAATTTTTCAGGCCTTAAAAGCACTTGGCAAGAATAAAGCTCCTGGACCAGATGGCTTTACAACTGAATTCTTACTTAAATACTGGAGTTCATTCCGATCAAATTTCTTGAAGTTATTTGAGGAATTTTCTTCAAATTGGAGCGTATTATGA

mRNA sequence

ATGTTTTCTCCTACAGTGGACCATTGGGCTTCAGTAGAACAAATTCTATGTTATTTGAAAGCTGCACCTGGATGTGGGATCTTATATAAAGATCATGACCATACAAGAGTTGAATGTTTTCCAGATGCTGATTGGGCTGGATCTCGAGAAGATAGGAGATCGACCTCTAGATATTGTGTCTTTGTAGGAGGAAACTTAGTATTGTGTAAGAGTAAGAAACAAAATGTAGTTTCACGTTCGAGTGTTGAGTCAAACTATAGGGCCATGGCACAATCTGTGTGCGAAATAGTGTGGATATATCAACTATTATCCAAGATGGGATTCAAAGTTACCATACTAGCTAAATTATGGTGTGATAATCTAGCTGCACTTCACATTTCATCTTACCAGAACCCGAATTTGGTTTTAATCCAAGAATCTAAAAAGGATGAGTTTGATGCTGCATTTATTAAGTTGTTATGGAGCTCAAAGGATATTGGGTGGGCATTTGTGGAATCAATTGGCAAATCAGGTAGGATTTTAACTATGTGGGATGAAAGCAAGCTAATGGTAACTGAAGTATTAAAAGGTGGTCACTCCTCATCGGTCAAATGTATGACTACTTACAAAAAAATTTGTTGGATTACAAACGTGTACAACCCTAATGACTACAAAGAACGAAGATATATTTGGCAAGAGCTATCTTCTTTGGCAGATTATTGCATTGAACAATGGTGTCTTCGAGGAGATTTCAACATTACAAGATCACCTCAAGAACGATCTCCAATTGGCAGTGTCACACGAGGCATGAGAAAATTCAACAAATTTATCATTGTCACTCAATTGATGGAAATCCCTTTATCAAATGGTAGATTCACTTGGTCAAGGGAAGGAAGCTCAATTTCAAGATCTCTTATTGGTAGATTCCTTGTTACAAAGGCTTGGGATGACATGTTTGAAAATTCTAGAGTTTCAAGGCAAGCTCGTACTTTTTCAAACCATTTTCCGTTACTACTTAAAGCTGCAAAGAAGAGGGGAAATTTAGTTACGGAACTGAATAATGAGCAGGGTGTTCCTTCCAAAACATTCCGTGAAATCGAGAGTATTGTGTTGGGCTTCTACTCCTCTATTTATCTCGAATCCCCTAGATTAACATCCTTACCCCTCAACTTCTGCTGGACGAAGATTTTTAAGGAACAAAATGCATTATTGACTGCCAGTTGCATAGTAGAGGAAATTTTTCAGGCCTTAAAAGCACTTGGCAAGAATAAAGCTCCTGGACCAGATGGCTTTACAACTGAATTCTTACTTAAATACTGGAGTTCATTCCGATCAAATTTCTTGAAGTTATTTGAGGAATTTTCTTCAAATTGGAGCGTATTATGA

Coding sequence (CDS)

ATGTTTTCTCCTACAGTGGACCATTGGGCTTCAGTAGAACAAATTCTATGTTATTTGAAAGCTGCACCTGGATGTGGGATCTTATATAAAGATCATGACCATACAAGAGTTGAATGTTTTCCAGATGCTGATTGGGCTGGATCTCGAGAAGATAGGAGATCGACCTCTAGATATTGTGTCTTTGTAGGAGGAAACTTAGTATTGTGTAAGAGTAAGAAACAAAATGTAGTTTCACGTTCGAGTGTTGAGTCAAACTATAGGGCCATGGCACAATCTGTGTGCGAAATAGTGTGGATATATCAACTATTATCCAAGATGGGATTCAAAGTTACCATACTAGCTAAATTATGGTGTGATAATCTAGCTGCACTTCACATTTCATCTTACCAGAACCCGAATTTGGTTTTAATCCAAGAATCTAAAAAGGATGAGTTTGATGCTGCATTTATTAAGTTGTTATGGAGCTCAAAGGATATTGGGTGGGCATTTGTGGAATCAATTGGCAAATCAGGTAGGATTTTAACTATGTGGGATGAAAGCAAGCTAATGGTAACTGAAGTATTAAAAGGTGGTCACTCCTCATCGGTCAAATGTATGACTACTTACAAAAAAATTTGTTGGATTACAAACGTGTACAACCCTAATGACTACAAAGAACGAAGATATATTTGGCAAGAGCTATCTTCTTTGGCAGATTATTGCATTGAACAATGGTGTCTTCGAGGAGATTTCAACATTACAAGATCACCTCAAGAACGATCTCCAATTGGCAGTGTCACACGAGGCATGAGAAAATTCAACAAATTTATCATTGTCACTCAATTGATGGAAATCCCTTTATCAAATGGTAGATTCACTTGGTCAAGGGAAGGAAGCTCAATTTCAAGATCTCTTATTGGTAGATTCCTTGTTACAAAGGCTTGGGATGACATGTTTGAAAATTCTAGAGTTTCAAGGCAAGCTCGTACTTTTTCAAACCATTTTCCGTTACTACTTAAAGCTGCAAAGAAGAGGGGAAATTTAGTTACGGAACTGAATAATGAGCAGGGTGTTCCTTCCAAAACATTCCGTGAAATCGAGAGTATTGTGTTGGGCTTCTACTCCTCTATTTATCTCGAATCCCCTAGATTAACATCCTTACCCCTCAACTTCTGCTGGACGAAGATTTTTAAGGAACAAAATGCATTATTGACTGCCAGTTGCATAGTAGAGGAAATTTTTCAGGCCTTAAAAGCACTTGGCAAGAATAAAGCTCCTGGACCAGATGGCTTTACAACTGAATTCTTACTTAAATACTGGAGTTCATTCCGATCAAATTTCTTGAAGTTATTTGAGGAATTTTCTTCAAATTGGAGCGTATTATGA

Protein sequence

MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKVTILAKLWCDNLAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILTMWDESKLMVTEVLKGGHSSSVKCMTTYKKICWITNVYNPNDYKERRYIWQELSSLADYCIEQWCLRGDFNITRSPQERSPIGSVTRGMRKFNKFIIVTQLMEIPLSNGRFTWSREGSSISRSLIGRFLVTKAWDDMFENSRVSRQARTFSNHFPLLLKAAKKRGNLVTELNNEQGVPSKTFREIESIVLGFYSSIYLESPRLTSLPLNFCWTKIFKEQNALLTASCIVEEIFQALKALGKNKAPGPDGFTTEFLLKYWSSFRSNFLKLFEEFSSNWSVL
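
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard nuclear genetic code and that the CDS is in frame from its first base; translate is a hypothetical helper, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First 14 codons and last 6 codons of the CDS listed above.
print(translate("ATGTTTTCTCCTACAGTGGACCATTGGGCTTCAGTAGAACAA"))
print(translate("AATTGGAGCGTATTATGA"))
```

The two fragments translate to MFSPTVDHWASVEQ and NWSVL, matching the start and end of the protein sequence listed for this gene.
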
Homology
BLAST of Cla97C06G118067 vs. NCBI nr
Match: TYJ98683.1 (hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa])

HSP 1 Score: 263.8 bits (673), Expect = 2.6e-66
Identity = 143/309 (46.28%), Positives = 186/309 (60.19%), Query Frame = 0

Query: 118 CDNLAALHISSYQNP-NLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILTM 177
           C   + +   S+ +P +LV+   ++  E D A IK LWSSKDIGW  VES G+ G ILTM
Sbjct: 54  CSQWSKIVAYSWLDPDHLVICYRNQGQEIDIALIKSLWSSKDIGWELVESFGRFGGILTM 113

Query: 178 WDESKLMVTEVLKGGHSSSVKCMTTYKKICWITNVYNPNDYKERRYIWQELSSLADYCIE 237
           WD SK+ V E LKGG+S S+  +T+ KK CWITNVY P DY+ERR++W  L SL+ YC  
Sbjct: 114 WDMSKIKVVETLKGGYSLSINSITSCKKSCWITNVYGPYDYEERRFVWLVLVSLSGYCTG 173

Query: 238 QWCLRGDFNITRSPQERSPIGSVTRGMRKFNKFIIVTQLMEIPLSNGRFTWSREGSSISR 297
            WC+ G  NITR   E  P+   TRGMR+FN  I    + E+PL NGR TWSREGSSISR
Sbjct: 174 AWCIGGKCNITRWAHECFPLEKQTRGMRQFNNPIDSLNIWELPLQNGRCTWSREGSSISR 233

Query: 298 SLIGRFLVTKAWDDMFENSRVSRQARTFSNHFPLLLKAAKKRGNLVTELNNEQGVPSKTF 357
           SL+  F + K WD++ ENSRV R+A T S+HFPLLL+A   +        +   +P   F
Sbjct: 234 SLLDPFFIDKEWDEISENSRVGRKAHTISDHFPLLLEAGSIKWGPSPFRFSNSWLP---F 293

Query: 358 REIESIVLGFYSSIYLESPRLTSLPLNFCWTKIFKEQNALLTASCIVEEIFQALKALGKN 417
            E   I+   ++        +TS+     +  I        T +    EIF+ALKALGKN
Sbjct: 294 SECNRIIKEVWN--------ITSITDWAGFVLIECSSRCTFTDA----EIFKALKALGKN 347

Query: 418 KAPGPDGFT 426
           K+P P+GFT
Sbjct: 354 KSPSPNGFT 347
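
The percentages in an HSP summary line are the ratio of matching positions to alignment length. The sketch below, using a hypothetical helper named hsp_fractions, recomputes them from the raw counts for the first hit; it is an illustration of the arithmetic, not the code that produced this report.

```python
import re

def hsp_fractions(line):
    """Return {label: percent} for every 'Label = matches/length' pair on the line."""
    return {
        label: round(100 * int(n) / int(total), 2)
        for label, n, total in re.findall(r"(\w+) = (\d+)/(\d+)", line)
    }

stats = hsp_fractions(
    "Identity = 143/309 (46.28%), Positives = 186/309 (60.19%), Query Frame = 0"
)
print(stats)
```

Here 143 identical positions out of 309 aligned columns give 46.28%, and 186 conservative-or-identical positions give 60.19%.
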

BLAST of Cla97C06G118067 vs. NCBI nr
Match: XP_031744754.1 (uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus])

HSP 1 Score: 238.4 bits (607), Expect = 1.2e-58
Identity = 121/175 (69.14%), Positives = 136/175 (77.71%), Query Frame = 0

Query: 1    MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCV 60
            M SPTVDHWA+VEQILCYLKAAPG GILYKDH HTRVECF DADWAGSREDRRSTS YCV
Sbjct: 1153 MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCV 1212

Query: 61   FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKVTILAKLWCDN 120
            FVGGNLV  KSKKQNVVSRSS ES YRAMAQSVCEIVWI+QLLS++GF +T+ AKLWCDN
Sbjct: 1213 FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDN 1272

Query: 121  LAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT 176
             AALHI+S    N V  + +K  E D  FI+       +   +V++  + G ILT
Sbjct: 1273 QAALHIAS----NPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILT 1323

BLAST of Cla97C06G118067 vs. NCBI nr
Match: XP_031744758.1 (uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus])

HSP 1 Score: 238.4 bits (607), Expect = 1.2e-58
Identity = 121/175 (69.14%), Positives = 136/175 (77.71%), Query Frame = 0

Query: 1   MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCV 60
           M SPTVDHWA+VEQILCYLKAAPG GILYKDH HTRVECF DADWAGSREDRRSTS YCV
Sbjct: 776 MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCV 835

Query: 61  FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKVTILAKLWCDN 120
           FVGGNLV  KSKKQNVVSRSS ES YRAMAQSVCEIVWI+QLLS++GF +T+ AKLWCDN
Sbjct: 836 FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDN 895

Query: 121 LAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT 176
            AALHI+S    N V  + +K  E D  FI+       +   +V++  + G ILT
Sbjct: 896 QAALHIAS----NPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILT 946

BLAST of Cla97C06G118067 vs. NCBI nr
Match: XP_031744753.1 (uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus])

HSP 1 Score: 238.4 bits (607), Expect = 1.2e-58
Identity = 121/175 (69.14%), Positives = 136/175 (77.71%), Query Frame = 0

Query: 1    MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCV 60
            M SPTVDHWA+VEQILCYLKAAPG GILYKDH HTRVECF DADWAGSREDRRSTS YCV
Sbjct: 1168 MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCV 1227

Query: 61   FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKVTILAKLWCDN 120
            FVGGNLV  KSKKQNVVSRSS ES YRAMAQSVCEIVWI+QLLS++GF +T+ AKLWCDN
Sbjct: 1228 FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDN 1287

Query: 121  LAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT 176
             AALHI+S    N V  + +K  E D  FI+       +   +V++  + G ILT
Sbjct: 1288 QAALHIAS----NPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILT 1338

BLAST of Cla97C06G118067 vs. NCBI nr
Match: XP_031744755.1 (uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus])

HSP 1 Score: 238.4 bits (607), Expect = 1.2e-58
Identity = 121/175 (69.14%), Positives = 136/175 (77.71%), Query Frame = 0

Query: 1    MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCV 60
            M SPTVDHWA+VEQILCYLKAAPG GILYKDH HTRVECF DADWAGSREDRRSTS YCV
Sbjct: 1120 MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCV 1179

Query: 61   FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKVTILAKLWCDN 120
            FVGGNLV  KSKKQNVVSRSS ES YRAMAQSVCEIVWI+QLLS++GF +T+ AKLWCDN
Sbjct: 1180 FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDN 1239

Query: 121  LAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT 176
             AALHI+S    N V  + +K  E D  FI+       +   +V++  + G ILT
Sbjct: 1240 QAALHIAS----NPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILT 1290

BLAST of Cla97C06G118067 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 102.1 bits (253), Expect = 1.7e-20
Identity = 53/151 (35.10%), Positives = 85/151 (56.29%), Query Frame = 0

Query: 1    MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCV 60
            M  PT DHW +++++L YL   P  GI  K  +   +  + DADWAG  +D  ST+ Y V
Sbjct: 1256 MHMPTDDHWNALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNGYIV 1315

Query: 61   FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKVTILAKLWCDN 120
            ++G + +   SKKQ  V RSS E+ YR++A +  E+ WI  LL+++G +++    ++CDN
Sbjct: 1316 YLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSSELQWICSLLTELGIQLSHPPVIYCDN 1375

Query: 121  LAALHISSYQNPNLVLIQESKKDEFDAAFIK 152
            + A ++ +    N V     K    D  FI+
Sbjct: 1376 VGATYLCA----NPVFHSRMKHIALDYHFIR 1402

BLAST of Cla97C06G118067 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 99.8 bits (247), Expect = 8.7e-20
Identity = 53/151 (35.10%), Positives = 85/151 (56.29%), Query Frame = 0

Query: 1    MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCV 60
            M  PT +H  ++++IL YL   P  GI  K  +   +  + DADWAG ++D  ST+ Y V
Sbjct: 1273 MHMPTEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGYIV 1332

Query: 61   FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKVTILAKLWCDN 120
            ++G + +   SKKQ  V RSS E+ YR++A +  E+ WI  LL+++G ++T    ++CDN
Sbjct: 1333 YLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSSEMQWICSLLTELGIRLTRPPVIYCDN 1392

Query: 121  LAALHISSYQNPNLVLIQESKKDEFDAAFIK 152
            + A ++ +    N V     K    D  FI+
Sbjct: 1393 VGATYLCA----NPVFHSRMKHIAIDYHFIR 1419

BLAST of Cla97C06G118067 vs. ExPASy Swiss-Prot
Match: P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 80.1 bits (196), Expect = 7.1e-14
Identity = 36/98 (36.73%), Positives = 58/98 (59.18%), Query Frame = 0

Query: 1   MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCV 60
           M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST+ +C 
Sbjct: 128 MHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRSTTGFCT 187

Query: 61  FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVW 99
           F+G N++   +K+Q  VSRSS E+ YRA+A +  E+ W
Sbjct: 188 FLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of Cla97C06G118067 vs. ExPASy Swiss-Prot
Match: P0CV72 (Secreted RxLR effector protein 161 OS=Plasmopara viticola OX=143451 GN=RXLR161 PE=2 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 3.0e-12
Identity = 35/96 (36.46%), Positives = 55/96 (57.29%), Query Frame = 0

Query: 4   PTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFVG 63
           P   HW +++++L YL++    G+ +      ++  + DADWAG  E RRSTS Y   + 
Sbjct: 38  PCPTHWQALKRVLRYLQSTQTYGLEFTRAGTAKLVGYSDADWAGDVESRRSTSGYLFKLN 97

Query: 64  GNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWI 100
           G  V  +SKKQ  V+ SS E  Y A++++  E VW+
Sbjct: 98  GGCVSWRSKKQRTVALSSTEDEYMALSEATQEAVWL 133

BLAST of Cla97C06G118067 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 72.0 bits (175), Expect = 1.9e-11
Identity = 40/128 (31.25%), Positives = 74/128 (57.81%), Query Frame = 0

Query: 9    WASVEQILCYLKAAPGCGILYKDH--DHTRVECFPDADWAGSREDRRSTSRYCV-FVGGN 68
            W +++++L YLK      +++K +     ++  + D+DWAGS  DR+ST+ Y       N
Sbjct: 1218 WQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFN 1277

Query: 69   LVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKVTILAKLWCDNLAALH 128
            L+   +K+QN V+ SS E+ Y A+ ++V E +W+  LL+ +  K+    K++ DN   + 
Sbjct: 1278 LICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCIS 1337

Query: 129  ISSYQNPN 134
            I++  NP+
Sbjct: 1338 IAN--NPS 1343

BLAST of Cla97C06G118067 vs. ExPASy TrEMBL
Match: A0A5D3BHE3 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold429G00120 PE=4 SV=1)

HSP 1 Score: 263.8 bits (673), Expect = 1.3e-66
Identity = 143/309 (46.28%), Positives = 186/309 (60.19%), Query Frame = 0

Query: 118 CDNLAALHISSYQNP-NLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILTM 177
           C   + +   S+ +P +LV+   ++  E D A IK LWSSKDIGW  VES G+ G ILTM
Sbjct: 54  CSQWSKIVAYSWLDPDHLVICYRNQGQEIDIALIKSLWSSKDIGWELVESFGRFGGILTM 113

Query: 178 WDESKLMVTEVLKGGHSSSVKCMTTYKKICWITNVYNPNDYKERRYIWQELSSLADYCIE 237
           WD SK+ V E LKGG+S S+  +T+ KK CWITNVY P DY+ERR++W  L SL+ YC  
Sbjct: 114 WDMSKIKVVETLKGGYSLSINSITSCKKSCWITNVYGPYDYEERRFVWLVLVSLSGYCTG 173

Query: 238 QWCLRGDFNITRSPQERSPIGSVTRGMRKFNKFIIVTQLMEIPLSNGRFTWSREGSSISR 297
            WC+ G  NITR   E  P+   TRGMR+FN  I    + E+PL NGR TWSREGSSISR
Sbjct: 174 AWCIGGKCNITRWAHECFPLEKQTRGMRQFNNPIDSLNIWELPLQNGRCTWSREGSSISR 233

Query: 298 SLIGRFLVTKAWDDMFENSRVSRQARTFSNHFPLLLKAAKKRGNLVTELNNEQGVPSKTF 357
           SL+  F + K WD++ ENSRV R+A T S+HFPLLL+A   +        +   +P   F
Sbjct: 234 SLLDPFFIDKEWDEISENSRVGRKAHTISDHFPLLLEAGSIKWGPSPFRFSNSWLP---F 293

Query: 358 REIESIVLGFYSSIYLESPRLTSLPLNFCWTKIFKEQNALLTASCIVEEIFQALKALGKN 417
            E   I+   ++        +TS+     +  I        T +    EIF+ALKALGKN
Sbjct: 294 SECNRIIKEVWN--------ITSITDWAGFVLIECSSRCTFTDA----EIFKALKALGKN 347

Query: 418 KAPGPDGFT 426
           K+P P+GFT
Sbjct: 354 KSPSPNGFT 347

BLAST of Cla97C06G118067 vs. ExPASy TrEMBL
Match: A0A5A7SQ84 (Putative mitochondrial protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold121G00990 PE=4 SV=1)

HSP 1 Score: 222.2 bits (565), Expect = 4.3e-54
Identity = 112/163 (68.71%), Positives = 125/163 (76.69%), Query Frame = 0

Query: 1   MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCV 60
           M SPTVDHWA+VEQILCY KAAPG GILYKDH HTRVECF DADWA SREDRRSTS YCV
Sbjct: 103 MSSPTVDHWAAVEQILCYSKAAPGRGILYKDHGHTRVECFSDADWAESREDRRSTSGYCV 162

Query: 61  FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKVTILAKLWCDN 120
           FVGG LV  KSKKQNVVSRSS +S YRA AQSVCEI WI+QLLS++GF +T+ AKLWCDN
Sbjct: 163 FVGGKLVSWKSKKQNVVSRSSAKSEYRATAQSVCEIAWIHQLLSEIGFSITVPAKLWCDN 222

Query: 121 LAALHISSYQNPNLVLIQESKKDEFDAAFI----KLLWSSKDI 160
             ALHI+S    N V  + +K  E D  FI    K+ W  +D+
Sbjct: 223 QVALHIAS----NPVFHERTKHIEVDCYFILRKSKMGWCPQDM 261

BLAST of Cla97C06G118067 vs. ExPASy TrEMBL
Match: A0A5D3CID2 (Putative mitochondrial protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold828G00470 PE=4 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 5.6e-54
Identity = 115/175 (65.71%), Positives = 129/175 (73.71%), Query Frame = 0

Query: 1   MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCV 60
           M  PTVDHWA VEQILCYLKAAPGCGIL KDH HTRVECF DADWAGSREDRRST  YCV
Sbjct: 663 MSFPTVDHWAVVEQILCYLKAAPGCGILCKDHGHTRVECFSDADWAGSREDRRSTFGYCV 722

Query: 61  FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKVTILAKLWCDN 120
           FVGGNLV  KSKKQNVVS  S ES YRAM QSVCEIVWI+QLLS++GF +T+ AKL CDN
Sbjct: 723 FVGGNLVSWKSKKQNVVSCLSAESKYRAMTQSVCEIVWIHQLLSEIGFSITVPAKLQCDN 782

Query: 121 LAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT 176
            AALHI+S    N V  + +K  E D  FI+       +   +V++  + G ILT
Sbjct: 783 QAALHIAS----NPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILT 833

BLAST of Cla97C06G118067 vs. ExPASy TrEMBL
Match: A0A5A7UHS1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold131G00150 PE=4 SV=1)

HSP 1 Score: 218.8 bits (556), Expect = 4.7e-53
Identity = 111/175 (63.43%), Positives = 130/175 (74.29%), Query Frame = 0

Query: 1   MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCV 60
           M  PTVDHWA+VEQILCYLKAA G GILYKDH HT+V+CF DADW GSREDRRS S YCV
Sbjct: 473 MSFPTVDHWAAVEQILCYLKAASGRGILYKDHGHTKVKCFSDADWVGSREDRRSISGYCV 532

Query: 61  FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKVTILAKLWCDN 120
           FVGGNLV  KSKKQNVVS SS +S YRAMAQSVCEIVWI+QLLS++GF +T+  KLWCDN
Sbjct: 533 FVGGNLVSWKSKKQNVVSCSSAKSEYRAMAQSVCEIVWIHQLLSEIGFSITVPVKLWCDN 592

Query: 121 LAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT 176
             ALHI+S    N V  +++K  E D  FI+       +   +V++  + G ILT
Sbjct: 593 QVALHIAS----NPVFHEQTKHIEVDCHFIREKIQDGLMSTGYVKTGEQLGDILT 643

BLAST of Cla97C06G118067 vs. ExPASy TrEMBL
Match: A0A5D3DZU1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2277G00150 PE=4 SV=1)

HSP 1 Score: 218.8 bits (556), Expect = 4.7e-53
Identity = 111/175 (63.43%), Positives = 130/175 (74.29%), Query Frame = 0

Query: 1   MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCV 60
           M  PTVDHWA+VEQILCYLKAA G GILYKDH HT+V+CF DADW GSREDRRS S YCV
Sbjct: 473 MSFPTVDHWAAVEQILCYLKAASGRGILYKDHGHTKVKCFSDADWVGSREDRRSISGYCV 532

Query: 61  FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKVTILAKLWCDN 120
           FVGGNLV  KSKKQNVVS SS +S YRAMAQSVCEIVWI+QLLS++GF +T+  KLWCDN
Sbjct: 533 FVGGNLVSWKSKKQNVVSCSSAKSEYRAMAQSVCEIVWIHQLLSEIGFSITVPVKLWCDN 592

Query: 121 LAALHISSYQNPNLVLIQESKKDEFDAAFIKLLWSSKDIGWAFVESIGKSGRILT 176
             ALHI+S    N V  +++K  E D  FI+       +   +V++  + G ILT
Sbjct: 593 QVALHIAS----NPVFHEQTKHIEVDCHFIREKIQDGLMSTGYVKTGEQLGDILT 643

BLAST of Cla97C06G118067 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 104.8 bits (260), Expect = 1.9e-22
Identity = 52/149 (34.90%), Positives = 89/149 (59.73%), Query Frame = 0

Query: 3   SPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFV 62
           +P + H  +V +IL Y+K   G G+ Y      +++ F DA +   ++ RRST+ YC+F+
Sbjct: 408 APRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFL 467

Query: 63  GGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVWIYQLLSKMGFKVTILAKLWCDNLA 122
           G +L+  KSKKQ VVS+SS E+ YRA++ +  E++W+ Q   ++   ++    L+CDN A
Sbjct: 468 GTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTA 527

Query: 123 ALHISSYQNPNLVLIQESKKDEFDAAFIK 152
           A+HI++    N V  + +K  E D   ++
Sbjct: 528 AIHIAT----NAVFHERTKHIESDCHSVR 552

BLAST of Cla97C06G118067 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 80.1 bits (196), Expect = 5.0e-15
Identity = 36/98 (36.73%), Positives = 58/98 (59.18%), Query Frame = 0

Query: 1   MFSPTVDHWASVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCV 60
           M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST+ +C 
Sbjct: 128 MHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRSTTGFCT 187

Query: 61  FVGGNLVLCKSKKQNVVSRSSVESNYRAMAQSVCEIVW 99
           F+G N++   +K+Q  VSRSS E+ YRA+A +  E+ W
Sbjct: 188 FLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of Cla97C06G118067 vs. TAIR 10
Match: ATMG00240.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 43.1 bits (100), Expect = 6.8e-04
Identity = 17/52 (32.69%), Positives = 29/52 (55.77%), Query Frame = 0

Query: 11 SVEQILCYLKAAPGCGILYKDHDHTRVECFPDADWAGSREDRRSTSRYCVFV 63
          +V ++L Y+K   G G+ Y      +++ F D+DWA   + RRS + +C  V
Sbjct: 31 AVYKVLHYVKGTVGQGLFYSATSDLQLKAFADSDWASCPDTRRSVTGFCSLV 82

The following BLAST results are available for this feature:

BLAST vs. NCBI nr
Match Name      E-value  Identity (%)  Description
TYJ98683.1      2.6e-66  46.28         hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa]
XP_031744754.1  1.2e-58  69.14         uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]
XP_031744758.1  1.2e-58  69.14         uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]
XP_031744753.1  1.2e-58  69.14         uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]
XP_031744755.1  1.2e-58  69.14         uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]

BLAST vs. ExPASy Swiss-Prot
Match Name      E-value  Identity (%)  Description
Q9ZT94          1.7e-20  35.10         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2          8.7e-20  35.10         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P92519          7.1e-14  36.73         Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ...
P0CV72          3.0e-12  36.46         Secreted RxLR effector protein 161 OS=Plasmopara viticola OX=143451 GN=RXLR161 P...
P04146          1.9e-11  31.25         Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3

BLAST vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A5D3BHE3      1.3e-66  46.28         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5A7SQ84      4.3e-54  68.71         Putative mitochondrial protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...
A0A5D3CID2      5.6e-54  65.71         Putative mitochondrial protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...
A0A5A7UHS1      4.7e-53  63.43         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5D3DZU1      4.7e-53  63.43         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....

BLAST vs. TAIR 10
Match Name      E-value  Identity (%)  Description
AT4G23160.1     1.9e-22  34.90         cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1     5.0e-15  36.73         DNA/RNA polymerases superfamily protein
ATMG00240.1     6.8e-04  32.69         Gag-Pol-related retrotransposon family protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                                   Source       Source Term     Source Description                    Alignment
IPR036691  Endonuclease/exonuclease/phosphatase superfamily  GENE3D       3.60.10.10      Endonuclease/exonuclease/phosphatase  coord: 115..333; e-value: 1.2E-21; score: 79.6
IPR036691  Endonuclease/exonuclease/phosphatase superfamily  SUPERFAMILY  56219           DNase I-like                          coord: 128..332
None       No IPR available                                  PANTHER      PTHR33710:SF32  SUBFAMILY NOT NAMED                   coord: 192..333
None       No IPR available                                  PANTHER      PTHR33710       BNAC02G09200D PROTEIN                 coord: 192..333
None       No IPR available                                  CDD          cd09272         RNase_HI_RT_Ty1                       coord: 38..132; e-value: 1.21066E-38; score: 134.903

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C06G118067.1  Cla97C06G118067.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016020      membrane