Cla97C03G055200 (gene) Watermelon (97103) v2.5

Overview
NameCla97C03G055200
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionCentromere protein S-like
LocationCla97Chr03: 4197056 .. 4199222 (-)
RNA-Seq ExpressionCla97C03G055200
SyntenyCla97C03G055200
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGAGGGAAATAAAGGCGGATGAAAACAGTGAGACTGCAGCAAAATTCCGTAAATTAACTCCAGAAGCCTCTGCAATGGAAACTGGGATGGAAGAAGACGACTCCGCCTCCGAACTCTTGAGGGACAGATTCCGACTCTCCACCATTTCTATTGCTGAAGCTGAAGGTTAATTCTCTTCATCTTCGAGCCTCTCTTGTTTTACATGTAATATAAGCTGAATGCTAGACGAATTCTTCATATTTTTGTTTGGCTTACTCTATTTTTTTTTTCTTCCAACTGTTTGGTTTGTTCTGCAGCGAAGAGAAGCGGCATGGAAATTTCTGAACCCGTGATGATTTGTGTTGCTGATTTAGCCTTCAAATATACAAGTTACTTTACTTTTCTTTTCTGCCTGTTTTGAGTCTGCAATGTTATTTTGAGCACTCTGTTTCTACATGACTACACCTTATATAACAGCATAATATATTGATCACTGAAGTTGCTAATCCTACCATTCGGAACACTTTCATTCTTCATTATTTTAGATAACCTAATTGCTAATTCTTTTAATTTTACATTTCTATATAATTTAGCATGCGTTCGGTTTGTTGGGACGCCTCTCTGTGCTGTGTTGTTAAATTGTGTGACCTAGTTGGAAGGAAACTGCATAACTTTGGTAAAGTTTAAGACAGAAAAGAGTTTCCAGAAGAACAAATGATGGAGAGACTATTGTGATAACTTAAAGAATGTCTTTGCACGATACTTTTGGATTCTAATCAGATGTTCTTATAGTTCATATAAATGTTTCTTTCGGAATGAGGGACAGAAGCTGTCCCTCATTCTACCTTGAGACTTACACCTTACAGTGATCATGTTGGGTGTAAAGCCTTAACTCTCGTTGTATTCTTGTGGGAGCATATGCCTTAGGTAGAACCAAAAATGGCGCTAATCTTGAGACTTAGATTTTGTGCTTCGCCTTAAGGAATACGATAGGAAGAATAGTCCTAGTTGTATTCTGTTTCTTTGTTTAGATTTCAATCACGTAAAGAAGACATATAGTTATTATTTTAATTTATATCAATATATTAGCGTCAAGTTCGTTTTCTGTTACCGATGAATATTCTATTCATTATCTAAAAGAAGCAAATATATACTTATACTTACTTATATCATATGCATGAAATTAGTGTATTCCTTCATCATAGGCATGAATGATTTGGTAGGTTTACAAAATTTCAACCAGACCTTGAGTTCATATCTTGGTATAGGGTCACACTTACGTTTGTAGTCACAGGGCTAACATATGAACCGCCAATGGTGTGTATAATACATGCAAGGAAAACAATTTTTTGTCAATTACTCTGCTGCAGTAGTTCTAGTTGCTAACCAGCAAGGAAAGAAGATAATTCTGTCATAGTTGTGTTTACTTTGTTTTTATGATTCTCAAAGTGTTGCAATGAGATAACGGAATGATAACTTGTGTAGCAGAACAGTTGGCAAAGGACCTTGAGTTATTTGTTCAGCATGCTGGTCGAAAATCTGTGAATACAGAAGATGTAATACTATCAGGTTAGATGGTCTGATTCCCAAGTTGTTGTTGTTACTGTCTAGTTATATTTAGTCCATATGCAACAGACTTGAAGTAATACATAATTGCCTGAACCGCTTTGCTTCTGCTTCATTTTGGTCTCAGCCCATAGAAACGAGCATTTGGCTGCTATATTAACATCCATTTGCAATGATCTAAAGACAAAAGAACCTCAAAGTGAGAGAAAGCGAAAAAAGGCACAAAAAAAAGAAGATAGAGATAGAGGTGCAGTGCATATTACCGACGCCTAATCACATCTCCCATCGGGCTGATCTAGTTAACAAGGGTGCATACCAACAAGCAATGGCAATAGCTTATGATCTCTTTCCTTTTTCTTGGTTATACTAATTAGTAATACTGTTTTCACATCACATGGGCCTATGGAGTTCTTCTGATTTAAAGGATAAGCTTCTGCACCATGCAAATTCATTTGTGTTTCCAAACGCAGGAAACATGGCTTTTGGTTTGTTTTAGGTATTTGAGATTGTCGAAGCTTGAGTTTCAATGTTCATTGGTGACCAACTCTGAACGGGGTGATAAATTGCTCTTTGTAAATCTACAGCACAAATGTATGATTTAAGACTTTTAGTTAAGTACTCA

mRNA sequence

GGAGGGAAATAAAGGCGGATGAAAACAGTGAGACTGCAGCAAAATTCCGTAAATTAACTCCAGAAGCCTCTGCAATGGAAACTGGGATGGAAGAAGACGACTCCGCCTCCGAACTCTTGAGGGACAGATTCCGACTCTCCACCATTTCTATTGCTGAAGCTGAAGCGAAGAGAAGCGGCATGGAAATTTCTGAACCCGTGATGATTTGTGTTGCTGATTTAGCCTTCAAATATACAAATAACGGAATGATAACTTGTGTAGCAGAACAGTTGGCAAAGGACCTTGAGTTATTTGTTCAGCATGCTGGTCGAAAATCTGTGAATACAGAAGATGTAATACTATCAGCCCATAGAAACGAGCATTTGGCTGCTATATTAACATCCATTTGCAATGATCTAAAGACAAAAGAACCTCAAAGTGAGAGAAAGCGAAAAAAGGCACAAAAAAAAGAAGATAGAGATAGAGGTGCAGTGCATATTACCGACGCCTAATCACATCTCCCATCGGGCTGATCTAGTTAACAAGGGTGCATACCAACAAGCAATGGCAATAGCTTATGATCTCTTTCCTTTTTCTTGGTTATACTAATTAGTAATACTGTTTTCACATCACATGGGCCTATGGAGTTCTTCTGATTTAAAGGATAAGCTTCTGCACCATGCAAATTCATTTGTGTTTCCAAACGCAGGAAACATGGCTTTTGGTTTGTTTTAGGTATTTGAGATTGTCGAAGCTTGAGTTTCAATGTTCATTGGTGACCAACTCTGAACGGGGTGATAAATTGCTCTTTGTAAATCTACAGCACAAATGTATGATTTAAGACTTTTAGTTAAGTACTCA

Coding sequence (CDS)

ATGGAAACTGGGATGGAAGAAGACGACTCCGCCTCCGAACTCTTGAGGGACAGATTCCGACTCTCCACCATTTCTATTGCTGAAGCTGAAGCGAAGAGAAGCGGCATGGAAATTTCTGAACCCGTGATGATTTGTGTTGCTGATTTAGCCTTCAAATATACAAATAACGGAATGATAACTTGTGTAGCAGAACAGTTGGCAAAGGACCTTGAGTTATTTGTTCAGCATGCTGGTCGAAAATCTGTGAATACAGAAGATGTAATACTATCAGCCCATAGAAACGAGCATTTGGCTGCTATATTAACATCCATTTGCAATGATCTAAAGACAAAAGAACCTCAAAGTGAGAGAAAGCGAAAAAAGGCACAAAAAAAAGAAGATAGAGATAGAGGTGCAGTGCATATTACCGACGCCTAA

Protein sequence

METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMITCVAEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRKKAQKKEDRDRGAVHITDA
Homology
BLAST of Cla97C03G055200 vs. NCBI nr
Match: XP_011652290.1 (protein MHF1 homolog isoform X1 [Cucumis sativus] >XP_011652296.1 protein MHF1 homolog isoform X1 [Cucumis sativus])

HSP 1 Score: 223.8 bits (569), Expect = 9.2e-55
Identity = 120/138 (86.96%), Postives = 124/138 (89.86%), Query Frame = 0

Query: 1   METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMIT 60
           METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVM CVADLAFKYT      
Sbjct: 1   METGMEEDDSASELLRDRFRLSSISIAEAEANKSGMEISEPVMTCVADLAFKYT------ 60

Query: 61  CVAEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRK 120
              EQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLKTKEPQSERKRK
Sbjct: 61  --TEQLAKDLELFAQHAGRKSVNTEDVILTAHRNEHLAAILTSICNDLKTKEPQSERKRK 120

Query: 121 KAQKKEDRDRGAVHITDA 139
           KA KK+DRDRGAVHI DA
Sbjct: 121 KAPKKDDRDRGAVHIADA 130

BLAST of Cla97C03G055200 vs. NCBI nr
Match: XP_038894698.1 (protein MHF1 homolog isoform X1 [Benincasa hispida])

HSP 1 Score: 223.0 bits (567), Expect = 1.6e-54
Identity = 121/138 (87.68%), Postives = 124/138 (89.86%), Query Frame = 0

Query: 1   METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMIT 60
           M+TG+EED SASELLRDRFRLSTISIAEAEA RSGMEISEPVM CVADLAFKYT      
Sbjct: 1   MKTGIEEDGSASELLRDRFRLSTISIAEAEANRSGMEISEPVMTCVADLAFKYT------ 60

Query: 61  CVAEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRK 120
              EQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHL+AILTSICNDLKTKEPQSERKRK
Sbjct: 61  --TEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLSAILTSICNDLKTKEPQSERKRK 120

Query: 121 KAQKKEDRDRGAVHITDA 139
           KA KKEDRDRGAVHI DA
Sbjct: 121 KAPKKEDRDRGAVHIIDA 130

BLAST of Cla97C03G055200 vs. NCBI nr
Match: XP_004154006.1 (protein MHF1 homolog isoform X2 [Cucumis sativus] >XP_004154007.1 protein MHF1 homolog isoform X2 [Cucumis sativus] >KGN64440.1 hypothetical protein Csa_013391 [Cucumis sativus])

HSP 1 Score: 221.9 bits (564), Expect = 3.5e-54
Identity = 119/138 (86.23%), Postives = 124/138 (89.86%), Query Frame = 0

Query: 1   METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMIT 60
           METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVM CVADLAFKYT      
Sbjct: 1   METGMEEDDSASELLRDRFRLSSISIAEAEANKSGMEISEPVMTCVADLAFKYT------ 60

Query: 61  CVAEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRK 120
              +QLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLKTKEPQSERKRK
Sbjct: 61  ---KQLAKDLELFAQHAGRKSVNTEDVILTAHRNEHLAAILTSICNDLKTKEPQSERKRK 120

Query: 121 KAQKKEDRDRGAVHITDA 139
           KA KK+DRDRGAVHI DA
Sbjct: 121 KAPKKDDRDRGAVHIADA 129

BLAST of Cla97C03G055200 vs. NCBI nr
Match: XP_038894699.1 (protein MHF1 homolog isoform X2 [Benincasa hispida])

HSP 1 Score: 221.1 bits (562), Expect = 6.0e-54
Identity = 120/138 (86.96%), Postives = 124/138 (89.86%), Query Frame = 0

Query: 1   METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMIT 60
           M+TG+EED SASELLRDRFRLSTISIAEAEA RSGMEISEPVM CVADLAFKYT      
Sbjct: 1   MKTGIEEDGSASELLRDRFRLSTISIAEAEANRSGMEISEPVMTCVADLAFKYT------ 60

Query: 61  CVAEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRK 120
              +QLAKDLELFVQHAGRKSVNTEDVILSAHRNEHL+AILTSICNDLKTKEPQSERKRK
Sbjct: 61  ---KQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLSAILTSICNDLKTKEPQSERKRK 120

Query: 121 KAQKKEDRDRGAVHITDA 139
           KA KKEDRDRGAVHI DA
Sbjct: 121 KAPKKEDRDRGAVHIIDA 129

BLAST of Cla97C03G055200 vs. NCBI nr
Match: XP_008440142.1 (PREDICTED: centromere protein S isoform X1 [Cucumis melo] >KAA0055356.1 centromere protein S isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 217.2 bits (552), Expect = 8.6e-53
Identity = 117/138 (84.78%), Postives = 122/138 (88.41%), Query Frame = 0

Query: 1   METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMIT 60
           MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVM CVADLAFK+T      
Sbjct: 1   METRMEEDDSASELLRDRFRLSTISIAEAEANKSGMEISEPVMTCVADLAFKFT------ 60

Query: 61  CVAEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRK 120
              EQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK KEPQSERKRK
Sbjct: 61  --TEQLAKDLELFAQHAGRKSVNTEDVILTAHRNEHLAAILTSICNDLKAKEPQSERKRK 120

Query: 121 KAQKKEDRDRGAVHITDA 139
           KA KK+DRDRGAVHI +A
Sbjct: 121 KAPKKDDRDRGAVHIAEA 130

BLAST of Cla97C03G055200 vs. ExPASy Swiss-Prot
Match: Q9FI55 (Protein MHF1 homolog OS=Arabidopsis thaliana OX=3702 GN=MHF1 PE=3 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 2.2e-35
Identity = 86/133 (64.66%), Postives = 101/133 (75.94%), Query Frame = 0

Query: 7   EDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMITCVAEQL 66
           E+ S  +L+RDRFRLS ISIAEAEAK++GMEI  PV+ CVADLAFKY         AE +
Sbjct: 116 EEYSMDDLIRDRFRLSAISIAEAEAKKNGMEIGGPVVACVADLAFKY---------AENV 175

Query: 67  AKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRKK-AQKK 126
           AKDLELF  HAGRK VN +DV+LSAHRN++LAA L S+CN+LK KEPQSERKRKK + KK
Sbjct: 176 AKDLELFAHHAGRKVVNMDDVVLSAHRNDNLAASLRSLCNELKAKEPQSERKRKKGSAKK 235

Query: 127 EDR--DRGAVHIT 137
           ED+     AV IT
Sbjct: 236 EDKASSSNAVRIT 239

BLAST of Cla97C03G055200 vs. ExPASy Swiss-Prot
Match: Q2TBR7 (Centromere protein S OS=Bos taurus OX=9913 GN=CENPS PE=2 SV=1)

HSP 1 Score: 53.9 bits (128), Expect = 1.7e-06
Identity = 36/113 (31.86%), Postives = 57/113 (50.44%), Query Frame = 0

Query: 26  IAEAEAKRSGMEISEPVMICVADLAFKYTNNGMITCVAEQLAKDLELFVQHAGRKSVNTE 85
           + E  A    M+ S+  +  ++++ F            E  AKDLE+F +HA R ++NTE
Sbjct: 30  LCEEVASDKDMQFSKQTIAAISEVTFGQ---------CENFAKDLEMFARHAKRSTINTE 89

Query: 86  DVILSAHRNEHLAAILTSICNDLKTKEPQSERKRKKAQKKEDRDRGAVHITDA 139
           DV L A R+  L   +T    D+   +   E+K KK +K ED +R +V   +A
Sbjct: 90  DVKLLARRSHSLLKYITEKNEDI--AQLNLEKKAKKKKKLEDENRNSVESAEA 131

BLAST of Cla97C03G055200 vs. ExPASy Swiss-Prot
Match: Q6NRI8 (Centromere protein S OS=Xenopus laevis OX=8355 GN=cenps PE=2 SV=1)

HSP 1 Score: 48.9 bits (115), Expect = 5.3e-05
Identity = 31/121 (25.62%), Postives = 59/121 (48.76%), Query Frame = 0

Query: 1   METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMIT 60
           M  G EE  S ++ L+        S+ +  A    ++ S+  +  ++++ F+        
Sbjct: 1   MAEGQEEHFSRTQRLKAAVHYVVGSLCQEVADDKEIDFSKQAIAAISEITFRQ------- 60

Query: 61  CVAEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRK 120
              E  AKDLE+F +HA R ++N +DV L A R+  L A ++   +++     + + K+K
Sbjct: 61  --CESFAKDLEIFARHAKRTTINMDDVKLLARRSRSLYAHISKCSDEIAANSLEQKEKKK 112

Query: 121 K 122
           K
Sbjct: 121 K 112

BLAST of Cla97C03G055200 vs. ExPASy Swiss-Prot
Match: Q8N2Z9 (Centromere protein S OS=Homo sapiens OX=9606 GN=CENPS PE=1 SV=1)

HSP 1 Score: 48.5 bits (114), Expect = 7.0e-05
Identity = 37/131 (28.24%), Postives = 63/131 (48.09%), Query Frame = 0

Query: 2   ETGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMITC 61
           ET  ++  S  + L+     +   + E  A    M+ S+  +  +++L F+         
Sbjct: 6   ETEEQQRFSYQQRLKAAVHYTVGCLCEEVALDKEMQFSKQTIAAISELTFRQ-------- 65

Query: 62  VAEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRKK 121
             E  AKDLE+F +HA R ++NTEDV L A R+  L   +T    ++   +   ERK +K
Sbjct: 66  -CENFAKDLEMFARHAKRTTINTEDVKLLARRSNSLLKYITDKSEEI--AQINLERKAQK 125

Query: 122 AQKKEDRDRGA 133
            +K ED  + +
Sbjct: 126 KKKSEDGSKNS 125

BLAST of Cla97C03G055200 vs. ExPASy Swiss-Prot
Match: O74807 (Inner kinetochore subunit mhf1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=mhf1 PE=3 SV=2)

HSP 1 Score: 48.5 bits (114), Expect = 7.0e-05
Identity = 32/106 (30.19%), Postives = 54/106 (50.94%), Query Frame = 0

Query: 17  DRFRLSTISIAEAEAKRSGMEISE-PVMICVADLAFKYTNNGMITCVAEQLAKDLELFVQ 76
           +RF+     + +    R+  E++E      + D  F      M+      LAKD+E F +
Sbjct: 5   ERFKAEIFHVTQEVCNRTASELTESESRNVIVDELFCVGVTEMVWEQIRVLAKDIEAFAE 64

Query: 77  HAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRKK 122
           HAGRK+V  +DV+L   RNE L  I+    N+   +  +S++K+K+
Sbjct: 65  HAGRKTVQPQDVLLCCRRNEGLYEII----NNFHKESIKSKKKKKE 106

BLAST of Cla97C03G055200 vs. ExPASy TrEMBL
Match: A0A0A0LR97 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G051770 PE=3 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 1.7e-54
Identity = 119/138 (86.23%), Postives = 124/138 (89.86%), Query Frame = 0

Query: 1   METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMIT 60
           METGMEEDDSASELLRDRFRLS+ISIAEAEA +SGMEISEPVM CVADLAFKYT      
Sbjct: 1   METGMEEDDSASELLRDRFRLSSISIAEAEANKSGMEISEPVMTCVADLAFKYT------ 60

Query: 61  CVAEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRK 120
              +QLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLKTKEPQSERKRK
Sbjct: 61  ---KQLAKDLELFAQHAGRKSVNTEDVILTAHRNEHLAAILTSICNDLKTKEPQSERKRK 120

Query: 121 KAQKKEDRDRGAVHITDA 139
           KA KK+DRDRGAVHI DA
Sbjct: 121 KAPKKDDRDRGAVHIADA 129

BLAST of Cla97C03G055200 vs. ExPASy TrEMBL
Match: A0A5A7UM53 (Centromere protein S isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold80G002010 PE=3 SV=1)

HSP 1 Score: 217.2 bits (552), Expect = 4.2e-53
Identity = 117/138 (84.78%), Postives = 122/138 (88.41%), Query Frame = 0

Query: 1   METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMIT 60
           MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVM CVADLAFK+T      
Sbjct: 1   METRMEEDDSASELLRDRFRLSTISIAEAEANKSGMEISEPVMTCVADLAFKFT------ 60

Query: 61  CVAEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRK 120
              EQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK KEPQSERKRK
Sbjct: 61  --TEQLAKDLELFAQHAGRKSVNTEDVILTAHRNEHLAAILTSICNDLKAKEPQSERKRK 120

Query: 121 KAQKKEDRDRGAVHITDA 139
           KA KK+DRDRGAVHI +A
Sbjct: 121 KAPKKDDRDRGAVHIAEA 130

BLAST of Cla97C03G055200 vs. ExPASy TrEMBL
Match: A0A1S3B159 (centromere protein S isoform X1 OS=Cucumis melo OX=3656 GN=LOC103484694 PE=3 SV=1)

HSP 1 Score: 217.2 bits (552), Expect = 4.2e-53
Identity = 117/138 (84.78%), Postives = 122/138 (88.41%), Query Frame = 0

Query: 1   METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMIT 60
           MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVM CVADLAFK+T      
Sbjct: 1   METRMEEDDSASELLRDRFRLSTISIAEAEANKSGMEISEPVMTCVADLAFKFT------ 60

Query: 61  CVAEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRK 120
              EQLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK KEPQSERKRK
Sbjct: 61  --TEQLAKDLELFAQHAGRKSVNTEDVILTAHRNEHLAAILTSICNDLKAKEPQSERKRK 120

Query: 121 KAQKKEDRDRGAVHITDA 139
           KA KK+DRDRGAVHI +A
Sbjct: 121 KAPKKDDRDRGAVHIAEA 130

BLAST of Cla97C03G055200 vs. ExPASy TrEMBL
Match: A0A5D3BHQ8 (Centromere protein S isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G004950 PE=3 SV=1)

HSP 1 Score: 215.3 bits (547), Expect = 1.6e-52
Identity = 116/138 (84.06%), Postives = 122/138 (88.41%), Query Frame = 0

Query: 1   METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMIT 60
           MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVM CVADLAFK+T      
Sbjct: 1   METRMEEDDSASELLRDRFRLSTISIAEAEANKSGMEISEPVMTCVADLAFKFT------ 60

Query: 61  CVAEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRK 120
              +QLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK KEPQSERKRK
Sbjct: 61  ---KQLAKDLELFAQHAGRKSVNTEDVILTAHRNEHLAAILTSICNDLKAKEPQSERKRK 120

Query: 121 KAQKKEDRDRGAVHITDA 139
           KA KK+DRDRGAVHI +A
Sbjct: 121 KAPKKDDRDRGAVHIAEA 129

BLAST of Cla97C03G055200 vs. ExPASy TrEMBL
Match: A0A1S3B173 (centromere protein S isoform X2 OS=Cucumis melo OX=3656 GN=LOC103484694 PE=3 SV=1)

HSP 1 Score: 215.3 bits (547), Expect = 1.6e-52
Identity = 116/138 (84.06%), Postives = 122/138 (88.41%), Query Frame = 0

Query: 1   METGMEEDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMIT 60
           MET MEEDDSASELLRDRFRLSTISIAEAEA +SGMEISEPVM CVADLAFK+T      
Sbjct: 1   METRMEEDDSASELLRDRFRLSTISIAEAEANKSGMEISEPVMTCVADLAFKFT------ 60

Query: 61  CVAEQLAKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRK 120
              +QLAKDLELF QHAGRKSVNTEDVIL+AHRNEHLAAILTSICNDLK KEPQSERKRK
Sbjct: 61  ---KQLAKDLELFAQHAGRKSVNTEDVILTAHRNEHLAAILTSICNDLKAKEPQSERKRK 120

Query: 121 KAQKKEDRDRGAVHITDA 139
           KA KK+DRDRGAVHI +A
Sbjct: 121 KAPKKDDRDRGAVHIAEA 129

BLAST of Cla97C03G055200 vs. TAIR 10
Match: AT5G50930.1 (Histone superfamily protein )

HSP 1 Score: 149.8 bits (377), Expect = 1.6e-36
Identity = 86/133 (64.66%), Postives = 101/133 (75.94%), Query Frame = 0

Query: 7   EDDSASELLRDRFRLSTISIAEAEAKRSGMEISEPVMICVADLAFKYTNNGMITCVAEQL 66
           E+ S  +L+RDRFRLS ISIAEAEAK++GMEI  PV+ CVADLAFKY         AE +
Sbjct: 116 EEYSMDDLIRDRFRLSAISIAEAEAKKNGMEIGGPVVACVADLAFKY---------AENV 175

Query: 67  AKDLELFVQHAGRKSVNTEDVILSAHRNEHLAAILTSICNDLKTKEPQSERKRKK-AQKK 126
           AKDLELF  HAGRK VN +DV+LSAHRN++LAA L S+CN+LK KEPQSERKRKK + KK
Sbjct: 176 AKDLELFAHHAGRKVVNMDDVVLSAHRNDNLAASLRSLCNELKAKEPQSERKRKKGSAKK 235

Query: 127 EDR--DRGAVHIT 137
           ED+     AV IT
Sbjct: 236 EDKASSSNAVRIT 239

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011652290.19.2e-5586.96protein MHF1 homolog isoform X1 [Cucumis sativus] >XP_011652296.1 protein MHF1 h... [more]
XP_038894698.11.6e-5487.68protein MHF1 homolog isoform X1 [Benincasa hispida][more]
XP_004154006.13.5e-5486.23protein MHF1 homolog isoform X2 [Cucumis sativus] >XP_004154007.1 protein MHF1 h... [more]
XP_038894699.16.0e-5486.96protein MHF1 homolog isoform X2 [Benincasa hispida][more]
XP_008440142.18.6e-5384.78PREDICTED: centromere protein S isoform X1 [Cucumis melo] >KAA0055356.1 centrome... [more]
Match NameE-valueIdentityDescription
Q9FI552.2e-3564.66Protein MHF1 homolog OS=Arabidopsis thaliana OX=3702 GN=MHF1 PE=3 SV=1[more]
Q2TBR71.7e-0631.86Centromere protein S OS=Bos taurus OX=9913 GN=CENPS PE=2 SV=1[more]
Q6NRI85.3e-0525.62Centromere protein S OS=Xenopus laevis OX=8355 GN=cenps PE=2 SV=1[more]
Q8N2Z97.0e-0528.24Centromere protein S OS=Homo sapiens OX=9606 GN=CENPS PE=1 SV=1[more]
O748077.0e-0530.19Inner kinetochore subunit mhf1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 2... [more]
Match NameE-valueIdentityDescription
A0A0A0LR971.7e-5486.23Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G051770 PE=3 SV=1[more]
A0A5A7UM534.2e-5384.78Centromere protein S isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_... [more]
A0A1S3B1594.2e-5384.78centromere protein S isoform X1 OS=Cucumis melo OX=3656 GN=LOC103484694 PE=3 SV=... [more]
A0A5D3BHQ81.6e-5284.06Centromere protein S isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_... [more]
A0A1S3B1731.6e-5284.06centromere protein S isoform X2 OS=Cucumis melo OX=3656 GN=LOC103484694 PE=3 SV=... [more]
Match NameE-valueIdentityDescription
AT5G50930.11.6e-3664.66Histone superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR009072Histone-foldGENE3D1.10.20.10Histone, subunit Acoord: 1..125
e-value: 1.9E-26
score: 94.5
IPR009072Histone-foldSUPERFAMILY47113Histone-foldcoord: 59..97
IPR029003CENP-S/Mhf1PFAMPF15630CENP-Scoord: 15..97
e-value: 8.7E-14
score: 51.7
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 110..138
NoneNo IPR availablePANTHERPTHR22980CORTISTATINcoord: 5..129
NoneNo IPR availablePANTHERPTHR22980:SF0CENTROMERE PROTEIN Scoord: 5..129

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C03G055200.2Cla97C03G055200.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0007129 homologous chromosome pairing at meiosis
biological_process GO:0036297 interstrand cross-link repair
biological_process GO:0006312 mitotic recombination
biological_process GO:0031297 replication fork processing
biological_process GO:0000712 resolution of meiotic recombination intermediates
cellular_component GO:0071821 FANCM-MHF complex
cellular_component GO:0043240 Fanconi anaemia nuclear complex
molecular_function GO:0003682 chromatin binding
molecular_function GO:0046982 protein heterodimerization activity