CSPI06G21050 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI06G21050
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: centromere protein V-like
Location: Chr6: 19105862 .. 19112411 (-)
RNA-Seq Expression: CSPI06G21050
Synteny: CSPI06G21050
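
The Location field can be split into chromosome, start, end, and strand, and the genomic span follows from the two coordinates. The snippet below is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this kind of record, but not stated on the page); the function name `parse_location` is hypothetical.

```python
import re

# Pattern for strings such as "Chr6: 19105862 .. 19112411 (-)".
LOCATION_RE = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Return (chromosome, start, end, strand) from a Location string."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


chrom, start, end, strand = parse_location("Chr6: 19105862 .. 19112411 (-)")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 6550 bp on the minus strand of Chr6.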
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTTCATCGGTCGGCGCCGGTTCCCTGATTCACGGTTACGAACGCCCGTCGGCCACTGAGGTTGAGGTTCGTTTGCTCTTCTTTCCTTCATCTTCGACTTTTTCTCAATCCAAAATTTGTTTCTGCAGTCCTCCTCCTCCTCCTCTCCCATCTCATTTTTCGTTTTGTTCTCTATTTCTCATGGATCCAAACACCCCTTGGATTTCTAGTAATTTGATTACATTCCAGCTTGAGATATTCTCTTTTTCAAAATCTTTGATATATTGATATGTGATTCAGTTTATATTTGTTAAATTCTGTTTAAGCTGGCTTTTGAATTACAACTTGTTGTTTTAAGCTCCATCCAAGTTGATTTAAACCAATTGTATACCTATTACCTAGGTACTCATCAAACTTGTGTAACTTCTATTTCATCGAAACTTCTTTCTTGCTATGAAGTTAGACTATATTTCTAACTTTTACTTTGAAGAAACCGTTATGAGGAAGTGATGATGGAATGTTCTTCCCCATTCACCCACTATAACTTTTAGAATCTCCCTACCTTGTTTTGTAAACTTCATACATCAATAAAGTTTGTTTCTTATACAAGATAATACACACACACACCTTCTCAAGTTGTAGCCAAACCAAAAAATGGGAGGGTACCGCATTGGGCAGAAGGCGATCCAAAAGTCAAAGGCTCACACTTATTATTAATATTCACTCTAAAAGAAGGAAGGATCTTTGGCATAAAGTACCTTCTTATTAGAGAAAGGGCAACTTAATAATAAGAAAAGAGGGGGTGAGGTAGGCGACAGGTAGCTTGCTTCCAAGCTGGCCCATTAGGATTCTATTAGTATGAAGAGCCGAAAACATTACCACCATCAACGGTTGATGATATTAAAACCCAGACCTGTTACTAATGTACTGACCCCCCAACACCAGCCAACTATAAGAGCAGATTGGCGAACTGGGTGATCTTTTGCCTAAACCTCATAGGAAGAAGACAGCATTTTAGGACTCAGTCCGTTGTTCTAGTGATAGGGACTGTCTTAAGTTGGTTTGGGTTAAAAGCTTGTTAGAATACTGATTTCAATACATTCCTTCCTTCTATTTTCACTGATTTCCAGTGAAGGTTCCTTTTTTCATTATCTTCTTGGTTACTAAACTTACTAGCATTTTCCCTCCCTTTTTCTTTAAGTCATTTATATGTGAGGTTCTACAACATGTTAATAATTGTACTGGGTTAATCTTACAGACCCTAAGCTAAGTTCAAAAAACTTCTACTTCAAGGCTTAGTTCAAAAGAGATCTTGAAACTCCAGTTGAGGTATTCCTATCAAGGTCATGGGACTAAAATTTGAGATGTGAAAGAAAGTTACTTATATTGTTGGAGTAGCAATGAGGTTACTTGTAGAAATCGAAGGTTACCTTGAGAAAAGATCATGAGGAGCATATCATTCAATTGACGTTCTTTCCTTTTCTCATACGATATTGTAGTAACATGATTGCCACAACCCACTTTAAATTTGTTTTGAGTCTTTCTTTTACGAATACCAGTTAAGGTGGTTTGTAGGTCTCTGAAGAAGAGTTTTTAATAGAGGAATATAAGAGTGTGGATCTACTTTTTTCTAAGTTCAATCTCTAGTTAATTTTGGCCTCGATGAAGATAGATTAATAGAACTGTTTGTTGTAAATTGGTTCCTTTGAAACTCAATTCTAATCATTTCAAACTTGCATTCTACTTTTGCCAAGCTATTCTAGGGCAGTAAGTGCTCTTAAATGGTCAAGTTTCTGTTATGATCCATAGCCTACACGAAGTTATACTCAATTTTGACCTAGATTTTTTATTATGTGGAAAGGATGTGTAAAATATGAATATTCTATTCGTTTAATTTAATTTTGCCACTAATTTTTGGATTTTCAATCGTGATCTCCGTTTTCAAAGACATGGAAAACTGGCTTCTAATAGTGCTTTTGATCTCCACTCAAAAGTTAGATTTTGATGAAAAGGGTGATACTA
GGATCTTTCTTTCATCTCTATGATTAGAAAGAAACAATAGGATCTTCAATCATGTCTGAACTGTTGGCTAATTGCACATATATTGACCTCAAATTAGAATGTCATCCAAACGGATTTCAGTAATTATGACAGATTTTTTATTTACTCAGACTGGAAGCGTTCTTATTTCCCTTTTAAATGATAAGGTCCTCTTCCCTTTGATGTTTTGGCTGTATTCTCAATTGTTATCTCTTGTCAAAAAGTAAATAATTGAAAAATAGAAAGATAATATGATATGATTCATAATTGGATCAATTTGAAACATGGACCAACTTTCTTTTCTGTTTTAGTTTACTAGGTCTATACCTTTTACTCAGGAAAACGAATAATCATTCACTAAAAGGATTGGAAGGTTTGATTCCTTTTCAATGGCTTCTGAGTTGGTTGTACATCATGGTGGATGTCACTGCAAGAAAGTAAGATGGCGAGTTGAAGCACCTGCCAGTGTTGTTGCTTGGGATTGCAACTGTTCCAACTGCTTCATGAGGGCCAATACACATTTTATTGTACCTTTGGAACGGTTCAAGCTTTTAGGAGATTCTAGCAACTTTGTTTCTACCTACACATTTGGTTCTCACACTGCAAAACATACCTTCTGCAAAAACTGCGGCATTACCTCATTTTACCATCCACGCTCAAATCCAGATGGAGTTGCTATTACTTTCAAATGTGTTGATCCTGGAACTTTGACCCATATTGAGGTTAGGCAGTTTGATGGGAGCAACTGGGAGGCCTCTTATGATCAAACAGGCATTGCTTCATTGTCAAAGTTGAACATAAGTTAGTATGCATTTTGACAACTGCTGTAAACTAAATTTATGACCGTTGATGATTGATACAAACTTTTTTTATATGGCTTCCATCAACGATTGATATGGCTGCTTGAAAAGGTTTGTTGCTCGGACAACCCCTTCACGTGCAAACTTCCCCTTGCCGCTCGCTCTTTCCATTGTCGATCAGGATTTCCCATTACTGATAAAAATAGGTGCTACGTCTGGTCGGTAGTGTTTGGGTCGAGCAAGGTAGGAAAGGAGGTGATGTTTTCTCTATTGTTTTCCAATGCCGATGATGAGATCTCCATTTTCATAGTTTTTCTTATTCATGATCATACTTTATTCTACCAAATATGTTTTTTTAATTAGTTGTGAATCTGTATTTTGCTATTTATGTTCATGTTTGATGCTGCTGTCTATATCCAATTTGCCTACCCATAGCTTATGCCATTACTCCTACTGTGTGTATATCGTGAATACTTATGTACTTTCCATTGCAATATGCTTCTGTATATTTTTTTTACCTAAGCTTCTGATGCTTCTGTCCCCCCCCCCCCCCCCCCCCCTTTTTTTGTAAGTACACATCATGTGCATTATGTGGTAGGCATGTTGCATTTGAACTTTGACCCACTTTTCTTTACCCATTTGTTGTATGTCGTCTTTCCCATTGTCATCAGTCCATGCTCAGGTTTGCTTGTTTAGGGGTCTTCAATGTCATTACTTCTTCCATTTCCCATCTATATATACTTGCTATGCCATTGCATAGTAACTCTTGGTTCTCCCTTTGTTTGGTAATGTATATGCATCCATCTGTTTGTGTTCCATTGTGTGTTTGGTCTTCACAGTCTACCTATGCACTTGTCATTGATTGATGTTGCGATCCATGCTATGCCACTTGTTGTTTATTTCCATGCCTTTCATGTCACCACTTGCATACTACGACCCGGTTCCATTTTTGTTGTCCATTCACATGTCCTTCATATCTCCACATTTGAGGTTTGTGTTCCATTGTTGTTGTTCATTTCCATGATTTTCATATCACCACATATTGAGGCTCGTGTTCCATTGTTGCTGTTCATTTTCCATGCCTTTCATATCACCACATTTGATACTGTTACATTTTTGTAGCTTTTTTTCCATTTTCTTTGTTCATGTGCACGCCTTTCATATCTTCACATTTCCTAGCAT
TGCATATGTGAAGACATTTGTTTCACTCCATGAAGTTCAATTTCAAATTATTGAACAAATTTCCTTCCTCTAAGGCACTCAAAATTTTCACATTGTTCACGTTTTTTTTTTGACGTAATTGTTGATTGTTTGTTCATTTGTCCTTCGTATGTCTTTGGGAAAGATCGCACAACGGGCGCACGTGCAGAGACATTTGATGTAGAACATCCTCTTTTCTTGATGTTTCTTTAAGTTCTTTAGTATTTCTTGAGCTAATTCTTGAGTATCTTGAATTGTCATTCTTGTATTTTAGGATTAGTCATCTTTAGTTAAGGCTTGTGTATCTTGAGTTAGTTATAAATAAACTAACTCAACTCCAAGTTAGTTAAGTTCTAAAATAACTAACTTCTAACAACTCTTGTAAACTTCCCGCCTATAAAAAGGCTTTTCCTCTCATTAATAAATCAATCATCTGAAGTATATGGAAACAACTTACTTGAACATTAATAGACGATTCTGATTCTTCTGATGAAGAGAACTTGTTGAACATTCATCAAGAACCACAACAAAGGTTCCCTATCCCTCGTTTATTTGACCTACTTGACCAACTAGGTAAGGCTTCCATCTTTTCCAAAGTAGATCTTAAGAGTGGCTACCACTAAATTAGAATTAAACCAGGATGAGTGGAAAACAACCTTCAAAACTAATGAAGGATTGTTTGAATAGTTAGTTATGCCTTTTGGTCTCTTAAATGCCCTTAGCACCTTTGTGAGATTAATGAATCAAATCCTACAAACTTTTTTGAATCAATTCATTGTGGTATACTTTGATGACATATTCATCTATAGCTCCTCTAAGGAAGATCACCTAAAACACATTCAGTTACTTTTTACAACCTTACAAGAAAATGAATTACAAATTAACCTTAAAAAAATGTGAATTTTTGTGCTATAGCATTCATTTTCTTGGATTTATCATAAGTTTTAATGACATTTTTGTTGATCCTAAAAAGATTCACTCTATTAGTTATTGGCCACAACCAAAGACTCCAAAAGATATTCAATGTTTTTTTAGGTTTAGCCACTTTTTATAGAAGGTTCATAAAGAACTTTAGCACTATTGTAGCACCTTTAACAAACTGTTTAAAGAAAGGTAATTTTCAATGAGGACAAATTGAAGAAGACAACTTCTATCAATTGAAATTAGCTTTAACTAGCCCCTTGTTTTAAAACTATCGAACTTTGGTAAACCATTCACAGTAGATGCTTCAGGTCTAGGAATAGGAGTTGTCCTTAGTCAAGAAGGTCACCCTTTAGAGTATTTTAGTGAAAAATTAAGTACTTTTAGACAAAATTTGAGTACCTATGAACATGAACTATATGTCCTTGTGAGTGCTTTAAAACAATAGGAATATTATCTACTTGCTAATAATTTTATCTTGCTTACTGACCTCTTTTCTCTTAAATTTCTGAATTCCCAAAAAATAATTAGTAGAATGTATGCTCGATGACTGCAATTCTTGCAAAGATTTGATTTTGTGATCAAAATACTTCGGGTAAGATTAATAAGGCTACTAATGCCCTCAGTCGAAAAGGTATACTTCTCACCACCCTTCAATCTCAAATCATTGCCTTTGATCACATACCTACATTGTACCCTTCGGACACTAATTTTAAAAGTATTTGGAAAGCTTGCTCTAACCATAAACCTTGCAAGACTACCTGCATGTCATCATTCCAAAATGACCAACAAGAGAATGGGTCAATTTCAAACATTAGAAAGACTCGGCTCCAACGCTTACCGAGTAGATCTTCCAGCAAACATGAGAATCAACCCATCTTTAAATATTGTTTACTTGTCACCTTACTATGCACCAGATTCCTTCTCTTTAGCTCCCTAATTCTCACTCGAGGTCGAGTTTTATCCTAGGGGAGGGGTTTGATGTTGAACATCCTCATTTCTTGATGTTTCTTTGAGTTCTTTAGTATTTCTTTAGCTAATTCTTGAGTATCTTGAATTAC
GATTCTTGTATTTTAGGATTAGTTATCTTTAGTAAGGCGTGTATCTTGAGTTACTAATCTTGTATCTTGAGTTAGTTTAACTTGGGGTTGTTCTAAAATAACTAACTTCTAACAATTCTTGTAAACTTTCCGTCTATAAAAGGGCTTCTCCTCTCATTAATAAATCACTAATAAATCATCATCTGAAGTATTATTCATAATATTTGCTAAACTTTGCATTACTACATCACATTAGCTGATGTCGGGTCCAATGTGTCGGTTGAAAACGAGGGATTTCATCCGAAAGATGGGATCGATAAGGAGTTCCTTACTATGTATAGTAAGAGAATGAACATGTCGCCTGAAGACATGATGGACGCATGACCTGGTAGGTCCAGCGACTGTAGGTCGAGTTTCAATGGATTGGAGAGGAAGCGCGGAGGGTAGACTGTCGAGACTGTGGAGGTGGATTGTGCGAATGAACAGCTGAGGGCCATTGCAGACTGGTTGAAGATAACTAGCCAAGAGGAAGACGTTGTGCGGGG

mRNA sequence

CTTCATCGGTCGGCGCCGGTTCCCTGATTCACGGTTACGAACGCCCGTCGGCCACTGAGGTTGAGGAAAACGAATAATCATTCACTAAAAGGATTGGAAGGTTTGATTCCTTTTCAATGGCTTCTGAGTTGGTTGTACATCATGGTGGATGTCACTGCAAGAAAGTAAGATGGCGAGTTGAAGCACCTGCCAGTGTTGTTGCTTGGGATTGCAACTGTTCCAACTGCTTCATGAGGGCCAATACACATTTTATTGTACCTTTGGAACGGTTCAAGCTTTTAGGAGATTCTAGCAACTTTGTTTCTACCTACACATTTGGTTCTCACACTGCAAAACATACCTTCTGCAAAAACTGCGGCATTACCTCATTTTACCATCCACGCTCAAATCCAGATGGAGTTGCTATTACTTTCAAATGTGTTGATCCTGGAACTTTGACCCATATTGAGGTTAGGCAGTTTGATGGGAGCAACTGGGAGGCCTCTTATGATCAAACAGGCATTGCTTCATTGTCAAAGTTGAACATAAGTTAGTATGCATTTTGACAACTGCTGTAAACTAAATTTATGACCGTTGATGATTGATACAAACTTTTTTTATATGGCTTCCATCAACGATTGATATGGCTGCTTGAAAAGGTTTGTTGCTCGGACAACCCCTTCACGTGCAAACTTCCCCTTGCCGCTCGCTCTTTCCATTGTCGATCAGGATTTCCCATTACTGATAAAAATAGGTGCTACGTCTGGTCGGTAGTGTTTGGGTCGAGCAAGCTGATGTCGGGTCCAATGTGTCGGTTGAAAACGAGGGATTTCATCCGAAAGATGGGATCGATAAGGAGTTCCTTACTATGTATAGTAAGAGAATGAACATGTCGCCTGAAGACATGATGGACGCATGACCTGGTAGGTCCAGCGACTGTAGGTCGAGTTTCAATGGATTGGAGAGGAAGCGCGGAGGGTAGACTGTCGAGACTGTGGAGGTGGATTGTGCGAATGAACAGCTGAGGGCCATTGCAGACTGGTTGAAGATAACTAGCCAAGAGGAAGACGTTGTGCGGGG

Coding sequence (CDS)

ATGGCTTCTGAGTTGGTTGTACATCATGGTGGATGTCACTGCAAGAAAGTAAGATGGCGAGTTGAAGCACCTGCCAGTGTTGTTGCTTGGGATTGCAACTGTTCCAACTGCTTCATGAGGGCCAATACACATTTTATTGTACCTTTGGAACGGTTCAAGCTTTTAGGAGATTCTAGCAACTTTGTTTCTACCTACACATTTGGTTCTCACACTGCAAAACATACCTTCTGCAAAAACTGCGGCATTACCTCATTTTACCATCCACGCTCAAATCCAGATGGAGTTGCTATTACTTTCAAATGTGTTGATCCTGGAACTTTGACCCATATTGAGGTTAGGCAGTTTGATGGGAGCAACTGGGAGGCCTCTTATGATCAAACAGGCATTGCTTCATTGTCAAAGTTGAACATAAGTTAG

Protein sequence

MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSNFVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNWEASYDQTGIASLSKLNIS*
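
The protein sequence above corresponds to the CDS read in frame with the standard genetic code. The sketch below illustrates this on the first ten codons and the last five codons of the CDS only; it assumes the standard (NCBI table 1) code applies, and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 15 nt of the CDS shown above.
cds_start = "ATGGCTTCTGAGTTGGTTGTACATCATGGT"
cds_end = "TTGAACATAAGTTAG"

print(translate(cds_start))  # MASELVVHHG
print(translate(cds_end))    # LNIS*
```

Passing the full CDS string to `translate` in the same way should reproduce the complete protein sequence, including the terminal `*`.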
Homology
BLAST of CSPI06G21050 vs. ExPASy Swiss-Prot
Match: Q9CXS4 (Centromere protein V OS=Mus musculus OX=10090 GN=Cenpv PE=1 SV=2)

HSP 1 Score: 142.5 bits (358), Expect = 3.6e-33
Identity = 66/131 (50.38%), Positives = 86/131 (65.65%), Query Frame = 0

Query: 5   LVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSNFVST 64
           LV H GGCHC  VR+ V A A +  +DCNCS C  + N HFIVP  RFKLL  + + ++T
Sbjct: 122 LVKHTGGCHCGAVRFEVWASADLHIFDCNCSICKKKQNRHFIVPASRFKLLKGAES-ITT 181

Query: 65  YTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNWE-AS 124
           YTF +H A+HTFCK CG+ SFY PRSNP G  I   C+D GT+  +   +F+GS+WE A 
Sbjct: 182 YTFNTHKAQHTFCKRCGVQSFYTPRSNPGGFGIAPHCLDEGTVRSVVTEEFNGSDWERAM 241

Query: 125 YDQTGIASLSK 135
            +   I ++SK
Sbjct: 242 KEHKTIKNMSK 251
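
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length. The following is a minimal sketch that parses one such header line and recomputes the percentages; the regular expression and the function names `parse_hsp_stats` and `percent` are hypothetical and cover only the line format shown in this record.

```python
import re

# Matches "Identity = 66/131 (50.38%)" and the corresponding "Positives" field.
# The label pattern also tolerates the misspelling "Postives".
STAT_RE = re.compile(r"(Identity|Pos\w*tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {'identity': (matches, length, pct), 'positives': (...)}."""
    stats = {}
    for label, num, den, pct in STAT_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(num), int(den), float(pct))
    return stats


def percent(matches, length):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


line = "Identity = 66/131 (50.38%), Positives = 86/131 (65.65%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["identity"], percent(66, 131))
print(stats["positives"], percent(86, 131))
```

For the mouse CENP-V hit, 66 identical columns out of 131 aligned columns gives 50.38%, which agrees with the reported value.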

BLAST of CSPI06G21050 vs. ExPASy Swiss-Prot
Match: Q7Z7K6 (Centromere protein V OS=Homo sapiens OX=9606 GN=CENPV PE=1 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 8.0e-33
Identity = 66/131 (50.38%), Positives = 85/131 (64.89%), Query Frame = 0

Query: 5   LVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSNFVST 64
           LV H GGCHC  VR+ V A A +  +DCNCS C  + N HFIVP  RFKLL   +  ++T
Sbjct: 145 LVKHTGGCHCGAVRFEVWASADLHIFDCNCSICKKKQNRHFIVPASRFKLL-KGAEHITT 204

Query: 65  YTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNWE-AS 124
           YTF +H A+HTFCK CG+ SFY PRSNP G  I   C+D GT+  +   +F+GS+WE A 
Sbjct: 205 YTFNTHKAQHTFCKRCGVQSFYTPRSNPGGFGIAPHCLDEGTVRSMVTEEFNGSDWEKAM 264

Query: 125 YDQTGIASLSK 135
            +   I ++SK
Sbjct: 265 KEHKTIKNMSK 274

BLAST of CSPI06G21050 vs. ExPASy Swiss-Prot
Match: A0A0U1RR11 (Centromere protein V-like protein 1 OS=Homo sapiens OX=9606 GN=CENPVL1 PE=2 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 7.7e-20
Identity = 45/115 (39.13%), Positives = 65/115 (56.52%), Query Frame = 0

Query: 5   LVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSNFVST 64
           LV H GGCHC  VR+ V APA +   DC+C  C  + + HF+VP  RF LL  + + V T
Sbjct: 130 LVHHTGGCHCGAVRFAVWAPADLRVVDCSCRLCRKKQHRHFLVPASRFTLLQGAESIV-T 189

Query: 65  YTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSN 120
           Y   +H A H+FC  CG+ SF+   S+P    +   C+D GT+  + + +  G +
Sbjct: 190 YRSNTHPALHSFCSRCGVQSFHAAVSDPRVYGVAPHCLDEGTVRSVVIEEVGGGD 243

BLAST of CSPI06G21050 vs. ExPASy Swiss-Prot
Match: P0DPI3 (Centromere protein V-like protein 2 OS=Homo sapiens OX=9606 GN=CENPVL2 PE=3 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 7.7e-20
Identity = 45/115 (39.13%), Positives = 65/115 (56.52%), Query Frame = 0

Query: 5   LVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSNFVST 64
           LV H GGCHC  VR+ V APA +   DC+C  C  + + HF+VP  RF LL  + + V T
Sbjct: 130 LVHHTGGCHCGAVRFAVWAPADLRVVDCSCRLCRKKQHRHFLVPASRFTLLQGAESIV-T 189

Query: 65  YTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSN 120
           Y   +H A H+FC  CG+ SF+   S+P    +   C+D GT+  + + +  G +
Sbjct: 190 YRSNTHPALHSFCSRCGVQSFHAAVSDPRVYGVAPHCLDEGTVRSVVIEEVGGGD 243

BLAST of CSPI06G21050 vs. ExPASy Swiss-Prot
Match: A0A0U1RRI6 (Centromere protein V-like protein 3 OS=Homo sapiens OX=9606 GN=CENPVL3 PE=3 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 7.7e-20
Identity = 45/115 (39.13%), Positives = 65/115 (56.52%), Query Frame = 0

Query: 5   LVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSNFVST 64
           LV H GGCHC  VR+ V APA +   DC+C  C  + + HF+VP  RF LL  + + V T
Sbjct: 145 LVHHTGGCHCGAVRFAVWAPADLRVVDCSCRLCRKKQHRHFLVPASRFTLLQGAESIV-T 204

Query: 65  YTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSN 120
           Y   +H A H+FC  CG+ SF+   S+P    +   C+D GT+  + + +  G +
Sbjct: 205 YRSNTHPALHSFCSRCGVQSFHAAVSDPRVYGVAPHCLDEGTVRSVVIEEVGGGD 258

BLAST of CSPI06G21050 vs. ExPASy TrEMBL
Match: A0A0A0KE05 (CENP-V/GFA domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G403060 PE=3 SV=1)

HSP 1 Score: 301.2 bits (770), Expect = 2.2e-78
Identity = 137/138 (99.28%), Positives = 137/138 (99.28%), Query Frame = 0

Query: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60
           MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN
Sbjct: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60

Query: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120
           FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW
Sbjct: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120

Query: 121 EASYDQTGIASLSKLNIS 139
           EAS DQTGIASLSKLNIS
Sbjct: 121 EASCDQTGIASLSKLNIS 138

BLAST of CSPI06G21050 vs. ExPASy TrEMBL
Match: A0A5D3CJK9 (Centromere protein V isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold807G00190 PE=3 SV=1)

HSP 1 Score: 293.1 bits (749), Expect = 6.0e-76
Identity = 132/138 (95.65%), Positives = 134/138 (97.10%), Query Frame = 0

Query: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60
           MASELVVH+GGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN
Sbjct: 1   MASELVVHNGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60

Query: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120
           FVSTYTFGSHTAKHTFCKNCGITSFY PRSNPDGVAITFKCVDPGTLTH+EVRQFDGSNW
Sbjct: 61  FVSTYTFGSHTAKHTFCKNCGITSFYRPRSNPDGVAITFKCVDPGTLTHVEVRQFDGSNW 120

Query: 121 EASYDQTGIASLSKLNIS 139
           EASYD TGIAS SKLN S
Sbjct: 121 EASYDHTGIASFSKLNTS 138

BLAST of CSPI06G21050 vs. ExPASy TrEMBL
Match: A0A1S4E4W3 (centromere protein V isoform X6 OS=Cucumis melo OX=3656 GN=LOC103502595 PE=3 SV=1)

HSP 1 Score: 293.1 bits (749), Expect = 6.0e-76
Identity = 132/138 (95.65%), Positives = 134/138 (97.10%), Query Frame = 0

Query: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60
           MASELVVH+GGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN
Sbjct: 1   MASELVVHNGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60

Query: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120
           FVSTYTFGSHTAKHTFCKNCGITSFY PRSNPDGVAITFKCVDPGTLTH+EVRQFDGSNW
Sbjct: 61  FVSTYTFGSHTAKHTFCKNCGITSFYRPRSNPDGVAITFKCVDPGTLTHVEVRQFDGSNW 120

Query: 121 EASYDQTGIASLSKLNIS 139
           EASYD TGIAS SKLN S
Sbjct: 121 EASYDHTGIASFSKLNTS 138

BLAST of CSPI06G21050 vs. ExPASy TrEMBL
Match: A0A1S4E5L6 (centromere protein V isoform X1 OS=Cucumis melo OX=3656 GN=LOC103502595 PE=3 SV=1)

HSP 1 Score: 293.1 bits (749), Expect = 6.0e-76
Identity = 132/138 (95.65%), Positives = 134/138 (97.10%), Query Frame = 0

Query: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60
           MASELVVH+GGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN
Sbjct: 1   MASELVVHNGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60

Query: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120
           FVSTYTFGSHTAKHTFCKNCGITSFY PRSNPDGVAITFKCVDPGTLTH+EVRQFDGSNW
Sbjct: 61  FVSTYTFGSHTAKHTFCKNCGITSFYRPRSNPDGVAITFKCVDPGTLTHVEVRQFDGSNW 120

Query: 121 EASYDQTGIASLSKLNIS 139
           EASYD TGIAS SKLN S
Sbjct: 121 EASYDHTGIASFSKLNTS 138

BLAST of CSPI06G21050 vs. ExPASy TrEMBL
Match: A0A1S3CMD8 (centromere protein V isoform X2 OS=Cucumis melo OX=3656 GN=LOC103502595 PE=3 SV=1)

HSP 1 Score: 292.0 bits (746), Expect = 1.3e-75
Identity = 131/136 (96.32%), Positives = 133/136 (97.79%), Query Frame = 0

Query: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60
           MASELVVH+GGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN
Sbjct: 1   MASELVVHNGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60

Query: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120
           FVSTYTFGSHTAKHTFCKNCGITSFY PRSNPDGVAITFKCVDPGTLTH+EVRQFDGSNW
Sbjct: 61  FVSTYTFGSHTAKHTFCKNCGITSFYRPRSNPDGVAITFKCVDPGTLTHVEVRQFDGSNW 120

Query: 121 EASYDQTGIASLSKLN 137
           EASYD TGIAS SKLN
Sbjct: 121 EASYDHTGIASFSKLN 136

BLAST of CSPI06G21050 vs. NCBI nr
Match: XP_004146757.2 (centromere protein V [Cucumis sativus] >XP_004146758.2 centromere protein V [Cucumis sativus] >XP_031743695.1 centromere protein V [Cucumis sativus] >KAE8647287.1 hypothetical protein Csa_003537 [Cucumis sativus])

HSP 1 Score: 301.2 bits (770), Expect = 4.6e-78
Identity = 137/138 (99.28%), Positives = 137/138 (99.28%), Query Frame = 0

Query: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60
           MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN
Sbjct: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60

Query: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120
           FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW
Sbjct: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120

Query: 121 EASYDQTGIASLSKLNIS 139
           EAS DQTGIASLSKLNIS
Sbjct: 121 EASCDQTGIASLSKLNIS 138

BLAST of CSPI06G21050 vs. NCBI nr
Match: XP_016903271.1 (PREDICTED: centromere protein V isoform X6 [Cucumis melo])

HSP 1 Score: 293.1 bits (749), Expect = 1.2e-75
Identity = 132/138 (95.65%), Positives = 134/138 (97.10%), Query Frame = 0

Query: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60
           MASELVVH+GGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN
Sbjct: 1   MASELVVHNGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60

Query: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120
           FVSTYTFGSHTAKHTFCKNCGITSFY PRSNPDGVAITFKCVDPGTLTH+EVRQFDGSNW
Sbjct: 61  FVSTYTFGSHTAKHTFCKNCGITSFYRPRSNPDGVAITFKCVDPGTLTHVEVRQFDGSNW 120

Query: 121 EASYDQTGIASLSKLNIS 139
           EASYD TGIAS SKLN S
Sbjct: 121 EASYDHTGIASFSKLNTS 138

BLAST of CSPI06G21050 vs. NCBI nr
Match: XP_016903270.1 (PREDICTED: centromere protein V isoform X1 [Cucumis melo] >KAA0045397.1 centromere protein V isoform X1 [Cucumis melo var. makuwa] >TYK11344.1 centromere protein V isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 293.1 bits (749), Expect = 1.2e-75
Identity = 132/138 (95.65%), Positives = 134/138 (97.10%), Query Frame = 0

Query: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60
           MASELVVH+GGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN
Sbjct: 1   MASELVVHNGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60

Query: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120
           FVSTYTFGSHTAKHTFCKNCGITSFY PRSNPDGVAITFKCVDPGTLTH+EVRQFDGSNW
Sbjct: 61  FVSTYTFGSHTAKHTFCKNCGITSFYRPRSNPDGVAITFKCVDPGTLTHVEVRQFDGSNW 120

Query: 121 EASYDQTGIASLSKLNIS 139
           EASYD TGIAS SKLN S
Sbjct: 121 EASYDHTGIASFSKLNTS 138

BLAST of CSPI06G21050 vs. NCBI nr
Match: XP_008464790.1 (PREDICTED: centromere protein V isoform X2 [Cucumis melo])

HSP 1 Score: 292.0 bits (746), Expect = 2.8e-75
Identity = 131/136 (96.32%), Positives = 133/136 (97.79%), Query Frame = 0

Query: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60
           MASELVVH+GGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN
Sbjct: 1   MASELVVHNGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60

Query: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120
           FVSTYTFGSHTAKHTFCKNCGITSFY PRSNPDGVAITFKCVDPGTLTH+EVRQFDGSNW
Sbjct: 61  FVSTYTFGSHTAKHTFCKNCGITSFYRPRSNPDGVAITFKCVDPGTLTHVEVRQFDGSNW 120

Query: 121 EASYDQTGIASLSKLN 137
           EASYD TGIAS SKLN
Sbjct: 121 EASYDHTGIASFSKLN 136

BLAST of CSPI06G21050 vs. NCBI nr
Match: XP_008464791.1 (PREDICTED: centromere protein V isoform X3 [Cucumis melo])

HSP 1 Score: 292.0 bits (746), Expect = 2.8e-75
Identity = 131/136 (96.32%), Positives = 133/136 (97.79%), Query Frame = 0

Query: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60
           MASELVVH+GGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN
Sbjct: 1   MASELVVHNGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60

Query: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120
           FVSTYTFGSHTAKHTFCKNCGITSFY PRSNPDGVAITFKCVDPGTLTH+EVRQFDGSNW
Sbjct: 61  FVSTYTFGSHTAKHTFCKNCGITSFYRPRSNPDGVAITFKCVDPGTLTHVEVRQFDGSNW 120

Query: 121 EASYDQTGIASLSKLN 137
           EASYD TGIAS SKLN
Sbjct: 121 EASYDHTGIASFSKLN 136

BLAST of CSPI06G21050 vs. TAIR 10
Match: AT5G16940.1 (carbon-sulfur lyases)

HSP 1 Score: 217.6 bits (553), Expect = 6.2e-57
Identity = 92/134 (68.66%), Positives = 108/134 (80.60%), Query Frame = 0

Query: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60
           M SEL+ H GGCHC K++WRV+A  SV+AW CNCS+C MR N HFIVP   F+LL DS +
Sbjct: 1   MESELIFHEGGCHCGKIKWRVKAARSVIAWSCNCSDCSMRGNVHFIVPSSNFELLDDSKD 60

Query: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120
           F++TYTFG+HTAKHTFCK CGITSFY PRSNPDGVA+T KCV  GTL HIEV+ +DG NW
Sbjct: 61  FITTYTFGTHTAKHTFCKVCGITSFYIPRSNPDGVAVTVKCVKSGTLAHIEVKSYDGQNW 120

Query: 121 EASYDQTGIASLSK 135
           E S+ +TGIAS SK
Sbjct: 121 EMSHKKTGIASFSK 134

BLAST of CSPI06G21050 vs. TAIR 10
Match: AT5G16940.2 (carbon-sulfur lyases)

HSP 1 Score: 217.6 bits (553), Expect = 6.2e-57
Identity = 92/134 (68.66%), Positives = 108/134 (80.60%), Query Frame = 0

Query: 1   MASELVVHHGGCHCKKVRWRVEAPASVVAWDCNCSNCFMRANTHFIVPLERFKLLGDSSN 60
           M SEL+ H GGCHC K++WRV+A  SV+AW CNCS+C MR N HFIVP   F+LL DS +
Sbjct: 1   MESELIFHEGGCHCGKIKWRVKAARSVIAWSCNCSDCSMRGNVHFIVPSSNFELLDDSKD 60

Query: 61  FVSTYTFGSHTAKHTFCKNCGITSFYHPRSNPDGVAITFKCVDPGTLTHIEVRQFDGSNW 120
           F++TYTFG+HTAKHTFCK CGITSFY PRSNPDGVA+T KCV  GTL HIEV+ +DG NW
Sbjct: 61  FITTYTFGTHTAKHTFCKVCGITSFYIPRSNPDGVAVTVKCVKSGTLAHIEVKSYDGQNW 120

Query: 121 EASYDQTGIASLSK 135
           E S+ +TGIAS SK
Sbjct: 121 EMSHKKTGIASFSK 134

The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name        E-value    Identity (%)   Description
Q9CXS4            3.6e-33    50.38          Centromere protein V OS=Mus musculus OX=10090 GN=Cenpv PE=1 SV=2
Q7Z7K6            8.0e-33    50.38          Centromere protein V OS=Homo sapiens OX=9606 GN=CENPV PE=1 SV=1
A0A0U1RR11        7.7e-20    39.13          Centromere protein V-like protein 1 OS=Homo sapiens OX=9606 GN=CENPVL1 PE=2 SV=1
P0DPI3            7.7e-20    39.13          Centromere protein V-like protein 2 OS=Homo sapiens OX=9606 GN=CENPVL2 PE=3 SV=1
A0A0U1RRI6        7.7e-20    39.13          Centromere protein V-like protein 3 OS=Homo sapiens OX=9606 GN=CENPVL3 PE=3 SV=1

ExPASy TrEMBL
Match Name        E-value    Identity (%)   Description
A0A0A0KE05        2.2e-78    99.28          CENP-V/GFA domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G403060 ...
A0A5D3CJK9        6.0e-76    95.65          Centromere protein V isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_...
A0A1S4E4W3        6.0e-76    95.65          centromere protein V isoform X6 OS=Cucumis melo OX=3656 GN=LOC103502595 PE=3 SV=...
A0A1S4E5L6        6.0e-76    95.65          centromere protein V isoform X1 OS=Cucumis melo OX=3656 GN=LOC103502595 PE=3 SV=...
A0A1S3CMD8        1.3e-75    96.32          centromere protein V isoform X2 OS=Cucumis melo OX=3656 GN=LOC103502595 PE=3 SV=...

NCBI nr
Match Name        E-value    Identity (%)   Description
XP_004146757.2    4.6e-78    99.28          centromere protein V [Cucumis sativus] >XP_004146758.2 centromere protein V [Cuc...
XP_016903271.1    1.2e-75    95.65          PREDICTED: centromere protein V isoform X6 [Cucumis melo]
XP_016903270.1    1.2e-75    95.65          PREDICTED: centromere protein V isoform X1 [Cucumis melo] >KAA0045397.1 centrome...
XP_008464790.1    2.8e-75    96.32          PREDICTED: centromere protein V isoform X2 [Cucumis melo]
XP_008464791.1    2.8e-75    96.32          PREDICTED: centromere protein V isoform X3 [Cucumis melo]

TAIR 10
Match Name        E-value    Identity (%)   Description
AT5G16940.1       6.2e-57    68.66          carbon-sulfur lyases
AT5G16940.2       6.2e-57    68.66          carbon-sulfur lyases
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term    IPR Description         Source        Source Term     Source Description     Alignment
None        No IPR available        GENE3D        2.170.150.70    -                      coord: 8..124; e-value: 5.4E-25; score: 89.4
None        No IPR available        PANTHER       PTHR28620:SF9   CARBON-SULFUR LYASES   coord: 1..134
None        No IPR available        PANTHER       PTHR28620       CENTROMERE PROTEIN V   coord: 1..134
IPR006913   CENP-V/GFA domain       PFAM          PF04828         GFA                    coord: 31..105; e-value: 1.4E-7; score: 31.7
IPR006913   CENP-V/GFA domain       PROSITE       PS51891         CENP_V_GFA             coord: 8..121; score: 24.590271
IPR011057   Mss4-like superfamily   SUPERFAMILY   51316           Mss4-like              coord: 6..120
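
The `coord` values in the Alignment column give the residue range that each signature covers on the protein. The sketch below derives the length of each range, assuming the coordinates are 1-based and inclusive (an assumption, not stated on the page); `coord_span` is a hypothetical helper.

```python
import re

# Matches the "coord: start..end" fragment of an InterPro alignment entry.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def coord_span(text):
    """Return (start, end, length) for a 'coord: start..end' entry."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError(f"no coordinates found in {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    # Assumption: residue coordinates are 1-based and inclusive.
    return start, end, end - start + 1


# Entries taken from the InterPro table above.
alignments = {
    "PFAM PF04828": "coord: 31..105",
    "PROSITE PS51891": "coord: 8..121",
    "SUPERFAMILY 51316": "coord: 6..120",
}
for name, entry in alignments.items():
    print(name, coord_span(entry))
```

Under that assumption the Pfam GFA match covers 75 residues and the PROSITE CENP_V_GFA profile covers 114 residues.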

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CSPI06G21050.1   CSPI06G21050.1   mRNA
CSPI06G21050.3   CSPI06G21050.3   mRNA
CSPI06G21050.2   CSPI06G21050.2   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
molecular_function   GO:0016846       carbon-sulfur lyase activity
molecular_function   GO:0046872       metal ion binding