HG10023321 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10023321
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: centromere protein C-like isoform X1
Location: Chr05: 33042487 .. 33047969 (+)
RNA-Seq Expression: HG10023321
Synteny: HG10023321
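
As a small illustration, the Location field can be turned into coordinates and a gene span. This is only a sketch: it assumes the `Chr: start .. end (strand)` layout shown above and 1-based inclusive coordinates, and the function name `parse_location` is hypothetical, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr05: 33042487 .. 33047969 (+)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: both ends are inclusive, so the span is end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Chr05: 33042487 .. 33047969 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr05 + 5483
```

Under the inclusive-coordinate assumption the gene spans 5483 bp on the forward strand of Chr05.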
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGACCGAAGAGGCTCGACACTCCGATGTGATCGATCCACTTGCTGCTTATTCTGGTATCAATCTCTTTTCGAACGCATTTCGTACTTTGCGGGATCCGTCAAAGCCACATGATCTTGGAACCGACCTTGACGGCATCCACAAGCACCTCAAATCCATGGTATTTGTATCTTTCGATACTCGTTGTATATGAACCTTCCAATTTACCGGTTTATTTGGTTTATAGTGCCTCCTGCTTGCTCCAAGAGCTCCGTGGGCAATTCAATCGCCAGTGTTCTTTTTTAACTTTTAATTTTTTGCATGTTTTGGAGGATGAATTAGAATTCGTTTCTTCTCCCATTTTGTCAGTAAGTATGCCAGATTGTGACGTGGTTTTCCTTCTTAGTCGTAACGATTTAGCTCAACGGTTGAATTGGAGACTCAGGTTCCTTGTTACTGTGAAATTGTGAGGAAGTAAAATGTTTTTGGCTTTGCATGTAGGGGTTTCTTCTTTCCTTAAAATTAAACGCTCAGCCTACTTAGGTTTATAATGGTAAAAGCTGGTTAGCCCATTTGGTTTGTGAACTGGGATTTGAGATACACCCCGGCATGATGTTTAGTTCGTCGGTTAAAGACGGTGGATTGGTTCTTTAGTATATCAATGCGAGAGTACTTGAAATTACAAGTTAAATAAAATTATACTGGAAATTATGGTGTTAGTGTGGTACAGCCTTGCTTCTCATTTCTCCCATCTAATCTCCATTTTCTCATCATCACGGCTCCTCACGAAGCATTTTGTATCTATTTCTAGTTTCGGTTTAATTGACGATCAACGACTCTCTTCCAAATTTTTCAGGTGTCAAGAAGTCCCAGTAAACTTGTAGAGCAGGCCAGATCAATTTTAGACGGGAACTCAAATTTGATGCAATCTGAAGCTGCCACATTTCTTGTAAAGAATGAGAAAGATGAGGAAGCTACAGTGAAGGTGGAGGAAAATCTTCATGAAAGAAGGCCGGCCTTAAACCGAAAGCGGGCTAGGTTCTCTTTAAAACCTGATGCTAGGTAATTATGCTTATAATGGTGAAATTTCCCTACTTTTTTGTAAGAACAAAAATTATATTCTGGCCAACTTTTATGATATCTTCTTTTTGTAGACAACCTTCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGATCCCGAGGAGTTCTTTTTGGCCTACGAAAGGCTTGAAAGTAAGTTGTTCTTATGCTTTCTTTTCCACACAAAATTTGACATGCATATAGATTTGTGTTACTATTTTCTTCCCATTTTGCAATTGATCCATAGATGCCAAAAAAGAAATTCAAAAACAGACAGGAGCAGTTTTGAAGGACTTGAACCAACAAAATCCATCCACGAATAAACGCCAGCGTAGACCAGGGATTCTTGGGTATAATCACTTACATGTTATATCATTAACAAAGTTTTGTGTCTTTTTTTTAAGCGTCACGTTTTCTTGTGGTTGAGAATAAAAAATCTTCTCGTGCATGTAGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATGAGAATGTAGATCCTTCTCAAGTGACGCTTGAATCAGGTAGCATCAGTCCATCGATATTGGGCACAGAGACACACCCAAGTCCACATATAATTGACTCGGAAAAGAAAACTGATGAAGATGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGGAGTTCGTTGGTAAGTAATTTATAATAGAGATCAAAATGCAACTTTCTGCATGCATCATGGTCCAGTTGATTTTATCGTTCTCACATGCTGCTGTCCTCTTCCTTTCTTCTTTCTCATTTCGAACTGTCCCTGTTAAAGCTAAAGGTAGTTTTCATGGACCGTCTTTGATGCTGTGATCTTTTTTTTCTCCTTTCCTCTTAGGTTCAGTTACCAAGGCAGAGAACAAAGTGAATAAAATTTTGGATGAATTACTCTCTGCCAATTGTGAAGATC
TAGAAGGTGATCGAGCCATCAACATATTACAGGAGTGCTTGCAGATTAAACCCATTAATTTAGAGAAATTATGCCTTCCAGATTTAGAAGCCATTCCAACAATGAATTTGAAATCTTCAAGTTGCAATCTGTCAAAGCGTAGTTTCATCAGTGTGGACAATCAGTTACAAAGGATAGAAACTTTGAAATCTAAGCAGGACGATGAAACTTTGGTTAATCCTGTTTCTACACCATCCTCAATCAGAAGTCCATTGGCATCTGTATCAGCCCTAAATAGACGAATTTCGCTTTCAAATTCATCAGGTGATCCATTTTCTGCTCATGACATTGGCCAATCTCCAGCAAGAGATCCTTACCTTTTTGAACTCAGTAATTACTTGTCTGATGCAGTTGGTATTGCAGAGCAGTCAAGTATTTCTAAATTGAAGTCACTTTTAACCAAAGATAGCGGGACTGTAGCAAATGGAATTAAGCCATCCAAAATTCTTTTTGGAGATGTTGATTCCATGTCTAAAATGTCTTCAAGTAATGTTTTAAATGTCCCCCAAGTTGGTGTCGATACTGCCTTAAGTGGAACTCACGCCAGCATGGAAACTAAAGATGTTAGTGGCAGCCGCACAGAAGTGGAAGTAAATGAGAAATTGAGTTGCCTTGAAGATGTTGTGGCTAATATGCAGATGGAAGATCACGAAGGATCAGCTTCTGAGCAACCAAACTCATCCAAGGTGGATGTGATCAAAGAGTACCCAGTTGGCATTCAGAGTCAGTTGGGTATGATCTTCAACGCCAGTATCGGGTGTTACTAGCTGATAATTTCTTTTATCATCATGTAAATAGGAAATTAAAATCTAATGCTATGTTACTGCTGAGTGCTCATTTTACTGCGTTTGATGGCTGGACTTTTTTTGATAAATTTTGGACATGTATGCTTTACTAAATATGTCAGAGAAATATTATATAACTTCATGAGAAATATATCCCGGATAGTTTACTAAATTTTAATAGTTGAGGTGTACCACTTTAATGCTTCACACTATAACTAATATTTGTTCTCTGAGTGCAAAGATACATTCCAATATTTTTCAGTCATGTCACAGTTCTGAATTGGCTTCAACTTTCTCATATGAAGTACCATTTTCTCTCTTCACAGATTCAACTGCTTATTCGTACTTCTAGTTTTATGTTGAAAAGGCAGAAATTTATTCATGTTTGCTTATCACTTCCTAGTAGATGGATTATATGATTAGGCTGCATGTTTTTTGTTTTCTAACAGATCAAGCAACTGCTACTTGTACTGAAAATATTGTCGATGGGCCATCTAGATGCAGTGGAATGGATCACGCCGATGAGGTTTTTGCCCTTTCTTCTTTTCTTTTTTCTCCTCTTGGACCTTTATTTCCAGTAGCTCATTATTGATTGATCGGAGTGCTTTGTCAGATTATGGTATTTGTTATAAAGGAAGGCTTTGTCTATTGTGTTTTACAAAATAAAAAAAAAAAGACTTTGTTTATCGTGGTCATCATTGTCATAAGGAATTACTGTTCTCAGTTTCATTGATCATTTTGTCCGATACTTATTAGTAACTAATGATAAGTATTGTGATAATATGCAGATGGAAGATCACGAAGGATTAGCTATTGAGCAACCAAACTCATCCAAGGTGGATGTGATCAAAGAGTACCCGGTTGGCATTCAGAGTCAGTTGGGTATGATCTTCAATGCCAGTATCGTTAGTTACTTGCTGATAATTTGTTGTATCATCATGTAAATAGGAAATTGAAACCTAATGTTTTGTTGTTGCTGAGTGCTCATTTTAGTGCGTTTGATGGTTGGATTTTTTTTGATAAATTTTGGACATGTATGCTTTACTAAATATGTCTGAGACAATTTATAATTTCATGAGAAATACATCCCGGATAATTAACTAAAATTTAGTAGTTGAAATGTACTACTTTAATGCTTCACACTATAACTAATATTTGTTTCTCTGAGCGCAAAGA
TACATTCCAATATTTTTCAGTCATGTCATAGTTCTGAATTGCCTTCGGCTTTCTCATATGAAGAACCATTTCCTCTTTTCACAGATTCAACTGCTTATTCGTACTTCTATTTTTATGTTGAAATGGCAGAAATTTATTCATGTTTGCTTATCAATTCCCAGTAGATGGATTATATGATTTAGGCTGCATGTTCTTTGTTTTCTAACAGATCAATCAACTGCTACTTGTACTGAAAATATTGTCAACGGGCCGTCTAGAAGCAGTGGAACGGATCACCACGATGAGGTTTTTGACCTTTCTTCTTTTCTTTTTTCTCCTCTTGGACCCTTATTTCCAGTAGCTCATTATTGATGAGTCAGAGTGCTTTGTCAAATTGTGGTATTTGTTATAAATGAAGGCTTTGCCTATTATATTTTACAAAAAAAAAAAAGAAAGGGCTTTGTTTATGGTGGTCATCATTGTCATAAGGAAGTACTGTTCTCAGTTTCAGTAATCATTTTGTCCGGCATGTATTTGTTTAATTATTCTCACAATTACTAATATTGTATGTTGACGATTGTTTGACTACCCGTGCTATTTTTGTTAACGATATATTGCCATCTAAAACTTAAAATTAGAAGTAATCTACCATCTCCACTCGCTTCCTCTCTATACTTAATAATGAATTAGAAATAAGGCAATTAATTTACTTGCTGCCCCCCTGGGGAAGGAAGACAATGATAAGAACCCTCATTTTGTTAATTCGACTCGTTTACTAGAACACAGTTGTTTCTGGTTTTTACTTTTGAGAGACAATCAAGGAATCTCTTTTTTTTTTTTTTTTTTTGGTTATTGATGTAGGAACAGGTCAAGCCAAAATCTCGTGCAAACAAACAACGGAAAGGCAAAAAGATTTCTGGGAGGCAAAGCCTTGCAGGTGTTTAGCCGTAGATTTAACCCAAACTTTGATTTCTAATAGTATTTAGATTTTTCTTTTAAAAATACTCTGTTATCTACCAATCTTCCCAGGGGCTGGTACAACGTGGCAAAGTGGGGTGAGAAGAAGTACCAGGTTCAAAACACGACCGTTGGAGTACTGGAAAGGTGAAAGGCTGTTGTACGGACGCGTACATGAGAGTAAGTGGACACTTATTAGTTATTGCATTGCATCATCTTTGGAAATGTCTTTTAACAATTCCACTTGCATATGTTTCTCTCAAATTCCTTTATTTTAAGGCTCTTTTGGACTGATAATGTTCAACGCCAAAAGCTTTTAAATTGGTAAAATAATGAGGAGCATTCCTTCTGTCTTATTCTCCATGGTTTTATTTCTTTTTCACGTGCTTGTAAATTAATTACCGGATCCTTTTTGTTAGGCCTAGCAACAGTAATCGGGTTGAAGTATGTATCTCCTGCAAAAGGAAATGGCCAACCAACTATGAAGGTGAAGTCTTTAGTCTCCAATGAGTACAAAGATCTCGTTGAGTTAGCAGCTCTTCACTAA

mRNA sequence

ATGGTGACCGAAGAGGCTCGACACTCCGATGTGATCGATCCACTTGCTGCTTATTCTGGTATCAATCTCTTTTCGAACGCATTTCGTACTTTGCGGGATCCGTCAAAGCCACATGATCTTGGAACCGACCTTGACGGCATCCACAAGCACCTCAAATCCATGGTGTCAAGAAGTCCCAGTAAACTTGTAGAGCAGGCCAGATCAATTTTAGACGGGAACTCAAATTTGATGCAATCTGAAGCTGCCACATTTCTTGTAAAGAATGAGAAAGATGAGGAAGCTACAGTGAAGGTGGAGGAAAATCTTCATGAAAGAAGGCCGGCCTTAAACCGAAAGCGGGCTAGGTTCTCTTTAAAACCTGATGCTAGACAACCTTCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGATCCCGAGGAGTTCTTTTTGGCCTACGAAAGGCTTGAAAATGCCAAAAAAGAAATTCAAAAACAGACAGGAGCAGTTTTGAAGGACTTGAACCAACAAAATCCATCCACGAATAAACGCCAGCGTAGACCAGGGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATGAGAATGTAGATCCTTCTCAAGTGACGCTTGAATCAGGTAGCATCAGTCCATCGATATTGGGCACAGAGACACACCCAAGTCCACATATAATTGACTCGGAAAAGAAAACTGATGAAGATGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGGAGTTCGTTGGTTCAGTTACCAAGGCAGAGAACAAAGTGAATAAAATTTTGGATGAATTACTCTCTGCCAATTGTGAAGATCTAGAAGGTGATCGAGCCATCAACATATTACAGGAGTGCTTGCAGATTAAACCCATTAATTTAGAGAAATTATGCCTTCCAGATTTAGAAGCCATTCCAACAATGAATTTGAAATCTTCAAGTTGCAATCTGTCAAAGCGTAGTTTCATCAGTGTGGACAATCAGTTACAAAGGATAGAAACTTTGAAATCTAAGCAGGACGATGAAACTTTGGTTAATCCTGTTTCTACACCATCCTCAATCAGAAGTCCATTGGCATCTGTATCAGCCCTAAATAGACGAATTTCGCTTTCAAATTCATCAGGTGATCCATTTTCTGCTCATGACATTGGCCAATCTCCAGCAAGAGATCCTTACCTTTTTGAACTCAGTAATTACTTGTCTGATGCAGTTGGTATTGCAGAGCAGTCAAGTATTTCTAAATTGAAGTCACTTTTAACCAAAGATAGCGGGACTGTAGCAAATGGAATTAAGCCATCCAAAATTCTTTTTGGAGATGTTGATTCCATGTCTAAAATGTCTTCAAGTAATGTTTTAAATGTCCCCCAAGTTGGTGTCGATACTGCCTTAAGTGGAACTCACGCCAGCATGGAAACTAAAGATGTTAGTGGCAGCCGCACAGAAGTGGAAGTAAATGAGAAATTGAGTTGCCTTGAAGATGTTGTGGCTAATATGCAGATGGAAGATCACGAAGGATCAGCTTCTGAGCAACCAAACTCATCCAAGGTGGATGTGATCAAAGAGTACCCAGTTGGCATTCAGAGTCAGTTGGATCAAGCAACTGCTACTTGTACTGAAAATATTGTCGATGGGCCATCTAGATGCAGTGGAATGGATCACGCCGATGAGATGGAAGATCACGAAGGATTAGCTATTGAGCAACCAAACTCATCCAAGGTGGATGTGATCAAAGAGTACCCGGTTGGCATTCAGAGTCAGTTGGATCAATCAACTGCTACTTGTACTGAAAATATTGTCAACGGGCCGTCTAGAAGCAGTGGAACGGATCACCACGATGAGGAACAGGTCAAGCCAAAATCTCGTGCAAACAAACAACGGAAAGGCAAAAAGATTTCTGGGAGGCAAAGCCTTGCAGGGGCTGGTACAACGTGGCAAAGTGGGGTGAGAAGAAGTACCAGGTTCAAAACACGACCGTT
GGAGTACTGGAAAGGTGAAAGGCTGTTGTACGGACGCGTACATGAGAGCCTAGCAACAGTAATCGGGTTGAAGTATGTATCTCCTGCAAAAGGAAATGGCCAACCAACTATGAAGGTGAAGTCTTTAGTCTCCAATGAGTACAAAGATCTCGTTGAGTTAGCAGCTCTTCACTAA

Coding sequence (CDS)

ATGGTGACCGAAGAGGCTCGACACTCCGATGTGATCGATCCACTTGCTGCTTATTCTGGTATCAATCTCTTTTCGAACGCATTTCGTACTTTGCGGGATCCGTCAAAGCCACATGATCTTGGAACCGACCTTGACGGCATCCACAAGCACCTCAAATCCATGGTGTCAAGAAGTCCCAGTAAACTTGTAGAGCAGGCCAGATCAATTTTAGACGGGAACTCAAATTTGATGCAATCTGAAGCTGCCACATTTCTTGTAAAGAATGAGAAAGATGAGGAAGCTACAGTGAAGGTGGAGGAAAATCTTCATGAAAGAAGGCCGGCCTTAAACCGAAAGCGGGCTAGGTTCTCTTTAAAACCTGATGCTAGACAACCTTCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGATCCCGAGGAGTTCTTTTTGGCCTACGAAAGGCTTGAAAATGCCAAAAAAGAAATTCAAAAACAGACAGGAGCAGTTTTGAAGGACTTGAACCAACAAAATCCATCCACGAATAAACGCCAGCGTAGACCAGGGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATGAGAATGTAGATCCTTCTCAAGTGACGCTTGAATCAGGTAGCATCAGTCCATCGATATTGGGCACAGAGACACACCCAAGTCCACATATAATTGACTCGGAAAAGAAAACTGATGAAGATGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGGAGTTCGTTGGTTCAGTTACCAAGGCAGAGAACAAAGTGAATAAAATTTTGGATGAATTACTCTCTGCCAATTGTGAAGATCTAGAAGGTGATCGAGCCATCAACATATTACAGGAGTGCTTGCAGATTAAACCCATTAATTTAGAGAAATTATGCCTTCCAGATTTAGAAGCCATTCCAACAATGAATTTGAAATCTTCAAGTTGCAATCTGTCAAAGCGTAGTTTCATCAGTGTGGACAATCAGTTACAAAGGATAGAAACTTTGAAATCTAAGCAGGACGATGAAACTTTGGTTAATCCTGTTTCTACACCATCCTCAATCAGAAGTCCATTGGCATCTGTATCAGCCCTAAATAGACGAATTTCGCTTTCAAATTCATCAGGTGATCCATTTTCTGCTCATGACATTGGCCAATCTCCAGCAAGAGATCCTTACCTTTTTGAACTCAGTAATTACTTGTCTGATGCAGTTGGTATTGCAGAGCAGTCAAGTATTTCTAAATTGAAGTCACTTTTAACCAAAGATAGCGGGACTGTAGCAAATGGAATTAAGCCATCCAAAATTCTTTTTGGAGATGTTGATTCCATGTCTAAAATGTCTTCAAGTAATGTTTTAAATGTCCCCCAAGTTGGTGTCGATACTGCCTTAAGTGGAACTCACGCCAGCATGGAAACTAAAGATGTTAGTGGCAGCCGCACAGAAGTGGAAGTAAATGAGAAATTGAGTTGCCTTGAAGATGTTGTGGCTAATATGCAGATGGAAGATCACGAAGGATCAGCTTCTGAGCAACCAAACTCATCCAAGGTGGATGTGATCAAAGAGTACCCAGTTGGCATTCAGAGTCAGTTGGATCAAGCAACTGCTACTTGTACTGAAAATATTGTCGATGGGCCATCTAGATGCAGTGGAATGGATCACGCCGATGAGATGGAAGATCACGAAGGATTAGCTATTGAGCAACCAAACTCATCCAAGGTGGATGTGATCAAAGAGTACCCGGTTGGCATTCAGAGTCAGTTGGATCAATCAACTGCTACTTGTACTGAAAATATTGTCAACGGGCCGTCTAGAAGCAGTGGAACGGATCACCACGATGAGGAACAGGTCAAGCCAAAATCTCGTGCAAACAAACAACGGAAAGGCAAAAAGATTTCTGGGAGGCAAAGCCTTGCAGGGGCTGGTACAACGTGGCAAAGTGGGGTGAGAAGAAGTACCAGGTTCAAAACACGACCGTT
GGAGTACTGGAAAGGTGAAAGGCTGTTGTACGGACGCGTACATGAGAGCCTAGCAACAGTAATCGGGTTGAAGTATGTATCTCCTGCAAAAGGAAATGGCCAACCAACTATGAAGGTGAAGTCTTTAGTCTCCAATGAGTACAAAGATCTCGTTGAGTTAGCAGCTCTTCACTAA

Protein sequence

MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPINLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPYLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLEDVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTMKVKSLVSNEYKDLVELAALH
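
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The sketch below assumes the standard nuclear genetic code and reading frame 0; `translate` is an illustrative helper, not a function of the database, and only the first and last few codons of the CDS are used as input.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons and last eight codons of the CDS listed above.
print(translate("ATGGTGACCGAAGAGGCTCGACACTCC"))   # MVTEEARHS
print(translate("GTTGAGTTAGCAGCTCTTCACTAA"))      # VELAALH
```

Both fragments agree with the start (`MVTEEARHS...`) and end (`...VELAALH`) of the protein sequence shown above.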
Homology
BLAST of HG10023321 vs. NCBI nr
Match: XP_038896841.1 (centromere protein C isoform X2 [Benincasa hispida])

HSP 1 Score: 1203.3 bits (3112), Expect = 0.0e+00
Identity = 640/727 (88.03%), Positives = 661/727 (90.92%), Query Frame = 0

Query: 1   MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPS 60
           MVT+EARHSD IDPLAAYSGINLFS+AF TL DPSKPHDLG DLDGIHKHLKSMVSRSPS
Sbjct: 1   MVTQEARHSDAIDPLAAYSGINLFSSAFGTLPDPSKPHDLGADLDGIHKHLKSMVSRSPS 60

Query: 61  KLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRKRARFSLKP 120
           KL+EQARSILDGNSNLMQSEAATFLVKNEK+EEATVK EEN  ERRPALNRKRARFSLKP
Sbjct: 61  KLIEQARSILDGNSNLMQSEAATFLVKNEKNEEATVKAEENPQERRPALNRKRARFSLKP 120

Query: 121 DARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQ 180
           DARQP VNLEPTFDIKQLKDPEEFFLAYER ENAKKEIQKQTGAVLKDLNQQNPSTN RQ
Sbjct: 121 DARQPPVNLEPTFDIKQLKDPEEFFLAYERHENAKKEIQKQTGAVLKDLNQQNPSTNTRQ 180

Query: 181 RRPGILGRSVRYKHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKK 240
           RRPGILGRSVRYKHQYSSITTEDD+NVDPSQVT ESG ISP ++GTETHPSPHIIDS  K
Sbjct: 181 RRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESGGISPPVMGTETHPSPHIIDSNNK 240

Query: 241 TDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI 300
           TDEDVAF   EEEEEFV SVTKAENKVNKILDELLS NC DLEGDRAINILQECLQIKP 
Sbjct: 241 TDEDVAF---EEEEEFVASVTKAENKVNKILDELLSDNCGDLEGDRAINILQECLQIKPF 300

Query: 301 NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPS 360
           NLEKLCLPDLEAI TM LKSSS NLSKRS ISV NQLQRIETLKSKQDDE LVNP+S PS
Sbjct: 301 NLEKLCLPDLEAIQTMKLKSSSGNLSKRSLISVVNQLQRIETLKSKQDDENLVNPLSPPS 360

Query: 361 SIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPYLFELSNYLSDAVGIAEQSSI 420
           SIRSPLAS+SALNRRISLSNSSGDPFSAH I QSPARDPYLF L+N LSDA GIAEQSS+
Sbjct: 361 SIRSPLASLSALNRRISLSNSSGDPFSAHGIDQSPARDPYLFRLNNNLSDAAGIAEQSSV 420

Query: 421 SKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMET 480
           SKLKSLLTKD GTVANGIKPSKILF DVDSMSK+SSS VLNVP+VG +T LSGTH SME 
Sbjct: 421 SKLKSLLTKDGGTVANGIKPSKILFEDVDSMSKISSSYVLNVPEVGCETVLSGTHVSMEA 480

Query: 481 KDVSGSRTEVEVNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQ 540
           KDVSG   EVEVNEKLSCLE   D VANMQMEDHEGSASEQPNSSKVD+IKEYPVGIQSQ
Sbjct: 481 KDVSGGSIEVEVNEKLSCLEVQVDDVANMQMEDHEGSASEQPNSSKVDLIKEYPVGIQSQ 540

Query: 541 LDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ 600
           LDQ+TA C ENI DGPSR SG DH  EMEDH+G A EQPNSS VDVIKEYPVG+Q QLDQ
Sbjct: 541 LDQSTAICIENIADGPSRSSGTDHHYEMEDHKGSASEQPNSSNVDVIKEYPVGMQGQLDQ 600

Query: 601 STATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVR 660
            TATCTENI +GPSRSSGTDH +EEQ KPKSRANKQ +GKKISGRQSLAGAGTTWQ GVR
Sbjct: 601 PTATCTENIADGPSRSSGTDHLNEEQAKPKSRANKQCRGKKISGRQSLAGAGTTWQGGVR 660

Query: 661 RSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTMKVKSLVSNEYKDL 720
           RSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQP MKVKSLVSNEYKDL
Sbjct: 661 RSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPIMKVKSLVSNEYKDL 720

Query: 721 VELAALH 725
           VELAALH
Sbjct: 721 VELAALH 724
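
The percentages in an HSP summary line are simply the match counts divided by the alignment length, which includes gap columns and can therefore exceed the 725-residue query length. A minimal sketch of that arithmetic follows, using the counts reported for the first hit; `parse_hsp_counts` is a hypothetical helper and the regular expression assumes the `Label = a/b (p%)` layout used in this report.

```python
import re

def parse_hsp_counts(line):
    """Return {label: (matches, aligned_length, percent)} for 'Label = a/b' fields."""
    stats = {}
    for label, hits, total in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        hits, total = int(hits), int(total)
        stats[label] = (hits, total, round(100 * hits / total, 2))
    return stats

line = "Identity = 640/727 (88.03%), Positives = 661/727 (90.92%), Query Frame = 0"
stats = parse_hsp_counts(line)
print(stats["Identity"])   # (640, 727, 88.03)
print(stats["Positives"])  # (661, 727, 90.92)
```

Fields without an `a/b` fraction, such as `Query Frame = 0`, are ignored by the pattern.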

BLAST of HG10023321 vs. NCBI nr
Match: XP_011659552.1 (centromere protein C isoform X3 [Cucumis sativus] >KGN45338.1 hypothetical protein Csa_015680 [Cucumis sativus])

HSP 1 Score: 1161.4 bits (3003), Expect = 0.0e+00
Identity = 621/728 (85.30%), Positives = 653/728 (89.70%), Query Frame = 0

Query: 1   MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPS 60
           M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPS
Sbjct: 4   MANEEARHSDVIDPLAAYSGINLFSTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPS 63

Query: 61  KLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRKRARFSLKP 120
           KL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EENL ERRPALNRKRARFSLKP
Sbjct: 64  KLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKAEENLQERRPALNRKRARFSLKP 123

Query: 121 DARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQ 180
           DARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQ
Sbjct: 124 DARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQ 183

Query: 181 RRPGILGRSVRYKHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKK 240
           RRPGILGRSVRYKHQYSSI TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKK
Sbjct: 184 RRPGILGRSVRYKHQYSSIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKK 243

Query: 241 TDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI 300
           TDEDVAFEEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINILQE LQIKP+
Sbjct: 244 TDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIKPL 303

Query: 301 NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPS 360
            LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  LVNPVSTPS
Sbjct: 304 TLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPS 363

Query: 361 SIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPYLFELSNYLSDAVGIAEQSSI 420
           S+RSPLAS+SALNRRISLSNSS D FSAH I QSP+RDPYLFEL N+LSDAVG  EQSS+
Sbjct: 364 SMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRDPYLFELGNHLSDAVGNTEQSSV 423

Query: 421 SKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMET 480
           SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALSGT+AS E 
Sbjct: 424 SKLKPLLTRDGGTVANGIKPSKILSGD-DSMSNISSSNILNVPQVGGNTALSGTYASTEA 483

Query: 481 KDVSGSRTEVEVNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQ 540
           K+VS S T+VE+NEKLSCLE   D VANMQ+EDHEGSASEQP  S+VD+IKEYPVGI+SQ
Sbjct: 484 KNVSVSSTDVEINEKLSCLEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQ 543

Query: 541 LDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ 600
           LDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYPV IQSQLDQ
Sbjct: 544 LDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLDQ 603

Query: 601 S-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGV 660
           S T TC ENI +G SRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGV
Sbjct: 604 STTTTCAENIADGASRSSGTDHHDGEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGV 663

Query: 661 RRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTMKVKSLVSNEYKD 720
           RRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSPAKGNG+PTMKVKSLVSNEYKD
Sbjct: 664 RRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVSPAKGNGKPTMKVKSLVSNEYKD 723

Query: 721 LVELAALH 725
           LVELAALH
Sbjct: 724 LVELAALH 730

BLAST of HG10023321 vs. NCBI nr
Match: XP_031745137.1 (centromere protein C isoform X4 [Cucumis sativus])

HSP 1 Score: 1155.2 bits (2987), Expect = 0.0e+00
Identity = 620/728 (85.16%), Positives = 652/728 (89.56%), Query Frame = 0

Query: 1   MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPS 60
           M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPS
Sbjct: 4   MANEEARHSDVIDPLAAYSGINLFSTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPS 63

Query: 61  KLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRKRARFSLKP 120
           KL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EENL ERRPALNRKRARFSLKP
Sbjct: 64  KLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKAEENLQERRPALNRKRARFSLKP 123

Query: 121 DARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQ 180
           DARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQ
Sbjct: 124 DARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQ 183

Query: 181 RRPGILGRSVRYKHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKK 240
           RRPGILG SVRYKHQYSSI TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKK
Sbjct: 184 RRPGILG-SVRYKHQYSSIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKK 243

Query: 241 TDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI 300
           TDEDVAFEEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINILQE LQIKP+
Sbjct: 244 TDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIKPL 303

Query: 301 NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPS 360
            LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  LVNPVSTPS
Sbjct: 304 TLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPS 363

Query: 361 SIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPYLFELSNYLSDAVGIAEQSSI 420
           S+RSPLAS+SALNRRISLSNSS D FSAH I QSP+RDPYLFEL N+LSDAVG  EQSS+
Sbjct: 364 SMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRDPYLFELGNHLSDAVGNTEQSSV 423

Query: 421 SKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMET 480
           SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALSGT+AS E 
Sbjct: 424 SKLKPLLTRDGGTVANGIKPSKILSGD-DSMSNISSSNILNVPQVGGNTALSGTYASTEA 483

Query: 481 KDVSGSRTEVEVNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQ 540
           K+VS S T+VE+NEKLSCLE   D VANMQ+EDHEGSASEQP  S+VD+IKEYPVGI+SQ
Sbjct: 484 KNVSVSSTDVEINEKLSCLEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQ 543

Query: 541 LDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ 600
           LDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYPV IQSQLDQ
Sbjct: 544 LDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLDQ 603

Query: 601 S-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGV 660
           S T TC ENI +G SRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGV
Sbjct: 604 STTTTCAENIADGASRSSGTDHHDGEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGV 663

Query: 661 RRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTMKVKSLVSNEYKD 720
           RRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSPAKGNG+PTMKVKSLVSNEYKD
Sbjct: 664 RRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVSPAKGNGKPTMKVKSLVSNEYKD 723

Query: 721 LVELAALH 725
           LVELAALH
Sbjct: 724 LVELAALH 729

BLAST of HG10023321 vs. NCBI nr
Match: XP_031745135.1 (centromere protein C isoform X1 [Cucumis sativus])

HSP 1 Score: 1154.0 bits (2984), Expect = 0.0e+00
Identity = 621/736 (84.38%), Positives = 653/736 (88.72%), Query Frame = 0

Query: 1   MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPS 60
           M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPS
Sbjct: 4   MANEEARHSDVIDPLAAYSGINLFSTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPS 63

Query: 61  KLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRKRARFSLKP 120
           KL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EENL ERRPALNRKRARFSLKP
Sbjct: 64  KLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKAEENLQERRPALNRKRARFSLKP 123

Query: 121 DARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQ 180
           DARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQ
Sbjct: 124 DARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQ 183

Query: 181 RRPGILG--------RSVRYKHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSP 240
           RRPGILG        RSVRYKHQYSSI TEDD+NVDPSQVT +SG  SP  LGTETHPSP
Sbjct: 184 RRPGILGPKSSRACRRSVRYKHQYSSIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSP 243

Query: 241 HIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQ 300
           HIIDSEKKTDEDVAFEEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINILQ
Sbjct: 244 HIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQ 303

Query: 301 ECLQIKPINLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETL 360
           E LQIKP+ LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  L
Sbjct: 304 ERLQIKPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNL 363

Query: 361 VNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPYLFELSNYLSDAV 420
           VNPVSTPSS+RSPLAS+SALNRRISLSNSS D FSAH I QSP+RDPYLFEL N+LSDAV
Sbjct: 364 VNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRDPYLFELGNHLSDAV 423

Query: 421 GIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALS 480
           G  EQSS+SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALS
Sbjct: 424 GNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGD-DSMSNISSSNILNVPQVGGNTALS 483

Query: 481 GTHASMETKDVSGSRTEVEVNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKE 540
           GT+AS E K+VS S T+VE+NEKLSCLE   D VANMQ+EDHEGSASEQP  S+VD+IKE
Sbjct: 484 GTYASTEAKNVSVSSTDVEINEKLSCLEAQADAVANMQIEDHEGSASEQPKLSEVDLIKE 543

Query: 541 YPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPV 600
           YPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYPV
Sbjct: 544 YPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPV 603

Query: 601 GIQSQLDQS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGA 660
            IQSQLDQS T TC ENI +G SRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGA
Sbjct: 604 AIQSQLDQSTTTTCAENIADGASRSSGTDHHDGEQVKPKSRANKQHKGKKISRRQSLAGA 663

Query: 661 GTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTMKVKS 720
           GTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSPAKGNG+PTMKVKS
Sbjct: 664 GTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVSPAKGNGKPTMKVKS 723

Query: 721 LVSNEYKDLVELAALH 725
           LVSNEYKDLVELAALH
Sbjct: 724 LVSNEYKDLVELAALH 738

BLAST of HG10023321 vs. NCBI nr
Match: XP_031745136.1 (centromere protein C isoform X2 [Cucumis sativus])

HSP 1 Score: 1146.0 bits (2963), Expect = 0.0e+00
Identity = 619/736 (84.10%), Positives = 651/736 (88.45%), Query Frame = 0

Query: 1   MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPS 60
           M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPS
Sbjct: 4   MANEEARHSDVIDPLAAYSGINLFSTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPS 63

Query: 61  KLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRKRARFSLKP 120
           KL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EENL ERRPALNRKRARFSLKP
Sbjct: 64  KLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKAEENLQERRPALNRKRARFSLKP 123

Query: 121 DARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQ 180
           DARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQ
Sbjct: 124 DARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQ 183

Query: 181 RRPGILG--------RSVRYKHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSP 240
           RRPGILG        RSVRYKHQYSSI TEDD+NVDPSQVT +SG  SP  LGTETHPSP
Sbjct: 184 RRPGILGPKSSRACRRSVRYKHQYSSIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSP 243

Query: 241 HIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQ 300
           HIIDSEKKTDEDVAFEEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINILQ
Sbjct: 244 HIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQ 303

Query: 301 ECLQIKPINLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETL 360
           E LQIKP+ LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  L
Sbjct: 304 ERLQIKPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNL 363

Query: 361 VNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPYLFELSNYLSDAV 420
           VNPVSTPSS+RSPLAS+SALNRRISLSNSS D FSAH I QSP+RDPYLFEL N+LSDAV
Sbjct: 364 VNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRDPYLFELGNHLSDAV 423

Query: 421 GIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALS 480
           G  EQSS+SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALS
Sbjct: 424 GNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGD-DSMSNISSSNILNVPQVGGNTALS 483

Query: 481 GTHASMETKDVSGSRTEVEVNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKE 540
           GT+AS E K+VS S T+VE+NEKLSCLE   D VANMQ+EDHEGSASEQP  S+VD+IKE
Sbjct: 484 GTYASTEAKNVSVSSTDVEINEKLSCLEAQADAVANMQIEDHEGSASEQPKLSEVDLIKE 543

Query: 541 YPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPV 600
           YPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYPV
Sbjct: 544 YPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPV 603

Query: 601 GIQSQLDQS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGA 660
            IQSQLDQS T TC ENI +G SRSSGTDHHD   VKPKSRANKQ KGKKIS RQSLAGA
Sbjct: 604 AIQSQLDQSTTTTCAENIADGASRSSGTDHHD--GVKPKSRANKQHKGKKISRRQSLAGA 663

Query: 661 GTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTMKVKS 720
           GTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSPAKGNG+PTMKVKS
Sbjct: 664 GTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVSPAKGNGKPTMKVKS 723

Query: 721 LVSNEYKDLVELAALH 725
           LVSNEYKDLVELAALH
Sbjct: 724 LVSNEYKDLVELAALH 736

BLAST of HG10023321 vs. ExPASy Swiss-Prot
Match: Q66LG9 (Centromere protein C OS=Arabidopsis thaliana OX=3702 GN=CENPC PE=2 SV=1)

HSP 1 Score: 229.6 bits (584), Expect = 1.2e-58
Identity = 230/755 (30.46%), Positives = 350/755 (46.36%), Query Frame = 0

Query: 13  DPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDG 72
           DPL AYSG++LF    ++L +P  P     DL   H  L+SM     S+  EQA++IL+ 
Sbjct: 15  DPLQAYSGLSLFPRTLKSLSNPLPPSYQSEDLQQTHTLLQSMPFEIQSEHQEQAKAILE- 74

Query: 73  NSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRKRARFSLKPDARQPSVNLEPT 132
                           + D +  +    N  ERRP L+RKR  FSL     QP   + P+
Sbjct: 75  ----------------DVDVDVQLNPIPNKRERRPGLDRKRKSFSLHLTTSQPP-PVAPS 134

Query: 133 FDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVR- 192
           FD  +    E+FF AY++ E A +E QKQTG+ + D+ +  PS  +R RRPGI GR  R 
Sbjct: 135 FDPSKYPRSEDFFAAYDKFELANREWQKQTGSSVIDIQENPPS--RRPRRPGIPGRKRRP 194

Query: 193 YKHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEE 252
           +K  ++     D  N++ S+  +   S        E+  + H+   +++ D+        
Sbjct: 195 FKESFTDSYFTDVINLEASEKEIPIASEQ----SLESATAAHVTTVDREVDD-------- 254

Query: 253 EEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPINLEKLCLPDLE 312
                  S    +  +N +L +LL+ + E+LEGD AI +L+E LQIK  N+EK  +P+ +
Sbjct: 255 -------STVDTDKDLNNVLKDLLACSREELEGDGAIKLLEERLQIKSFNIEKFSIPEFQ 314

Query: 313 AIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSA 372
            +  MNLK+S  N   R  +S    +Q I  LK         N V+   +  SP      
Sbjct: 315 DVRKMNLKASGSNPPNRKSLS---DIQNI--LKG-------TNRVAVRKNSHSPSPQTI- 374

Query: 373 LNRRISLSNSSGDPFSAHDI------GQSPAR---DPYLFELSNYLSDAVGIAEQSS--I 432
             +  S  N   D FS  DI       Q P+     P   ++ N     VG  + +S   
Sbjct: 375 --KHFSSPNPPVDQFSFPDIHNLLPGDQQPSEVNVQPIAKDIPNTSPTNVGTVDVASPFN 434

Query: 433 SKLKSLLTKDSGTVANGIKPSKILFGD------VDSMSKMSSS----NV-LNVPQVGVDT 492
             +     +D   + +GI  S +          +DS+S  SS+    NV +      VD 
Sbjct: 435 DSVVKRSGEDDSHIHSGIHRSHLSRDGNPDICVMDSISNRSSAMLQKNVDMRTKGKEVDV 494

Query: 493 ALSGTHASMETKDVSGSRTEVEVNEKLSCLE--------DVVANMQMED-----HEGSAS 552
            +S + A+  T D      + E+NE+   LE        +V     +E+      +G++S
Sbjct: 495 PMSESGANRNTGD---RENDAEINEETDNLERLAECASKEVTRPFTVEEDSIPYQQGASS 554

Query: 553 EQPNSSKVDVIKEYPVGIQSQLDQATATC----TENIVDGPSRCSGMDHADEME--DHEG 612
           + PN +     ++Y   +   L+ A         EN+  G +    +++A E+    H+ 
Sbjct: 555 KSPNRAP----EQYNT-MGGSLEHAEHNQGLHEEENVNTGSASGLQVENAPEVHKYSHKQ 614

Query: 613 LAIEQPNSSKVDVIKEYPVGIQSQLDQSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRA 672
               +   S    +K+    +  +        T    +   + +    ++ E+ KPK   
Sbjct: 615 TNKRRKRGSSDSNVKKRSKTVHGETGGDKQMKTLPHESRAKKQTKGKSNEREEKKPKKTL 674

Query: 673 NKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGL 725
               +GK  S R+SLA AGT  + GVRRSTR K+RPLEYW+GER LYGR+HESL TVIG+
Sbjct: 675 T--HEGKLFSCRKSLAAAGTKIEGGVRRSTRIKSRPLEYWRGERFLYGRIHESLTTVIGI 705

BLAST of HG10023321 vs. ExPASy TrEMBL
Match: A0A0A0K774 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G440590 PE=3 SV=1)

HSP 1 Score: 1161.4 bits (3003), Expect = 0.0e+00
Identity = 621/728 (85.30%), Positives = 653/728 (89.70%), Query Frame = 0

Query: 1   MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPS 60
           M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPS
Sbjct: 4   MANEEARHSDVIDPLAAYSGINLFSTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPS 63

Query: 61  KLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRKRARFSLKP 120
           KL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EENL ERRPALNRKRARFSLKP
Sbjct: 64  KLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKAEENLQERRPALNRKRARFSLKP 123

Query: 121 DARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQ 180
           DARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQ
Sbjct: 124 DARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQ 183

Query: 181 RRPGILGRSVRYKHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKK 240
           RRPGILGRSVRYKHQYSSI TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKK
Sbjct: 184 RRPGILGRSVRYKHQYSSIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKK 243

Query: 241 TDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI 300
           TDEDVAFEEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINILQE LQIKP+
Sbjct: 244 TDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIKPL 303

Query: 301 NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPS 360
            LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  LVNPVSTPS
Sbjct: 304 TLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPS 363

Query: 361 SIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPYLFELSNYLSDAVGIAEQSSI 420
           S+RSPLAS+SALNRRISLSNSS D FSAH I QSP+RDPYLFEL N+LSDAVG  EQSS+
Sbjct: 364 SMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRDPYLFELGNHLSDAVGNTEQSSV 423

Query: 421 SKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMET 480
           SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALSGT+AS E 
Sbjct: 424 SKLKPLLTRDGGTVANGIKPSKILSGD-DSMSNISSSNILNVPQVGGNTALSGTYASTEA 483

Query: 481 KDVSGSRTEVEVNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQ 540
           K+VS S T+VE+NEKLSCLE   D VANMQ+EDHEGSASEQP  S+VD+IKEYPVGI+SQ
Sbjct: 484 KNVSVSSTDVEINEKLSCLEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQ 543

Query: 541 LDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ 600
           LDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYPV IQSQLDQ
Sbjct: 544 LDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLDQ 603

Query: 601 S-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGV 660
           S T TC ENI +G SRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGV
Sbjct: 604 STTTTCAENIADGASRSSGTDHHDGEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGV 663

Query: 661 RRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTMKVKSLVSNEYKD 720
           RRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSPAKGNG+PTMKVKSLVSNEYKD
Sbjct: 664 RRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVSPAKGNGKPTMKVKSLVSNEYKD 723

Query: 721 LVELAALH 725
           LVELAALH
Sbjct: 724 LVELAALH 730

BLAST of HG10023321 vs. ExPASy TrEMBL
Match: A0A1S3CDU7 (uncharacterized protein LOC103499749 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499749 PE=3 SV=1)

HSP 1 Score: 1137.9 bits (2942), Expect = 0.0e+00
Identity = 610/729 (83.68%), Positives = 652/729 (89.44%), Query Frame = 0

Query: 1   MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPS 60
           MV EE R SDVIDPLAAYSGINLF  AF TL DPSKPHDLGTDLDGIHK LKSMV RSPS
Sbjct: 3   MVNEETRPSDVIDPLAAYSGINLFPTAFGTLTDPSKPHDLGTDLDGIHKRLKSMVLRSPS 62

Query: 61  KLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRKRARFSLKP 120
           KL+EQARSILDGNS  M SEAATFLVKNEK+E A+VK EEN  ERRPALNRKRARFSLKP
Sbjct: 63  KLLEQARSILDGNSKSMISEAATFLVKNEKNEAASVKAEENPQERRPALNRKRARFSLKP 122

Query: 121 DARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQ 180
           DA QP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ GAVLKDLNQQNPSTN RQ
Sbjct: 123 DAGQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQMGAVLKDLNQQNPSTNTRQ 182

Query: 181 RRPGILGRSVRYKHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKK 240
           RRPGILGRSVRYKHQYSSITTEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKK
Sbjct: 183 RRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFDSGVFSPLKLGTETHPSPHIIDSEKK 242

Query: 241 TDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI 300
           TDEDVAFEEEEEEEE V S TKAEN+VN ILDE LS NCEDLEGDRAINILQE LQIKP+
Sbjct: 243 TDEDVAFEEEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKPL 302

Query: 301 NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPS 360
            LEKLCLPDLEAIPTMNLKS+  NLSKRS ISVDNQLQ+ ETLKSK+D+E LVN VSTPS
Sbjct: 303 TLEKLCLPDLEAIPTMNLKSTRGNLSKRSLISVDNQLQKTETLKSKEDNENLVNLVSTPS 362

Query: 361 SIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPYLFELSNYLSDAVGIAEQSSI 420
           S+RSPLAS+SALNRRISLSNSSGD FSAH I +SPARDPYLFEL N+LSDAVGI E SS+
Sbjct: 363 SMRSPLASLSALNRRISLSNSSGDSFSAHGIDRSPARDPYLFELGNHLSDAVGITEHSSV 422

Query: 421 SKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMET 480
           SKLK LLT+D GT+ANGI+PSKIL GD DSMSK+SSSN+LNV QVG +TALSGT+AS + 
Sbjct: 423 SKLKPLLTRDGGTIANGIQPSKILSGD-DSMSKISSSNILNVLQVGSNTALSGTYASTDA 482

Query: 481 KDVSGSRTEVEVNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQ 540
           K+VSGS T+VE+NEKLSCLE   DVVANMQ+ DH+GSASEQP  S+VD+I+EYPVGI+SQ
Sbjct: 483 KNVSGSSTDVEINEKLSCLEAQADVVANMQI-DHQGSASEQPKLSEVDLIEEYPVGIRSQ 542

Query: 541 LDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ 600
           LDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQPNSSKVD+IKEYPVGIQ QLDQ
Sbjct: 543 LDQSAATCTENIVDGSSRSSGTEHHDEMEDHEGSASEQPNSSKVDMIKEYPVGIQIQLDQ 602

Query: 601 S--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSG 660
           S  T TC E IV+G SRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SG
Sbjct: 603 STTTTTCAEKIVDGTSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWKSG 662

Query: 661 VRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTMKVKSLVSNEYK 720
           VRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP KGNG+PTMKVKSLVSNEYK
Sbjct: 663 VRRSTRFKIRPLEYWKGERMLYGRVHESLATVIGLKYVSPEKGNGKPTMKVKSLVSNEYK 722

Query: 721 DLVELAALH 725
           DLV+LAALH
Sbjct: 723 DLVDLAALH 729

BLAST of HG10023321 vs. ExPASy TrEMBL
Match: A0A5A7UUE4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold339G002780 PE=3 SV=1)

HSP 1 Score: 1134.8 bits (2934), Expect = 0.0e+00
Identity = 609/728 (83.65%), Positives = 651/728 (89.42%), Query Frame = 0

Query: 1   MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPS 60
           MV EE R SDVIDPLAAYSGINLF  AF TL D SKPHDLGTDLDGIHK LKSMV RSPS
Sbjct: 3   MVNEETRPSDVIDPLAAYSGINLFPTAFGTLTDSSKPHDLGTDLDGIHKRLKSMVLRSPS 62

Query: 61  KLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRKRARFSLKP 120
           KL+EQARSILDGNS  M SEAATFLVKNEK+E A+VK EEN  ERRPALNRKRARFSLKP
Sbjct: 63  KLLEQARSILDGNSKSMISEAATFLVKNEKNEAASVKAEENPQERRPALNRKRARFSLKP 122

Query: 121 DARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQ 180
           DA QP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ GAVLKDLNQQNPSTN RQ
Sbjct: 123 DAGQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQMGAVLKDLNQQNPSTNTRQ 182

Query: 181 RRPGILGRSVRYKHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKK 240
           RRPGILGRSVRYKHQYSSITTEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKK
Sbjct: 183 RRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFDSGVFSPLKLGTETHPSPHIIDSEKK 242

Query: 241 TDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI 300
           TDEDVAFEEEEEEEE V S TKAEN+VN ILDE LS NCEDLEGDRAINILQE LQIKP+
Sbjct: 243 TDEDVAFEEEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKPL 302

Query: 301 NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPS 360
            LEKLCLPDLEAIPTMNLKS+  NLSKRS ISVDNQLQ+ ETLKSK+D+E LVN VSTPS
Sbjct: 303 TLEKLCLPDLEAIPTMNLKSTRGNLSKRSLISVDNQLQKTETLKSKEDNENLVNLVSTPS 362

Query: 361 SIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPYLFELSNYLSDAVGIAEQSSI 420
           S+RSPLAS+SALNRRISLSNSSGD FSAH I +SPARDPYLFEL N+LSDAVGI E SS+
Sbjct: 363 SMRSPLASLSALNRRISLSNSSGDSFSAHGIDRSPARDPYLFELGNHLSDAVGITEHSSV 422

Query: 421 SKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMET 480
           SKLK LLT+D GT+ANGI+PSKIL GD DSMSK+SSSN+LNV QVG +TALSGT+AS + 
Sbjct: 423 SKLKPLLTRDGGTIANGIQPSKILSGD-DSMSKISSSNILNVLQVGGNTALSGTYASTDA 482

Query: 481 KDVSGSRTEVEVNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQ 540
           K+VSGS T+VE+NEKLSCLE   DVVANMQ+ DH+GSASEQP  S+VD+I+EYPVGI+SQ
Sbjct: 483 KNVSGSSTDVEINEKLSCLEAQADVVANMQI-DHQGSASEQPKLSEVDLIEEYPVGIRSQ 542

Query: 541 LDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ 600
           LDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQPNSSKVD+IKEYPVGIQ QLDQ
Sbjct: 543 LDQSAATCTENIVDGSSRSSGTEHHDEMEDHEGSASEQPNSSKVDMIKEYPVGIQIQLDQ 602

Query: 601 S-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGV 660
           S T TC E IV+G SRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGV
Sbjct: 603 STTTTCAEKIVDGTSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWKSGV 662

Query: 661 RRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTMKVKSLVSNEYKD 720
           RRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP KGNG+PTMKVKSLVSNEYKD
Sbjct: 663 RRSTRFKIRPLEYWKGERMLYGRVHESLATVIGLKYVSPEKGNGKPTMKVKSLVSNEYKD 722

Query: 721 LVELAALH 725
           LV+LAALH
Sbjct: 723 LVDLAALH 728

BLAST of HG10023321 vs. ExPASy TrEMBL
Match: A0A1S3CDU5 (uncharacterized protein LOC103499749 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499749 PE=3 SV=1)

HSP 1 Score: 1129.8 bits (2921), Expect = 0.0e+00
Identity = 608/729 (83.40%), Positives = 650/729 (89.16%), Query Frame = 0

Query: 1   MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPS 60
           MV EE R SDVIDPLAAYSGINLF  AF TL DPSKPHDLGTDLDGIHK LKSMV RSPS
Sbjct: 3   MVNEETRPSDVIDPLAAYSGINLFPTAFGTLTDPSKPHDLGTDLDGIHKRLKSMVLRSPS 62

Query: 61  KLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRKRARFSLKP 120
           KL+EQARSILDGNS  M SEAATFLVKNEK+E A+VK EEN  ERRPALNRKRARFSLKP
Sbjct: 63  KLLEQARSILDGNSKSMISEAATFLVKNEKNEAASVKAEENPQERRPALNRKRARFSLKP 122

Query: 121 DARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQ 180
           DA QP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ GAVLKDLNQQNPSTN RQ
Sbjct: 123 DAGQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQMGAVLKDLNQQNPSTNTRQ 182

Query: 181 RRPGILGRSVRYKHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKK 240
           RRPGILGRSVRYKHQYSSITTEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKK
Sbjct: 183 RRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFDSGVFSPLKLGTETHPSPHIIDSEKK 242

Query: 241 TDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI 300
           TDEDVAFEEEEEEEE V S TKAEN+VN ILDE LS NCEDLEGDRAINILQE LQIKP+
Sbjct: 243 TDEDVAFEEEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKPL 302

Query: 301 NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPS 360
            LEKLCLPDLEAIPTMNLKS+  NLSKRS ISVDNQLQ+ ETLKSK+D+E LVN VSTPS
Sbjct: 303 TLEKLCLPDLEAIPTMNLKSTRGNLSKRSLISVDNQLQKTETLKSKEDNENLVNLVSTPS 362

Query: 361 SIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPYLFELSNYLSDAVGIAEQSSI 420
           S+RSPLAS+SALNRRISLSNSSGD FSAH I +SPARDPYLFEL N+LSDAVGI E SS+
Sbjct: 363 SMRSPLASLSALNRRISLSNSSGDSFSAHGIDRSPARDPYLFELGNHLSDAVGITEHSSV 422

Query: 421 SKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMET 480
           SKLK LLT+D GT+ANGI+PSKIL GD DSMSK+SSSN+LNV QVG +TALSGT+AS + 
Sbjct: 423 SKLKPLLTRDGGTIANGIQPSKILSGD-DSMSKISSSNILNVLQVGSNTALSGTYASTDA 482

Query: 481 KDVSGSRTEVEVNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQ 540
           K+VSGS T+VE+NEKLSCLE   DVVANMQ+ DH+GSASEQP  S+VD+I+EYPVGI+SQ
Sbjct: 483 KNVSGSSTDVEINEKLSCLEAQADVVANMQI-DHQGSASEQPKLSEVDLIEEYPVGIRSQ 542

Query: 541 LDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ 600
           LDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQPNSSKVD+IKEYPVGIQ QLDQ
Sbjct: 543 LDQSAATCTENIVDGSSRSSGTEHHDEMEDHEGSASEQPNSSKVDMIKEYPVGIQIQLDQ 602

Query: 601 S--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSG 660
           S  T TC E IV+G SRSSGTDHHDE  VKPKSRANKQRKGKKISGRQSLAGAGTTW+SG
Sbjct: 603 STTTTTCAEKIVDGTSRSSGTDHHDE--VKPKSRANKQRKGKKISGRQSLAGAGTTWKSG 662

Query: 661 VRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTMKVKSLVSNEYK 720
           VRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP KGNG+PTMKVKSLVSNEYK
Sbjct: 663 VRRSTRFKIRPLEYWKGERMLYGRVHESLATVIGLKYVSPEKGNGKPTMKVKSLVSNEYK 722

Query: 721 DLVELAALH 725
           DLV+LAALH
Sbjct: 723 DLVDLAALH 727

BLAST of HG10023321 vs. ExPASy TrEMBL
Match: A0A1S4E341 (uncharacterized protein LOC103499749 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103499749 PE=3 SV=1)

HSP 1 Score: 1076.2 bits (2782), Expect = 0.0e+00
Identity = 587/729 (80.52%), Positives = 627/729 (86.01%), Query Frame = 0

Query: 1   MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPS 60
           MV EE R SDVIDPLAAYSGINLF  AF TL DPSKPHDLGTDLDGIHK LKSMV RSPS
Sbjct: 3   MVNEETRPSDVIDPLAAYSGINLFPTAFGTLTDPSKPHDLGTDLDGIHKRLKSMVLRSPS 62

Query: 61  KLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRKRARFSLKP 120
           KL+EQARSILDGNS  M SEAATFLVKNEK+E A+VK EEN  ERRPALNRKRARFSLKP
Sbjct: 63  KLLEQARSILDGNSKSMISEAATFLVKNEKNEAASVKAEENPQERRPALNRKRARFSLKP 122

Query: 121 DARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQ 180
           DA QP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ GAVLKDLNQQNPSTN RQ
Sbjct: 123 DAGQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQMGAVLKDLNQQNPSTNTRQ 182

Query: 181 RRPGILGRSVRYKHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKK 240
           RRPGILGRSVRYKHQYSSITTEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKK
Sbjct: 183 RRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFDSGVFSPLKLGTETHPSPHIIDSEKK 242

Query: 241 TDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI 300
           TDEDVAFEEEEEEEE V S TKAEN+VN ILDE LS NCEDLEGDRAINILQE LQIKP+
Sbjct: 243 TDEDVAFEEEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKPL 302

Query: 301 NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPS 360
            LEKLCLPDLEAIPTMNLKS+  NLSKRS ISVDNQLQ+ ETLKSK+D+E LVN VSTPS
Sbjct: 303 TLEKLCLPDLEAIPTMNLKSTRGNLSKRSLISVDNQLQKTETLKSKEDNENLVNLVSTPS 362

Query: 361 SIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPYLFELSNYLSDAVGIAEQSSI 420
           S+RSPLAS+SALNRRISLSNSS                             VGI E SS+
Sbjct: 363 SMRSPLASLSALNRRISLSNSS-----------------------------VGITEHSSV 422

Query: 421 SKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMET 480
           SKLK LLT+D GT+ANGI+PSKIL GD DSMSK+SSSN+LNV QVG +TALSGT+AS + 
Sbjct: 423 SKLKPLLTRDGGTIANGIQPSKILSGD-DSMSKISSSNILNVLQVGSNTALSGTYASTDA 482

Query: 481 KDVSGSRTEVEVNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQ 540
           K+VSGS T+VE+NEKLSCLE   DVVANMQ+ DH+GSASEQP  S+VD+I+EYPVGI+SQ
Sbjct: 483 KNVSGSSTDVEINEKLSCLEAQADVVANMQI-DHQGSASEQPKLSEVDLIEEYPVGIRSQ 542

Query: 541 LDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ 600
           LDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQPNSSKVD+IKEYPVGIQ QLDQ
Sbjct: 543 LDQSAATCTENIVDGSSRSSGTEHHDEMEDHEGSASEQPNSSKVDMIKEYPVGIQIQLDQ 602

Query: 601 S--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSG 660
           S  T TC E IV+G SRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SG
Sbjct: 603 STTTTTCAEKIVDGTSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWKSG 662

Query: 661 VRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTMKVKSLVSNEYK 720
           VRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP KGNG+PTMKVKSLVSNEYK
Sbjct: 663 VRRSTRFKIRPLEYWKGERMLYGRVHESLATVIGLKYVSPEKGNGKPTMKVKSLVSNEYK 700

Query: 721 DLVELAALH 725
           DLV+LAALH
Sbjct: 723 DLVDLAALH 700

BLAST of HG10023321 vs. TAIR 10
Match: AT1G15660.1 (centromere protein C)

HSP 1 Score: 229.6 bits (584), Expect = 8.2e-60
Identity = 230/755 (30.46%), Positives = 350/755 (46.36%), Query Frame = 0

Query: 13  DPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDG 72
           DPL AYSG++LF    ++L +P  P     DL   H  L+SM     S+  EQA++IL+ 
Sbjct: 15  DPLQAYSGLSLFPRTLKSLSNPLPPSYQSEDLQQTHTLLQSMPFEIQSEHQEQAKAILE- 74

Query: 73  NSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRKRARFSLKPDARQPSVNLEPT 132
                           + D +  +    N  ERRP L+RKR  FSL     QP   + P+
Sbjct: 75  ----------------DVDVDVQLNPIPNKRERRPGLDRKRKSFSLHLTTSQPP-PVAPS 134

Query: 133 FDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVR- 192
           FD  +    E+FF AY++ E A +E QKQTG+ + D+ +  PS  +R RRPGI GR  R 
Sbjct: 135 FDPSKYPRSEDFFAAYDKFELANREWQKQTGSSVIDIQENPPS--RRPRRPGIPGRKRRP 194

Query: 193 YKHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEE 252
           +K  ++     D  N++ S+  +   S        E+  + H+   +++ D+        
Sbjct: 195 FKESFTDSYFTDVINLEASEKEIPIASEQ----SLESATAAHVTTVDREVDD-------- 254

Query: 253 EEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPINLEKLCLPDLE 312
                  S    +  +N +L +LL+ + E+LEGD AI +L+E LQIK  N+EK  +P+ +
Sbjct: 255 -------STVDTDKDLNNVLKDLLACSREELEGDGAIKLLEERLQIKSFNIEKFSIPEFQ 314

Query: 313 AIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSA 372
            +  MNLK+S  N   R  +S    +Q I  LK         N V+   +  SP      
Sbjct: 315 DVRKMNLKASGSNPPNRKSLS---DIQNI--LKG-------TNRVAVRKNSHSPSPQTI- 374

Query: 373 LNRRISLSNSSGDPFSAHDI------GQSPAR---DPYLFELSNYLSDAVGIAEQSS--I 432
             +  S  N   D FS  DI       Q P+     P   ++ N     VG  + +S   
Sbjct: 375 --KHFSSPNPPVDQFSFPDIHNLLPGDQQPSEVNVQPIAKDIPNTSPTNVGTVDVASPFN 434

Query: 433 SKLKSLLTKDSGTVANGIKPSKILFGD------VDSMSKMSSS----NV-LNVPQVGVDT 492
             +     +D   + +GI  S +          +DS+S  SS+    NV +      VD 
Sbjct: 435 DSVVKRSGEDDSHIHSGIHRSHLSRDGNPDICVMDSISNRSSAMLQKNVDMRTKGKEVDV 494

Query: 493 ALSGTHASMETKDVSGSRTEVEVNEKLSCLE--------DVVANMQMED-----HEGSAS 552
            +S + A+  T D      + E+NE+   LE        +V     +E+      +G++S
Sbjct: 495 PMSESGANRNTGD---RENDAEINEETDNLERLAECASKEVTRPFTVEEDSIPYQQGASS 554

Query: 553 EQPNSSKVDVIKEYPVGIQSQLDQATATC----TENIVDGPSRCSGMDHADEME--DHEG 612
           + PN +     ++Y   +   L+ A         EN+  G +    +++A E+    H+ 
Sbjct: 555 KSPNRAP----EQYNT-MGGSLEHAEHNQGLHEEENVNTGSASGLQVENAPEVHKYSHKQ 614

Query: 613 LAIEQPNSSKVDVIKEYPVGIQSQLDQSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRA 672
               +   S    +K+    +  +        T    +   + +    ++ E+ KPK   
Sbjct: 615 TNKRRKRGSSDSNVKKRSKTVHGETGGDKQMKTLPHESRAKKQTKGKSNEREEKKPKKTL 674

Query: 673 NKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGL 725
               +GK  S R+SLA AGT  + GVRRSTR K+RPLEYW+GER LYGR+HESL TVIG+
Sbjct: 675 T--HEGKLFSCRKSLAAAGTKIEGGVRRSTRIKSRPLEYWRGERFLYGRIHESLTTVIGI 705
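
The percentages in each HSP header are the match counts divided by the alignment length. A minimal Python sketch for recomputing them from a header line follows; the function name `check_hsp_stats` and the regular expression are illustrative assumptions and not part of the database, and the pattern tolerates both spellings of the "Positives" label.

```python
import re

# Hypothetical pattern for the HSP statistics line shown in the alignments above.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line):
    """Recompute identity and positives percentages from their fractions."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    identity = 100 * int(ident) / int(ident_len)
    positives = 100 * int(pos) / int(pos_len)
    return {
        "identity_pct": round(identity, 2),
        "positives_pct": round(positives, 2),
        # True when both recomputed values agree with the reported ones
        "matches_reported": (
            abs(identity - float(ident_pct)) < 0.01
            and abs(positives - float(pos_pct)) < 0.01
        ),
    }


stats = check_hsp_stats(
    "Identity = 610/729 (83.68%), Positives = 652/729 (89.44%), Query Frame = 0"
)
print(stats)
```

The example line is the A0A1S3CDU7 header; the same check can be run on any of the other HSP headers.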

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_038896841.1  0.0e+00  88.03     centromere protein C isoform X2 [Benincasa hispida]
XP_011659552.1  0.0e+00  85.30     centromere protein C isoform X3 [Cucumis sativus] >KGN45338.1 hypothetical prote...
XP_031745137.1  0.0e+00  85.16     centromere protein C isoform X4 [Cucumis sativus]
XP_031745135.1  0.0e+00  84.38     centromere protein C isoform X1 [Cucumis sativus]
XP_031745136.1  0.0e+00  84.10     centromere protein C isoform X2 [Cucumis sativus]

Match Name      E-value  Identity  Description
Q66LG9          1.2e-58  30.46     Centromere protein C OS=Arabidopsis thaliana OX=3702 GN=CENPC PE=2 SV=1

Match Name      E-value  Identity  Description
A0A0A0K774      0.0e+00  85.30     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G440590 PE=3 SV=1
A0A1S3CDU7      0.0e+00  83.68     uncharacterized protein LOC103499749 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7UUE4      0.0e+00  83.65     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A1S3CDU5      0.0e+00  83.40     uncharacterized protein LOC103499749 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S4E341      0.0e+00  80.52     uncharacterized protein LOC103499749 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10...

Match Name      E-value  Identity  Description
AT1G15660.1     8.2e-60  30.46     centromere protein C
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                 Source       Source Term  Source Description    Alignment
None       No IPR available                COILS        Coil         Coil                  coord: 260..280
None       No IPR available                MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 199..236
None       No IPR available                MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 604..657
None       No IPR available                MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 614..629
None       No IPR available                MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 166..182
None       No IPR available                MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 166..187
None       No IPR available                MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 202..227
IPR028386  Centromere protein C/Mif2/cnp3  PANTHER      PTHR16684    CENTROMERE PROTEIN C  coord: 45..723
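
Several of the mobidb-lite disorder ranges listed above overlap one another. As a rough sketch, assuming the `coord: start..end` values are 1-based inclusive protein positions, the following Python merges them into non-redundant regions; the helper names `parse_coords` and `merge_regions` are hypothetical.

```python
import re

COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def parse_coords(rows):
    """Extract (start, end) pairs from 'coord: start..end' strings."""
    return [tuple(map(int, COORD.search(row).groups())) for row in rows]


def merge_regions(regions):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(regions):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous region instead of starting a new one
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# The six mobidb-lite rows from the InterPro table above
rows = [
    "coord: 199..236", "coord: 604..657", "coord: 614..629",
    "coord: 166..182", "coord: 166..187", "coord: 202..227",
]
merged = merge_regions(parse_coords(rows))
covered = sum(end - start + 1 for start, end in merged)
print(merged, covered)
```

Under that assumption the six rows reduce to three regions, all of which lie inside the PANTHER match at 45..723.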

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10023321.1  HG10023321.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0051315 attachment of mitotic spindle microtubules to kinetochore
biological_process GO:0051382 kinetochore assembly
biological_process GO:0051455 monopolar spindle attachment to meiosis I kinetochore
cellular_component GO:0000776 kinetochore
cellular_component GO:0005634 nucleus
molecular_function GO:0019237 centromeric DNA binding