Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDS, polypeptide
CTAAATTTCCAGGTCTCAAGAAGTCCTAGTAAACTTATAGAGCAGGCCAGAGCCATTTTAGACGGTAACTCAAATGTGATGCACTCTGAAATTGCAACATTTCTTGTACATGATGATGAAAACAAAGAAACTACAGCAAAGGTGGAGGAAAATCCACAAGAAAGAAGGCCAGCATTAAACCGTAAGCGGGCAAGGTTTTCTTTAAAACCTGATACTAGGTAATGCATATAATTTACCTGCCTTTTCTTAAGAAAGGAAAAAAAAATCTGGTTGACTTTTATGATATCGTTATCTGATAGACAACCTGCTGTTAACTTGGAGGCAACATTCAACATTAAACAATTGAAAGACCCCGAGGAGTTCTTTTTGGCCTTTGAAAGGCTTGAAAGTAAGTTGTGCTAATTTGTTTTACAACCCAAATTTCAGATGCATTTAGAATTCACATATTTTTCTTCCCATTGTGCAATTGCTCCATAGATGCCAAAATAGAAATACAGAAACAAACGGGAGGAGTTTTGAAGGACTTGAACCAACAGAATCCATCCACGAATACACGCCACCGTAGACCAGGGATTCTTGGGTATAATCACTACCATGCTATTATATTAACAAAATTTTGTGTCTTTTCTTTGCGGCATCATTTTTGTTTGGGAACCAAAATTTGGTTGTTGTCCATGCAGGAGGTCTGTTAGATACAAGCATCAATATTCATCAATAACATCTGAAGATGATCAGAACGTAGAACCCTCTCAAGTTACATTTGAATCAGGTAATATTAGTCCATCAATAATGGGCACAGAAAAATGTCCCAGTCCACCTATAATTGGCTCAGAAAAGAGAACTGGTGAACATGTCCCATTTGAGGAGGAGGAGGAGGAGGAGGAGTTAGTTAGTAAGTAATTTCTAATGGAGATTTATAATGCAACATTCTGCAAGCATCATGGTCCAGTAGATTTTAGATATCTCAAATATCATTGTCCTCTCCCCCTTTCCTCTCCCTCTCTTATTTTTTCATATTTCGGTGTCAGTGTAGACCTTTTTTTGATGCTATGATATTTTTTTCCTCTTTGCAGCCTCAATCACCAAGTCAGAGAACAAAGTGAATAGAATTTTGGATGAGTTACTGTCGGCTAACTGTGAAGATTTAGAAGGTGATCGAGCCATCAACAAATTACAGGAATGCTTGCAGATTAAACCCATCAATTTAGAGAAATTATGCCTTCCAGATTTGCAAGCTATTCAGACAGTGAATTTGAAATCTTCAAGAGGCAATGCGCCAAAGCGTAGTTTGATCAGTGTGGACAATCAATTACAAAGGATAGAGACTTCGAAGTTTAAGCAGGATGATGAAAGTTCTGTTCATCTGCTTTCTACGCCATCCTCAATGAAAAGTCCATTGGCATCAGTATTAGCCCTAAATAGGCAAATTTTGCTTTCAAATTCATCAAGTGATCCATTTTCAGCTCATGACATTGACAAGTCCCCAGCAAGAAATCCTTCCTTTTCTGAACACATTAATCACTTGTCTGACATAGTTGATATTGCAAAGCAGTCGAGTGTTTCTAAACTGAAGTCACCTTTAACCAAAGATGGTGAGGCTGTACCTAATGGAATTAGGTCACCCAAATGCCCTATTGGAGATGTTGATTCTATGTCTAAAATATCTTTGGCTAATGTTTTAAATGTACCCGAAGTTGGTGGCAATGCTGCCTTAAATGGAAGTCATGCCAGCATGGAAGCTAAGGAAATTAGTGGTAGCGACACAGAAGTGGAAGTAAATAAGAAATTGAGTTGTCTTGGAGCCAAAGCAGATGGTTTGGCTAATGCATCAAATGCATTGGATGATGAGGTGCATTTTGTTGTGCTAGTAAATAAGAAATTGAGTTGTCTTGGAGCCAAAGCAGATGGTTTGGCTAATGCATCAAATGCATTGGATGATGAGGTGCATTTTGTTGTGCTAGTTACTATTTTTACCGCTGCCCTTTAGAGTTATAAATGTT
TTGAAGTTGGATATTGTGTTTGTTGTGACAATAAACAGATGGAAGATCATGATGAATTAGCTTCAGAGCAACTGAACACATCCAAGGTGGATGCGACTAAAGAGTATCCGTTTGGTATTCAGAGTCAGTTGGGTATGATCACCAGTACCAGAATCATTAGTTGCTCTAGCTGATAATTTCTTTTACAATCTTGTCTAAGGGAAATTATAACCAAACGTTATGTTATTGCTCATCTTAGTGCTTTTGATGGTTGGACTTTTTTTTTAATAATAAATTTAGGATAAATATGTTCTTCTAGATCTGAGGACTAATATATAACTTTATAAGAAATATGTCCTTGATTTTGAAATAAATTTTAGTAGTTAAGCGTGTACCACTTCAATGCTTCACATTATGAAGGAACTCTATAACTATGATGTGTTCTTCTGAATGAAAAGACACATGCTAAAATATTTTTCAATCATGTTATAATTATGAATTGATTTCAGCTTTCTCATATGAAGAACCATTTTCTTTCTTCACAGATTGCACTGTTTATGTTCTTCCATTTTTAGGTTGAAAAGGCAGAAATTAATTGTTTGCTTAATAATTCCCAGTAGATGGATTAGATGATTTCAGACTCTGCATCCTTTGTTTTTTAACAGATCAATCAACTGCTACTTCTACTGATAATAATGTAGACGGGGTGTCCAGGAGCAGTGGAACGGATCACCATGATAAGGTTTTTGACCTTTCCTTTCTTTTTTCCTCTCTTGGACCTTTATTTCCCGTTGCTAATTATTGATGACTTGAAGTGCTTTGTCAAATTATGGTGTTTGTGAAATATGAAGACTTTCTTTATTGTGGTTACCATTGTCCTAAGGAAGTACTGTTTTTTCTTTTTCTAATTAATATATATATATATATATATATATATATATATATATAAATAAAATAAACAATTAACACTTAATTATTTATTGCAAAAATAGAATCATATTCTATAGCNGTTCATTTTTTCCTGCCCGTGTTTGTTTCATAATTCTTAATTTCTCAAAACTACTAATATTGTCACATTATTGTTTACTATTGCTTGATTTCATGTGCTATTCTATTTTTATTAATGTATTGCCTGCTAAAATTTAAAATTAGTAGCAGCATACCAACTCCACTCGCTGCCCCTATGTACTCCGTAGAACATTAGGAATAAGTAAATTAACTTGTCACCACCGAAGGAAGAAATTAGAAAATGATAAAGACCCCTCCTTTTGTTAACTCTACTTGTTTGCTAGAACATGGTTGTTACTGATTTGACTTTTGAGAGAAAATCAAGGATTTTTTGGGTTTTTGATGTATGAACAGGTCAAGCCAAAATCTCATGCAAACAAACAACGCAAAGACAAAAACATTTCTCGGAGGCAAAGCCTTGCAGGTGTTTATACGTAAATTTAACTCAAATTTTAAATTCTATATTATTTAGATTTTTCTTCTGAAAATACTTTGCATATCTGCTAATCTTTCCAGGGGCTGGTACAAAGTGGGAAAGTGGGGTAAGAAGAAGTACGAGGTTCAAAACACGACCTTTGGAGTATTGGAAAGGTGAAAGGCTGTTGTATGGACGTGTACATCAGAGTAAGTGGACGCTTATTATTGCAACTTTCTTTGTTAATGTCTTTCATCAATAGGCATATCCATATGTACATGTTTCTCTTTCTCACAAATTCCTTATTTAAGACTCTTTTCGATTGACAATATTTTGTTAGTCCTGAATGTGGACATGCCTAAAGTCGAGTTTTTCTTCTTGTTCAATTTTAATAAACCATTATATGAACTGGTAAAATAAATGAGTGTTCTTCGATCTTATTCCATGGTGCTATTTCATGTGCATGTAAACTAATTACCGGACTCCTCCTTAGGCCTGGCAACAGTAATCGGGATGAAGTATGTGTCTCCAGCAAAAGGTAATGGCCAACCAACTCTGAAGGTGAAGTCTCTGGTCTCTAACAAGTACAAAGAACTAG
TTGAGTTTGCAGCTCTGCAC
mRNA sequence
CTAAATTTCCAGGTCTCAAGAAGTCCTAGTAAACTTATAGAGCAGGCCAGAGCCATTTTAGACGGTAACTCAAATGTGATGCACTCTGAAATTGCAACATTTCTTGTACATGATGATGAAAACAAAGAAACTACAGCAAAGGTGGAGGAAAATCCACAAGAAAGAAGGCCAGCATTAAACCGTAAGCGGGCAAGGTTTTCTTTAAAACCTGATACTAGACAACCTGCTGTTAACTTGGAGGCAACATTCAACATTAAACAATTGAAAGACCCCGAGGAGTTCTTTTTGGCCTTTGAAAGGCTTGAAAATGCCAAAATAGAAATACAGAAACAAACGGGAGGAGTTTTGAAGGACTTGAACCAACAGAATCCATCCACGAATACACGCCACCGTAGACCAGGGATTCTTGGGAGGTCTGTTAGATACAAGCATCAATATTCATCAATAACATCTGAAGATGATCAGAACGTAGAACCCTCTCAAGTTACATTTGAATCAGGTAATATTAGTCCATCAATAATGGGCACAGAAAAATGTCCCAGTCCACCTATAATTGGCTCAGAAAAGAGAACTGGTGAACATGTCCCATTTGAGGAGGAGGAGGAGGAGGAGGAGTTAGTTACCTCAATCACCAAGTCAGAGAACAAAGTGAATAGAATTTTGGATGAGTTACTGTCGGCTAACTGTGAAGATTTAGAAGGTGATCGAGCCATCAACAAATTACAGGAATGCTTGCAGATTAAACCCATCAATTTAGAGAAATTATGCCTTCCAGATTTGCAAGCTATTCAGACAGTGAATTTGAAATCTTCAAGAGGCAATGCGCCAAAGCGTAGTTTGATCAGTGTGGACAATCAATTACAAAGGATAGAGACTTCGAAGTTTAAGCAGGATGATGAAAGTTCTGTTCATCTGCTTTCTACGCCATCCTCAATGAAAAGTCCATTGGCATCAGTATTAGCCCTAAATAGGCAAATTTTGCTTTCAAATTCATCAAGTGATCCATTTTCAGCTCATGACATTGACAAGTCCCCAGCAAGAAATCCTTCCTTTTCTGAACACATTAATCACTTGTCTGACATAGTTGATATTGCAAAGCAGTCGAGTGTTTCTAAACTGAAGTCACCTTTAACCAAAGATGGTGAGGCTGTACCTAATGGAATTAGGTCACCCAAATGCCCTATTGGAGATGTTGATTCTATGTCTAAAATATCTTTGGCTAATGTTTTAAATGTACCCGAAGTTGGTGGCAATGCTGCCTTAAATGGAAGTCATGCCAGCATGGAAGCTAAGGAAATTAGTGGTAGCGACACAGAAGTGGAAGTAAATAAGAAATTGAGTTGTCTTGGAGCCAAAGCAGATGGTTTGGCTAATGCATCAAATGCATTGGATGATGAGGTGCATTTTGTTGTGCTAGTAAATAAGAAATTGAGTTGTCTTGGAGCCAAAGCAGATGGTTTGGCTAATGCATCAAATGCATTGGATGATGAGGTGCATTTTATGGAAGATCATGATGAATTAGCTTCAGAGCAACTGAACACATCCAAGGTGGATGCGACTAAAGAGTATCCGTTTGGTATTCAGAGTCAGTTGGATCAATCAACTGCTACTTCTACTGATAATAATGTAGACGGGGTGTCCAGGAGCAGTGGAACGGATCACCATGATAAGGTCAAGCCAAAATCTCATGCAAACAAACAACGCAAAGACAAAAACATTTCTCGGAGGCAAAGCCTTGCAGGGGCTGGTACAAAGTGGGAAAGTGGGGTAAGAAGAAGTACGAGGTTCAAAACACGACCTTTGGAGTATTGGAAAGGTGAAAGGCTGTTGTATGGACGTGTACATCAGAGCCTGGCAACAGTAATCGGGATGAAGTATGTGTCTCCAGCAAAAGGTAATGGCCAACCAACTCTGAAGGTGAAGTCTCTGGTCTCTAACAAGTACAAAGAACTAGTTGAGTTTGCAGCTCTGCAC
Coding sequence (CDS)
CTAAATTTCCAGGTCTCAAGAAGTCCTAGTAAACTTATAGAGCAGGCCAGAGCCATTTTAGACGGTAACTCAAATGTGATGCACTCTGAAATTGCAACATTTCTTGTACATGATGATGAAAACAAAGAAACTACAGCAAAGGTGGAGGAAAATCCACAAGAAAGAAGGCCAGCATTAAACCGTAAGCGGGCAAGGTTTTCTTTAAAACCTGATACTAGACAACCTGCTGTTAACTTGGAGGCAACATTCAACATTAAACAATTGAAAGACCCCGAGGAGTTCTTTTTGGCCTTTGAAAGGCTTGAAAATGCCAAAATAGAAATACAGAAACAAACGGGAGGAGTTTTGAAGGACTTGAACCAACAGAATCCATCCACGAATACACGCCACCGTAGACCAGGGATTCTTGGGAGGTCTGTTAGATACAAGCATCAATATTCATCAATAACATCTGAAGATGATCAGAACGTAGAACCCTCTCAAGTTACATTTGAATCAGGTAATATTAGTCCATCAATAATGGGCACAGAAAAATGTCCCAGTCCACCTATAATTGGCTCAGAAAAGAGAACTGGTGAACATGTCCCATTTGAGGAGGAGGAGGAGGAGGAGGAGTTAGTTACCTCAATCACCAAGTCAGAGAACAAAGTGAATAGAATTTTGGATGAGTTACTGTCGGCTAACTGTGAAGATTTAGAAGGTGATCGAGCCATCAACAAATTACAGGAATGCTTGCAGATTAAACCCATCAATTTAGAGAAATTATGCCTTCCAGATTTGCAAGCTATTCAGACAGTGAATTTGAAATCTTCAAGAGGCAATGCGCCAAAGCGTAGTTTGATCAGTGTGGACAATCAATTACAAAGGATAGAGACTTCGAAGTTTAAGCAGGATGATGAAAGTTCTGTTCATCTGCTTTCTACGCCATCCTCAATGAAAAGTCCATTGGCATCAGTATTAGCCCTAAATAGGCAAATTTTGCTTTCAAATTCATCAAGTGATCCATTTTCAGCTCATGACATTGACAAGTCCCCAGCAAGAAATCCTTCCTTTTCTGAACACATTAATCACTTGTCTGACATAGTTGATATTGCAAAGCAGTCGAGTGTTTCTAAACTGAAGTCACCTTTAACCAAAGATGGTGAGGCTGTACCTAATGGAATTAGGTCACCCAAATGCCCTATTGGAGATGTTGATTCTATGTCTAAAATATCTTTGGCTAATGTTTTAAATGTACCCGAAGTTGGTGGCAATGCTGCCTTAAATGGAAGTCATGCCAGCATGGAAGCTAAGGAAATTAGTGGTAGCGACACAGAAGTGGAAGTAAATAAGAAATTGAGTTGTCTTGGAGCCAAAGCAGATGGTTTGGCTAATGCATCAAATGCATTGGATGATGAGGTGCATTTTGTTGTGCTAGTAAATAAGAAATTGAGTTGTCTTGGAGCCAAAGCAGATGGTTTGGCTAATGCATCAAATGCATTGGATGATGAGGTGCATTTTATGGAAGATCATGATGAATTAGCTTCAGAGCAACTGAACACATCCAAGGTGGATGCGACTAAAGAGTATCCGTTTGGTATTCAGAGTCAGTTGGATCAATCAACTGCTACTTCTACTGATAATAATGTAGACGGGGTGTCCAGGAGCAGTGGAACGGATCACCATGATAAGGTCAAGCCAAAATCTCATGCAAACAAACAACGCAAAGACAAAAACATTTCTCGGAGGCAAAGCCTTGCAGGGGCTGGTACAAAGTGGGAAAGTGGGGTAAGAAGAAGTACGAGGTTCAAAACACGACCTTTGGAGTATTGGAAAGGTGAAAGGCTGTTGTATGGACGTGTACATCAGAGCCTGGCAACAGTAATCGGGATGAAGTATGTGTCTCCAGCAAAAGGTAATGGCCAACCAACTCTGAAGGTGAAGTCTCTGGTCTCTAACAAGTACAAAGAACTAGTTGAGTTTGCAGCTCTGCAC
Protein sequence
LNFQVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALNRKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTGGVLKDLNQQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCPSPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDESSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSDIVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAALNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDEVHFVVLVNKKLSCLGAKADGLANASNALDDEVHFMEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTDNNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRPLEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH
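The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The sketch below is a minimal illustration that assumes the standard genetic code. The helper name `translate` and the table layout are choices made for this example and are not part of the database record. It translates only the first 15 bases of the CDS, which should give the first five residues of the protein.

```python
# Standard genetic code, with codons ordered by the bases T, C, A, G
# at the first, second and third position (assumption: nuclear standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon (hypothetical helper).

    Codons containing a base other than T, C, A or G (for example N)
    are reported as 'X'. A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if any(base not in BASES for base in codon):
            protein.append("X")
            continue
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


# First 15 bases of the CDS listed above.
print(translate("CTAAATTTCCAGGTC"))  # LNFQV
```

Applying the same function to the full CDS string should reproduce the protein sequence, provided the sequence is read in frame 0 as the BLAST reports state.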
Homology
BLAST of MS009459 vs. NCBI nr
Match:
XP_022154052.1 (centromere protein C isoform X1 [Momordica charantia] >XP_022154062.1 centromere protein C isoform X2 [Momordica charantia])
HSP 1 Score: 1164.8 bits (3012), Expect = 0.0e+00
Identity = 620/658 (94.22%), Positives = 620/658 (94.22%), Query Frame = 0
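The two HSP header lines carry the bit score, raw score, E-value and identity counts in a fixed textual layout. The sketch below shows one way to pull those numbers out and recompute the identity percentage. The function name `parse_hsp_header`, the regular expressions and the dictionary keys are assumptions made for this example and do not come from any BLAST library.

```python
import re

# Patterns written for the header layout shown on this page (assumption).
HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
HSP_IDENTITY = re.compile(r"Identity\s*=\s*(\d+)/(\d+)")


def parse_hsp_header(score_line: str, identity_line: str) -> dict:
    """Extract scores and identity counts from two HSP header lines."""
    score = HSP_SCORE.search(score_line)
    ident = HSP_IDENTITY.search(identity_line)
    if score is None or ident is None:
        raise ValueError("unrecognised HSP header layout")
    identical, length = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": float(score.group(3)),
        "identical": identical,
        "aligned_length": length,
        # Recomputed from the counts rather than read from the text.
        "percent_identity": round(100 * identical / length, 2),
    }


hsp = parse_hsp_header(
    "HSP 1 Score: 1164.8 bits (3012), Expect = 0.0e+00",
    "Identity = 620/658 (94.22%)",
)
print(hsp["percent_identity"])  # 94.22
```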
Query: 1 LNFQVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 60
L VSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN
Sbjct: 51 LKSMVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 110
Query: 61 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTGGVLKDLN 120
RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQT GVLKDLN
Sbjct: 111 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTRGVLKDLN 170
Query: 121 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP 180
QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP
Sbjct: 171 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP 230
Query: 181 SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK 240
SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK
Sbjct: 231 SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK 290
Query: 241 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 300
LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE
Sbjct: 291 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 350
Query: 301 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 360
SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD
Sbjct: 351 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 410
Query: 361 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 420
IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA
Sbjct: 411 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 470
Query: 421 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDEVHFVVLVNKKLSCL 480
LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDE
Sbjct: 471 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDE-------------- 530
Query: 481 GAKADGLANASNALDDEVHFMEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD 540
MEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD
Sbjct: 531 --------------------MEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD 590
Query: 541 NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP 600
NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP
Sbjct: 591 NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP 650
Query: 601 LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH 659
LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH
Sbjct: 651 LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH 674
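The identity count in an HSP header is the number of alignment columns in which query and subject carry the same residue. The sketch below counts those columns for a single alignment row. The function name `count_identities` is an assumption made for this example, and the two strings are the query and subject segments for positions 61 to 120 of the alignment above.

```python
def count_identities(query: str, subject: str) -> int:
    """Count columns where both aligned rows hold the same residue.

    Gap characters ('-') never count as identities. Both rows must
    already be aligned, so they must have the same length.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


# Query and Sbjct rows for alignment positions 61-120, copied from above.
query = "RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTGGVLKDLN"
subject = "RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTRGVLKDLN"

print(count_identities(query, subject), "of", len(query))  # 59 of 60
```

The single mismatch in this row is the G/R substitution that appears as a blank in the middle line of the alignment.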
BLAST of MS009459 vs. NCBI nr
Match:
XP_022154071.1 (centromere protein C isoform X3 [Momordica charantia])
HSP 1 Score: 1062.8 bits (2747), Expect = 1.2e-306
Identity = 578/658 (87.84%), Positives = 578/658 (87.84%), Query Frame = 0
Query: 1 LNFQVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 60
L VSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN
Sbjct: 51 LKSMVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 110
Query: 61 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTGGVLKDLN 120
RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQT GVLKDLN
Sbjct: 111 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTRGVLKDLN 170
Query: 121 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP 180
QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFES
Sbjct: 171 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFES-------------- 230
Query: 181 SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK 240
SITKSENKVNRILDELLSANCEDLEGDRAINK
Sbjct: 231 ---------------------------ASITKSENKVNRILDELLSANCEDLEGDRAINK 290
Query: 241 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 300
LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE
Sbjct: 291 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 350
Query: 301 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 360
SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD
Sbjct: 351 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 410
Query: 361 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 420
IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA
Sbjct: 411 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 470
Query: 421 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDEVHFVVLVNKKLSCL 480
LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDE
Sbjct: 471 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDE-------------- 530
Query: 481 GAKADGLANASNALDDEVHFMEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD 540
MEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD
Sbjct: 531 --------------------MEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD 590
Query: 541 NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP 600
NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP
Sbjct: 591 NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP 633
Query: 601 LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH 659
LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH
Sbjct: 651 LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH 633
BLAST of MS009459 vs. NCBI nr
Match:
XP_023548004.1 (centromere protein C-like isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 864.0 bits (2231), Expect = 8.4e-247
Identity = 475/658 (72.19%), Positives = 534/658 (81.16%), Query Frame = 0
Query: 1 LNFQVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 60
L VSR+PSKLIEQAR+IL+ NSN+M S+ AT LV +++ +E A VEENPQERRPALN
Sbjct: 51 LKSMVSRNPSKLIEQARSILNSNSNLMQSKAATLLVKNEKKEEAAANVEENPQERRPALN 110
Query: 61 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTGGVLKDLN 120
RKRARFSLKPD RQP VNLE TF+IKQLKDPEEFFLA+ERLENAK EIQKQTG +LKDLN
Sbjct: 111 RKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAILKDLN 170
Query: 121 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP 180
QQNPSTNTR RRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESG+ISPSI+GTEK
Sbjct: 171 QQNPSTNTRQRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGSISPSILGTEKDA 230
Query: 181 SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK 240
SPPII SE +T E VP EEEEEE V SIT +ENKVN+ILDELLSANCEDLEGDRAINK
Sbjct: 231 SPPIICSEMKTNEEVPL-EEEEEEAFVASITNAENKVNKILDELLSANCEDLEGDRAINK 290
Query: 241 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 300
LQECLQIKPINLEKLCLPDL+AIQT+NL+SSRGN P+RSLISVD+QLQRIE K KQDDE
Sbjct: 291 LQECLQIKPINLEKLCLPDLEAIQTMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDE 350
Query: 301 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 360
+SV+ +STP SM+SPLAS+ AL R+I LSNS DPFSAHD+D+S ARNPS E NHLSD
Sbjct: 351 NSVNPISTPFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSD 410
Query: 361 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 420
V IA++ VS+L S LTKD V GI+SPK +GDVDS+SKIS +NVLNVP+ G AA
Sbjct: 411 AVGIAEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAA 470
Query: 421 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDEVHFVVLVNKKLSCL 480
L+ +HA+MEAK+ISGS TEVEVN+KLS L A+AD +A
Sbjct: 471 LSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVA----------------------- 530
Query: 481 GAKADGLANASNALDDEVHFMEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD 540
A+N LDDE MEDH+ SEQ NTSKVDA KEYP G+Q+QLDQSTAT T+
Sbjct: 531 ---------ATNVLDDE---MEDHEGSTSEQPNTSKVDAIKEYPLGVQTQLDQSTATCTE 590
Query: 541 NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP 600
N VDG SRSSGTD+HDKVK KS A QR+ K +S R+SLAGAGT W+ GVRRSTRFKTRP
Sbjct: 591 NIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP 650
Query: 601 LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH 659
LEYWKGERLLYGRVH+SLATVIG+KYVSPAKGNGQPTLKVKSLVS++Y ELVE AALH
Sbjct: 651 LEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTLKVKSLVSSEYNELVELAALH 672
BLAST of MS009459 vs. NCBI nr
Match:
XP_022953572.1 (centromere protein C isoform X1 [Cucurbita moschata])
HSP 1 Score: 859.0 bits (2218), Expect = 2.7e-245
Identity = 474/658 (72.04%), Positives = 532/658 (80.85%), Query Frame = 0
Query: 1 LNFQVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 60
L VSR+PSKLIEQAR+IL+GNSN+M S+ ATFLV +++ +E A VEENPQERRPALN
Sbjct: 51 LKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEENPQERRPALN 110
Query: 61 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTGGVLKDLN 120
RKRARFSLKPD RQP+VNLE TF+IKQLKDPEEFFLA+ERLENAK EIQKQTG +LKDLN
Sbjct: 111 RKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAILKDLN 170
Query: 121 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP 180
QQNPSTNTR RRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESG+ISPSI+GTEK
Sbjct: 171 QQNPSTNTRQRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGSISPSILGTEKDA 230
Query: 181 SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK 240
SPPII SE +T E VP EEEEE V SIT +ENKVN+ILDELLSANCEDLEGDRAINK
Sbjct: 231 SPPIICSEMKTNEEVPL--EEEEEAFVASITNAENKVNKILDELLSANCEDLEGDRAINK 290
Query: 241 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 300
LQECLQIKPINLEKLCLPDL+AIQT NL+SSRGN P+RSLISVD+QLQRIE K KQDDE
Sbjct: 291 LQECLQIKPINLEKLCLPDLEAIQTTNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDE 350
Query: 301 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 360
+SV+ +STP SM+SPLAS+ AL R+I LSNS DPFSAHD+D+S ARNPS E NHLSD
Sbjct: 351 NSVNPISTPFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSD 410
Query: 361 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 420
V IA++ VS+L S LTKD V GI+SPK +GDVDS+SKIS +NVLNVP+ G AA
Sbjct: 411 AVGIAEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAA 470
Query: 421 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDEVHFVVLVNKKLSCL 480
L+ + A+MEAK+ISGS TEVEVN+KLS L A+AD +A
Sbjct: 471 LSETRANMEAKDISGSSTEVEVNEKLSFLEAQADAVA----------------------- 530
Query: 481 GAKADGLANASNALDDEVHFMEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD 540
A+N LDDE MEDH+ SEQ NTSKVDA KEYP GIQ+QLDQS AT T+
Sbjct: 531 ---------ATNVLDDE---MEDHEGSTSEQPNTSKVDAIKEYPIGIQTQLDQSIATCTE 590
Query: 541 NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP 600
N VD SRSSGTD+HDKVK KS A QR+ K +S R+SLAGAGT W+ GVRRSTRFKTRP
Sbjct: 591 NIVDRPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP 650
Query: 601 LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH 659
LEYWKGERLLYGRVH+SLATVIG+KYVSPAKGNGQPTLKVKSLVS++Y ELVE AALH
Sbjct: 651 LEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTLKVKSLVSSEYNELVELAALH 671
BLAST of MS009459 vs. NCBI nr
Match:
XP_022992183.1 (centromere protein C-like isoform X1 [Cucurbita maxima])
HSP 1 Score: 859.0 bits (2218), Expect = 2.7e-245
Identity = 473/658 (71.88%), Positives = 534/658 (81.16%), Query Frame = 0
Query: 1 LNFQVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 60
L VSR+PSKLIEQAR+IL+GNSN+M S+ ATFLV +++ +E A VEENPQERRPALN
Sbjct: 51 LKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEENPQERRPALN 110
Query: 61 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTGGVLKDLN 120
RKRARFSLKPD RQP VNLE TF+IKQLKDPEEFFLA+ERLENAK EIQKQTG +LKDLN
Sbjct: 111 RKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAILKDLN 170
Query: 121 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP 180
QQNPSTNTR RRPGILGRSVRYKHQYSSITSEDDQ VEPSQVTFESG+ISPS +GTEK
Sbjct: 171 QQNPSTNTRQRRPGILGRSVRYKHQYSSITSEDDQTVEPSQVTFESGSISPSTLGTEKDA 230
Query: 181 SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK 240
SPPII SE +T E VPF EEEEEE V SIT +ENKVN+ILDELLSANCEDLEGD+AINK
Sbjct: 231 SPPIICSEMKTNEEVPF-EEEEEEAFVASITNAENKVNKILDELLSANCEDLEGDQAINK 290
Query: 241 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 300
LQECLQIKPINLEKLCLPDL+AIQT+NL+SSRGN P+RSLISVD+QLQRIE K KQDDE
Sbjct: 291 LQECLQIKPINLEKLCLPDLEAIQTMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDE 350
Query: 301 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 360
+SV+ +STP SM+SPLAS+ AL R+I LSNS DPFSAHD+D+S ARNPS E NHLSD
Sbjct: 351 NSVNPISTPFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSD 410
Query: 361 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 420
V IA++ VS+L S LTKD V GI+SPK +GDV+S+SKIS +NVLNVP+ G +AA
Sbjct: 411 AVGIAEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVNSISKISSSNVLNVPQAGADAA 470
Query: 421 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDEVHFVVLVNKKLSCL 480
L+ +HA+MEAK+ISGS EVEVN+KLS L A+AD +A
Sbjct: 471 LSETHANMEAKDISGSSREVEVNEKLSFLEAQADAVA----------------------- 530
Query: 481 GAKADGLANASNALDDEVHFMEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD 540
A+N LDDE MEDH+ SEQ NTSKVDA KEYP GIQ+ LDQSTAT T+
Sbjct: 531 ---------ATNVLDDE---MEDHEGSTSEQPNTSKVDAIKEYPIGIQTLLDQSTATCTE 590
Query: 541 NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP 600
N VDG SRSSGTD+HDKVK KS A QR+ K +S R+SLAGAGT W+ GVRRSTRFKTRP
Sbjct: 591 NIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP 650
Query: 601 LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH 659
LEYWKGERLLYGRVH+SLATVIG+KYVSPAKGNGQPTLKVKSLVS++Y ELVE AALH
Sbjct: 651 LEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTLKVKSLVSSEYNELVELAALH 672
BLAST of MS009459 vs. ExPASy Swiss-Prot
Match:
Q66LG9 (Centromere protein C OS=Arabidopsis thaliana OX=3702 GN=CENPC PE=2 SV=1)
HSP 1 Score: 187.2 bits (474), Expect = 6.0e-46
Identity = 206/684 (30.12%), Positives = 312/684 (45.61%), Query Frame = 0
Query: 28 HSEIATFLVHDDENKETTAKVEENPQERRPALNRKRARFSLKPDTRQPAVNLEATFNIKQ 87
H E A ++ +D + + N +ERRP L+RKR FSL T QP + +F+ +
Sbjct: 64 HQEQAKAIL-EDVDVDVQLNPIPNKRERRPGLDRKRKSFSLHLTTSQPP-PVAPSFDPSK 123
Query: 88 LKDPEEFFLAFERLENAKIEIQKQTGGVLKDLNQQNPSTNTRHRRPGILGRSVR-YKHQY 147
E+FF A+++ E A E QKQTG + D+ + PS R RRPGI GR R +K +
Sbjct: 124 YPRSEDFFAAYDKFELANREWQKQTGSSVIDIQENPPS--RRPRRPGIPGRKRRPFKESF 183
Query: 148 SSITSEDDQNVEPSQVTFESGNISPSIMGTEKCPSPPIIGSEKRTGEHVPFEEEEEEEEL 207
+ D N+E S+ ++ P E T HV + E ++
Sbjct: 184 TDSYFTDVINLEASE---------------KEIPIASEQSLESATAAHVTTVDREVDD-- 243
Query: 208 VTSITKSENKVNRILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLQAIQTV 267
S ++ +N +L +LL+ + E+LEGD AI L+E LQIK N+EK +P+ Q ++ +
Sbjct: 244 --STVDTDKDLNNVLKDLLACSREELEGDGAIKLLEERLQIKSFNIEKFSIPEFQDVRKM 303
Query: 268 NLKSSRGNAPKR-SLISVDNQLQRIETSKFKQDDES-SVHLLSTPSSMKSPLASVLALNR 327
NLK+S N P R SL + N L+ +++ S S + SS P+ +
Sbjct: 304 NLKASGSNPPNRKSLSDIQNILKGTNRVAVRKNSHSPSPQTIKHFSSPNPPVDQFSFPDI 363
Query: 328 QILL------SNSSSDPFSAHDIDKSPARNPSFSEHINHLSDIVDIAKQSSVSKL----- 387
LL S + P A DI + N + + +D V S +
Sbjct: 364 HNLLPGDQQPSEVNVQPI-AKDIPNTSPTNVGTVDVASPFNDSVVKRSGEDDSHIHSGIH 423
Query: 388 KSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAALNGSHASMEAKEI 447
+S L++DG C + + + S L +++ G + S + +
Sbjct: 424 RSHLSRDG-------NPDICVMDSISNRSSAMLQKNVDMRTKGKEVDVPMSESGAN-RNT 483
Query: 448 SGSDTEVEVNKKLSCLGAKADGLANASNALDDEV--HFVVLVNKKLSCLGAKADGLANA- 507
+ + E+N+ + D L + EV F V + GA + A
Sbjct: 484 GDRENDAEINE-------ETDNLERLAECASKEVTRPFTVEEDSIPYQQGASSKSPNRAP 543
Query: 508 ------SNALDDEVHFMEDHDELASEQLNTSKV------DATKEYPFGIQSQLDQSTATS 567
+L+ H H+E E +NT +A + + + + + S
Sbjct: 544 EQYNTMGGSLEHAEHNQGLHEE---ENVNTGSASGLQVENAPEVHKYSHKQTNKRRKRGS 603
Query: 568 TDNNVDGVSRS----SGTDHHDKVKP-KSHANKQRKDKNISR------------------ 627
+D+NV S++ +G D K P +S A KQ K K+ R
Sbjct: 604 SDSNVKKRSKTVHGETGGDKQMKTLPHESRAKKQTKGKSNEREEKKPKKTLTHEGKLFSC 663
Query: 628 RQSLAGAGTKWESGVRRSTRFKTRPLEYWKGERLLYGRVHQSLATVIGMKYVSPAKG-NG 659
R+SLA AGTK E GVRRSTR K+RPLEYW+GER LYGR+H+SL TVIG+KY SP +G
Sbjct: 664 RKSLAAAGTKIEGGVRRSTRIKSRPLEYWRGERFLYGRIHESLTTVIGIKYASPGEGKRD 705
BLAST of MS009459 vs. ExPASy TrEMBL
Match:
A0A6J1DKM1 (centromere protein C isoform X1 OS=Momordica charantia OX=3673 GN=LOC111021402 PE=3 SV=1)
HSP 1 Score: 1164.8 bits (3012), Expect = 0.0e+00
Identity = 620/658 (94.22%), Positives = 620/658 (94.22%), Query Frame = 0
Query: 1 LNFQVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 60
L VSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN
Sbjct: 51 LKSMVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 110
Query: 61 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTGGVLKDLN 120
RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQT GVLKDLN
Sbjct: 111 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTRGVLKDLN 170
Query: 121 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP 180
QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP
Sbjct: 171 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP 230
Query: 181 SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK 240
SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK
Sbjct: 231 SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK 290
Query: 241 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 300
LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE
Sbjct: 291 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 350
Query: 301 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 360
SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD
Sbjct: 351 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 410
Query: 361 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 420
IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA
Sbjct: 411 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 470
Query: 421 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDEVHFVVLVNKKLSCL 480
LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDE
Sbjct: 471 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDE-------------- 530
Query: 481 GAKADGLANASNALDDEVHFMEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD 540
MEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD
Sbjct: 531 --------------------MEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD 590
Query: 541 NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP 600
NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP
Sbjct: 591 NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP 650
Query: 601 LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH 659
LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH
Sbjct: 651 LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH 674
BLAST of MS009459 vs. ExPASy TrEMBL
Match:
A0A6J1DML8 (centromere protein C isoform X3 OS=Momordica charantia OX=3673 GN=LOC111021402 PE=3 SV=1)
HSP 1 Score: 1062.8 bits (2747), Expect = 6.0e-307
Identity = 578/658 (87.84%), Positives = 578/658 (87.84%), Query Frame = 0
Query: 1 LNFQVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 60
L VSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN
Sbjct: 51 LKSMVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 110
Query: 61 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTGGVLKDLN 120
RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQT GVLKDLN
Sbjct: 111 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTRGVLKDLN 170
Query: 121 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP 180
QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFES
Sbjct: 171 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFES-------------- 230
Query: 181 SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK 240
SITKSENKVNRILDELLSANCEDLEGDRAINK
Sbjct: 231 ---------------------------ASITKSENKVNRILDELLSANCEDLEGDRAINK 290
Query: 241 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 300
LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE
Sbjct: 291 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 350
Query: 301 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 360
SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD
Sbjct: 351 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 410
Query: 361 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 420
IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA
Sbjct: 411 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 470
Query: 421 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDEVHFVVLVNKKLSCL 480
LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDE
Sbjct: 471 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDE-------------- 530
Query: 481 GAKADGLANASNALDDEVHFMEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD 540
MEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD
Sbjct: 531 --------------------MEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD 590
Query: 541 NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP 600
NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP
Sbjct: 591 NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP 633
Query: 601 LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH 659
LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH
Sbjct: 651 LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH 633
BLAST of MS009459 vs. ExPASy TrEMBL
Match:
A0A6J1JYG6 (centromere protein C-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488588 PE=3 SV=1)
HSP 1 Score: 859.0 bits (2218), Expect = 1.3e-245
Identity = 473/658 (71.88%), Positives = 534/658 (81.16%), Query Frame = 0
Query: 1 LNFQVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 60
L VSR+PSKLIEQAR+IL+GNSN+M S+ ATFLV +++ +E A VEENPQERRPALN
Sbjct: 51 LKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEENPQERRPALN 110
Query: 61 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTGGVLKDLN 120
RKRARFSLKPD RQP VNLE TF+IKQLKDPEEFFLA+ERLENAK EIQKQTG +LKDLN
Sbjct: 111 RKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAILKDLN 170
Query: 121 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP 180
QQNPSTNTR RRPGILGRSVRYKHQYSSITSEDDQ VEPSQVTFESG+ISPS +GTEK
Sbjct: 171 QQNPSTNTRQRRPGILGRSVRYKHQYSSITSEDDQTVEPSQVTFESGSISPSTLGTEKDA 230
Query: 181 SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK 240
SPPII SE +T E VPF EEEEEE V SIT +ENKVN+ILDELLSANCEDLEGD+AINK
Sbjct: 231 SPPIICSEMKTNEEVPF-EEEEEEAFVASITNAENKVNKILDELLSANCEDLEGDQAINK 290
Query: 241 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 300
LQECLQIKPINLEKLCLPDL+AIQT+NL+SSRGN P+RSLISVD+QLQRIE K KQDDE
Sbjct: 291 LQECLQIKPINLEKLCLPDLEAIQTMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDE 350
Query: 301 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 360
+SV+ +STP SM+SPLAS+ AL R+I LSNS DPFSAHD+D+S ARNPS E NHLSD
Sbjct: 351 NSVNPISTPFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSD 410
Query: 361 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 420
V IA++ VS+L S LTKD V GI+SPK +GDV+S+SKIS +NVLNVP+ G +AA
Sbjct: 411 AVGIAEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVNSISKISSSNVLNVPQAGADAA 470
Query: 421 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDEVHFVVLVNKKLSCL 480
L+ +HA+MEAK+ISGS EVEVN+KLS L A+AD +A
Sbjct: 471 LSETHANMEAKDISGSSREVEVNEKLSFLEAQADAVA----------------------- 530
Query: 481 GAKADGLANASNALDDEVHFMEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD 540
A+N LDDE MEDH+ SEQ NTSKVDA KEYP GIQ+ LDQSTAT T+
Sbjct: 531 ---------ATNVLDDE---MEDHEGSTSEQPNTSKVDAIKEYPIGIQTLLDQSTATCTE 590
Query: 541 NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP 600
N VDG SRSSGTD+HDKVK KS A QR+ K +S R+SLAGAGT W+ GVRRSTRFKTRP
Sbjct: 591 NIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP 650
Query: 601 LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH 659
LEYWKGERLLYGRVH+SLATVIG+KYVSPAKGNGQPTLKVKSLVS++Y ELVE AALH
Sbjct: 651 LEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTLKVKSLVSSEYNELVELAALH 672
BLAST of MS009459 vs. ExPASy TrEMBL
Match:
A0A6J1GNL2 (centromere protein C isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111456073 PE=3 SV=1)
HSP 1 Score: 859.0 bits (2218), Expect = 1.3e-245
Identity = 474/658 (72.04%), Positives = 532/658 (80.85%), Query Frame = 0
Query: 1 LNFQVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 60
L VSR+PSKLIEQAR+IL+GNSN+M S+ ATFLV +++ +E A VEENPQERRPALN
Sbjct: 51 LKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEENPQERRPALN 110
Query: 61 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTGGVLKDLN 120
RKRARFSLKPD RQP+VNLE TF+IKQLKDPEEFFLA+ERLENAK EIQKQTG +LKDLN
Sbjct: 111 RKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAILKDLN 170
Query: 121 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP 180
QQNPSTNTR RRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESG+ISPSI+GTEK
Sbjct: 171 QQNPSTNTRQRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGSISPSILGTEKDA 230
Query: 181 SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK 240
SPPII SE +T E VP EEEEE V SIT +ENKVN+ILDELLSANCEDLEGDRAINK
Sbjct: 231 SPPIICSEMKTNEEVPL--EEEEEAFVASITNAENKVNKILDELLSANCEDLEGDRAINK 290
Query: 241 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 300
LQECLQIKPINLEKLCLPDL+AIQT NL+SSRGN P+RSLISVD+QLQRIE K KQDDE
Sbjct: 291 LQECLQIKPINLEKLCLPDLEAIQTTNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDE 350
Query: 301 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 360
+SV+ +STP SM+SPLAS+ AL R+I LSNS DPFSAHD+D+S ARNPS E NHLSD
Sbjct: 351 NSVNPISTPFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSD 410
Query: 361 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 420
V IA++ VS+L S LTKD V GI+SPK +GDVDS+SKIS +NVLNVP+ G AA
Sbjct: 411 AVGIAEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAA 470
Query: 421 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANASNALDDEVHFVVLVNKKLSCL 480
L+ + A+MEAK+ISGS TEVEVN+KLS L A+AD +A
Sbjct: 471 LSETRANMEAKDISGSSTEVEVNEKLSFLEAQADAVA----------------------- 530
Query: 481 GAKADGLANASNALDDEVHFMEDHDELASEQLNTSKVDATKEYPFGIQSQLDQSTATSTD 540
A+N LDDE MEDH+ SEQ NTSKVDA KEYP GIQ+QLDQS AT T+
Sbjct: 531 ---------ATNVLDDE---MEDHEGSTSEQPNTSKVDAIKEYPIGIQTQLDQSIATCTE 590
Query: 541 NNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLAGAGTKWESGVRRSTRFKTRP 600
N VD SRSSGTD+HDKVK KS A QR+ K +S R+SLAGAGT W+ GVRRSTRFKTRP
Sbjct: 591 NIVDRPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP 650
Query: 601 LEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKVKSLVSNKYKELVEFAALH 659
LEYWKGERLLYGRVH+SLATVIG+KYVSPAKGNGQPTLKVKSLVS++Y ELVE AALH
Sbjct: 651 LEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTLKVKSLVSSEYNELVELAALH 671
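The percentages in each HSP header are the matched-position counts divided by the alignment length. The following is a minimal Python sketch, not part of the original page, that re-derives them from a header line; the function name `parse_hsp_stats` is hypothetical, and the pattern also accepts the "Postives" spelling in case the header carries it.

```python
import re

# Matches e.g. "Identity = 474/658 (72.04%)"; "Posi?tives" tolerates the
# misspelling "Postives" that some BLAST report pages emit.
HSP_STATS = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def parse_hsp_stats(line):
    """Return {name: (matched, alignment_length, reported_percent)}."""
    stats = {}
    for name, matched, length, pct in HSP_STATS.findall(line):
        key = "Identity" if name == "Identity" else "Positives"
        stats[key] = (int(matched), int(length), float(pct))
    return stats


header = "Identity = 474/658 (72.04%), Positives = 532/658 (80.85%), Query Frame = 0"
stats = parse_hsp_stats(header)
for name, (matched, length, reported) in stats.items():
    # Recompute the percentage from the raw counts and compare with the report.
    computed = round(100 * matched / length, 2)
    print(f"{name}: {matched}/{length} = {computed}% (reported {reported}%)")
```

The same check can be run on the other hits' header lines, for example 459/678 for A0A1S3CDU5 or 206/684 for AT1G15660.1.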
BLAST of MS009459 vs. ExPASy TrEMBL
Match:
A0A1S3CDU5 (uncharacterized protein LOC103499749 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499749 PE=3 SV=1)
HSP 1 Score: 820.1 bits (2117), Expect = 6.7e-234
Identity = 459/678 (67.70%), Positives = 531/678 (78.32%), Query Frame = 0
Query: 1 LNFQVSRSPSKLIEQARAILDGNSNVMHSEIATFLVHDDENKETTAKVEENPQERRPALN 60
L V RSPSKL+EQAR+ILDGNS M SE ATFLV +++N+ + K EENPQERRPALN
Sbjct: 53 LKSMVLRSPSKLLEQARSILDGNSKSMISEAATFLVKNEKNEAASVKAEENPQERRPALN 112
Query: 61 RKRARFSLKPDTRQPAVNLEATFNIKQLKDPEEFFLAFERLENAKIEIQKQTGGVLKDLN 120
RKRARFSLKPD QP VNLE TF+IKQLKDPEEFFLA+E+ ENAK EIQKQ G VLKDLN
Sbjct: 113 RKRARFSLKPDAGQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQMGAVLKDLN 172
Query: 121 QQNPSTNTRHRRPGILGRSVRYKHQYSSITSEDDQNVEPSQVTFESGNISPSIMGTEKCP 180
QQNPSTNTR RRPGILGRSVRYKHQYSSIT+EDDQNV+PSQVTF+SG SP +GTE P
Sbjct: 173 QQNPSTNTRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFDSGVFSPLKLGTETHP 232
Query: 181 SPPIIGSEKRTGEHVPFEEEEEEEELVTSITKSENKVNRILDELLSANCEDLEGDRAINK 240
SP II SEK+T E V FEEEEEEEELV S TK+EN+VN ILDE LS NCEDLEGDRAIN
Sbjct: 233 SPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINI 292
Query: 241 LQECLQIKPINLEKLCLPDLQAIQTVNLKSSRGNAPKRSLISVDNQLQRIETSKFKQDDE 300
LQE LQIKP+ LEKLCLPDL+AI T+NLKS+RGN KRSLISVDNQLQ+ ET K K+D+E
Sbjct: 293 LQERLQIKPLTLEKLCLPDLEAIPTMNLKSTRGNLSKRSLISVDNQLQKTETLKSKEDNE 352
Query: 301 SSVHLLSTPSSMKSPLASVLALNRQILLSNSSSDPFSAHDIDKSPARNPSFSEHINHLSD 360
+ V+L+STPSSM+SPLAS+ ALNR+I LSNSS D FSAH ID+SPAR+P E NHLSD
Sbjct: 353 NLVNLVSTPSSMRSPLASLSALNRRISLSNSSGDSFSAHGIDRSPARDPYLFELGNHLSD 412
Query: 361 IVDIAKQSSVSKLKSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAA 420
V I + SSVSKLK LT+DG + NGI+ K GD DSMSKIS +N+LNV +VG N A
Sbjct: 413 AVGITEHSSVSKLKPLLTRDGGTIANGIQPSKILSGD-DSMSKISSSNILNVLQVGSNTA 472
Query: 421 LNGSHASMEAKEISGSDTEVEVNKKLSCLGAKADGLANAS-------------NALDDEV 480
L+G++AS +AK +SGS T+VE+N+KLSCL A+AD +AN + +D
Sbjct: 473 LSGTYASTDAKNVSGSSTDVEINEKLSCLEAQADVVANMQIDHQGSASEQPKLSEVDLIE 532
Query: 481 HFVVLVNKKL-----SCLGAKADGLANASNALDDEVHFMEDHDELASEQLNTSKVDATKE 540
+ V + +L +C DG + +S + MEDH+ ASEQ N+SKVD KE
Sbjct: 533 EYPVGIRSQLDQSAATCTENIVDGSSRSSGT--EHHDEMEDHEGSASEQPNSSKVDMIKE 592
Query: 541 YPFGIQSQLDQSTATST--DNNVDGVSRSSGTDHHDKVKPKSHANKQRKDKNISRRQSLA 600
YP GIQ QLDQST T+T + VDG SRSSGTDHHD+VKPKS ANKQRK K IS RQSLA
Sbjct: 593 YPVGIQIQLDQSTTTTTCAEKIVDGTSRSSGTDHHDEVKPKSRANKQRKGKKISGRQSLA 652
Query: 601 GAGTKWESGVRRSTRFKTRPLEYWKGERLLYGRVHQSLATVIGMKYVSPAKGNGQPTLKV 659
GAGT W+SGVRRSTRFK RPLEYWKGER+LYGRVH+SLATVIG+KYVSP KGNG+PT+KV
Sbjct: 653 GAGTTWKSGVRRSTRFKIRPLEYWKGERMLYGRVHESLATVIGLKYVSPEKGNGKPTMKV 712
BLAST of MS009459 vs. TAIR 10
Match:
AT1G15660.1 (centromere protein C )
HSP 1 Score: 187.2 bits (474), Expect = 4.3e-47
Identity = 206/684 (30.12%), Positives = 312/684 (45.61%), Query Frame = 0
Query: 28 HSEIATFLVHDDENKETTAKVEENPQERRPALNRKRARFSLKPDTRQPAVNLEATFNIKQ 87
H E A ++ +D + + N +ERRP L+RKR FSL T QP + +F+ +
Sbjct: 64 HQEQAKAIL-EDVDVDVQLNPIPNKRERRPGLDRKRKSFSLHLTTSQPP-PVAPSFDPSK 123
Query: 88 LKDPEEFFLAFERLENAKIEIQKQTGGVLKDLNQQNPSTNTRHRRPGILGRSVR-YKHQY 147
E+FF A+++ E A E QKQTG + D+ + PS R RRPGI GR R +K +
Sbjct: 124 YPRSEDFFAAYDKFELANREWQKQTGSSVIDIQENPPS--RRPRRPGIPGRKRRPFKESF 183
Query: 148 SSITSEDDQNVEPSQVTFESGNISPSIMGTEKCPSPPIIGSEKRTGEHVPFEEEEEEEEL 207
+ D N+E S+ ++ P E T HV + E ++
Sbjct: 184 TDSYFTDVINLEASE---------------KEIPIASEQSLESATAAHVTTVDREVDD-- 243
Query: 208 VTSITKSENKVNRILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLQAIQTV 267
S ++ +N +L +LL+ + E+LEGD AI L+E LQIK N+EK +P+ Q ++ +
Sbjct: 244 --STVDTDKDLNNVLKDLLACSREELEGDGAIKLLEERLQIKSFNIEKFSIPEFQDVRKM 303
Query: 268 NLKSSRGNAPKR-SLISVDNQLQRIETSKFKQDDES-SVHLLSTPSSMKSPLASVLALNR 327
NLK+S N P R SL + N L+ +++ S S + SS P+ +
Sbjct: 304 NLKASGSNPPNRKSLSDIQNILKGTNRVAVRKNSHSPSPQTIKHFSSPNPPVDQFSFPDI 363
Query: 328 QILL------SNSSSDPFSAHDIDKSPARNPSFSEHINHLSDIVDIAKQSSVSKL----- 387
LL S + P A DI + N + + +D V S +
Sbjct: 364 HNLLPGDQQPSEVNVQPI-AKDIPNTSPTNVGTVDVASPFNDSVVKRSGEDDSHIHSGIH 423
Query: 388 KSPLTKDGEAVPNGIRSPKCPIGDVDSMSKISLANVLNVPEVGGNAALNGSHASMEAKEI 447
+S L++DG C + + + S L +++ G + S + +
Sbjct: 424 RSHLSRDG-------NPDICVMDSISNRSSAMLQKNVDMRTKGKEVDVPMSESGAN-RNT 483
Query: 448 SGSDTEVEVNKKLSCLGAKADGLANASNALDDEV--HFVVLVNKKLSCLGAKADGLANA- 507
+ + E+N+ + D L + EV F V + GA + A
Sbjct: 484 GDRENDAEINE-------ETDNLERLAECASKEVTRPFTVEEDSIPYQQGASSKSPNRAP 543
Query: 508 ------SNALDDEVHFMEDHDELASEQLNTSKV------DATKEYPFGIQSQLDQSTATS 567
+L+ H H+E E +NT +A + + + + + S
Sbjct: 544 EQYNTMGGSLEHAEHNQGLHEE---ENVNTGSASGLQVENAPEVHKYSHKQTNKRRKRGS 603
Query: 568 TDNNVDGVSRS----SGTDHHDKVKP-KSHANKQRKDKNISR------------------ 627
+D+NV S++ +G D K P +S A KQ K K+ R
Sbjct: 604 SDSNVKKRSKTVHGETGGDKQMKTLPHESRAKKQTKGKSNEREEKKPKKTLTHEGKLFSC 663
Query: 628 RQSLAGAGTKWESGVRRSTRFKTRPLEYWKGERLLYGRVHQSLATVIGMKYVSPAKG-NG 659
R+SLA AGTK E GVRRSTR K+RPLEYW+GER LYGR+H+SL TVIG+KY SP +G
Sbjct: 664 RKSLAAAGTKIEGGVRRSTRIKSRPLEYWRGERFLYGRIHESLTTVIGIKYASPGEGKRD 705
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022154052.1 | 0.0e+00 | 94.22 | centromere protein C isoform X1 [Momordica charantia] >XP_022154062.1 centromere...
XP_022154071.1 | 1.2e-306 | 87.84 | centromere protein C isoform X3 [Momordica charantia]
XP_023548004.1 | 8.4e-247 | 72.19 | centromere protein C-like isoform X1 [Cucurbita pepo subsp. pepo]
XP_022953572.1 | 2.7e-245 | 72.04 | centromere protein C isoform X1 [Cucurbita moschata]
XP_022992183.1 | 2.7e-245 | 71.88 | centromere protein C-like isoform X1 [Cucurbita maxima]
Match Name | E-value | Identity (%) | Description
Q66LG9 | 6.0e-46 | 30.12 | Centromere protein C OS=Arabidopsis thaliana OX=3702 GN=CENPC PE=2 SV=1
Match Name | E-value | Identity (%) | Description
A0A6J1DKM1 | 0.0e+00 | 94.22 | centromere protein C isoform X1 OS=Momordica charantia OX=3673 GN=LOC111021402 P...
A0A6J1DML8 | 6.0e-307 | 87.84 | centromere protein C isoform X3 OS=Momordica charantia OX=3673 GN=LOC111021402 P...
A0A6J1JYG6 | 1.3e-245 | 71.88 | centromere protein C-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488588...
A0A6J1GNL2 | 1.3e-245 | 72.04 | centromere protein C isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111456073 PE...
A0A1S3CDU5 | 6.7e-234 | 67.70 | uncharacterized protein LOC103499749 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
Match Name | E-value | Identity (%) | Description
AT1G15660.1 | 4.3e-47 | 30.12 | centromere protein C
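The summary rows above are pipe-delimited, so they can be loaded into records for sorting or filtering by E-value. The following is a minimal Python sketch under that assumption; `parse_summary_row` and the dictionary keys are hypothetical names chosen for illustration, and only the first four fields of each row are read, so any trailing columns are ignored.

```python
def parse_summary_row(row):
    """Split one pipe-delimited hit row into a typed record.

    Assumes the first four fields are: match name, E-value,
    percent identity, description. Extra trailing fields are ignored.
    """
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "match": name,
        "evalue": float(evalue),      # "0.0e+00" parses to 0.0
        "identity": float(identity),  # percent identity
        "description": description,
    }


rows = [
    "A0A6J1GNL2 | 1.3e-245 | 72.04 | centromere protein C isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111456073 PE...",
    "A0A1S3CDU5 | 6.7e-234 | 67.70 | uncharacterized protein LOC103499749 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...",
    "AT1G15660.1 | 4.3e-47 | 30.12 | centromere protein C",
]
hits = [parse_summary_row(row) for row in rows]

# Rank hits from most to least significant (smallest E-value first).
for hit in sorted(hits, key=lambda h: h["evalue"]):
    print(hit["match"], hit["evalue"], hit["identity"])
```

Header rows ("Match Name | E-value | ...") are not numeric and would need to be skipped before calling the parser.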