CmoCh19G011270 (gene) Cucurbita moschata (Rifu)

Name: CmoCh19G011270
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Nucleolar complex-associated protein, putative
Location: Cmo_Chr19 : 9556953 .. 9558630 (-)
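
The location field packs chromosome, start, end, and strand into one string. The following is a minimal sketch of how it could be parsed and the span length computed. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this record does not state explicitly); the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

# Pattern for strings such as "Cmo_Chr19 : 9556953 .. 9558630 (-)".
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text: str) -> tuple[str, int, int, str]:
    """Return (chromosome, start, end, strand) from a location string."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start: int, end: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Cmo_Chr19 : 9556953 .. 9558630 (-)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 1678 bp on the minus strand of Cmo_Chr19.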
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATGTTGATCCTCAGGACTTTTCTGTCCAGCTGTACAATATTGTACTCGAGTACAGGCCTGTAAGGTTAGTGCTGAACTCTAAACTAGAATGTTAAAATTGTTGCTGGTTCTTTTAAAAGCAAGCCTGGTTAATGTTTGTTCCAAGGATTGTTTTAATATTTATGCTCTTCATTACAACAAATTCTCTGATAACCTAATCTTGGGAAGAATTTCTGATGTTACACTGGGATTCATCACCAAGCATCCAGTCGTTATGTGAACCTCAGATTAAAGCTTTCAAGTTTATAAGAATTCTTGGACATTACTTGCTTATCCCCATGTAAACCATAACAATTTTGATTTTAATTTTTAGGGTAAGTTCATTTGTGGGCAGTCTGGTGCACATGCCTTGGGGTCCTTGTTATGATGTAATTTTAATACCTTGTTCCCTGGATATTATGATTGTTTATCACTAAGGGTGAGACTACATTAATATTTATAGTTTCTACTACTCATCTGCCATCTGCTATCATTTATGTTTTATTATTATTATTTTTGAATGGGTAACATTCAAGTTTCACTCACAGAGACCATGGTGGATTGTTAGCTGAAGCTTTGAAGATAATGTTGTGCGATAATAGACAGCATGCCATGCAGAAGGCAGCTGCATTTATTAAGCGTTTGGCTACTTTCTCATTATGCTTTGAATCTGCAGAGTCGTTGGCAGGTATATTAAAATTCCTTTTAGTTTTGATGACTTTTATTTAACCGTGAATTTGTGTGGCTATTTACTGCCTTGGTCACCGTAAAGCATCTTCTTCAGAAAAATGTCAAGTGCCGCAACCTTTTGGAAAACGATTCTTGGGGAGGTTCAGTGTCTGGCTCAATTGCGGTAAGGTTTCGTTGTAACTAGTTGAGATATATGACAGCTTTGCATGATGTTTATCTCTCCCGTATTTCACGAACACGAGAAAACAAAAAATCACCTAGCTTTTCTGCTATGACATGTTTAGAACGACTTTTTCAAGTGAAAAAAAGTATTATAAGCATTTAGAAAGTCGGTCAAATTAGGCGATTTCGTAACGGCCTGTGAAATTGAGCTTATTGATTCAATATCAGACATCCTTCTTCCCACCTCAACCTACTCACAGGAGGAAAGAGATAAACTAATTTTTGCTGATTTGATATGCAGAAATACTAGCCATATGCTTCTGATCCAACTTTGAGCGGTGCTCTTGCTTCTGTCCTTTGGGAACTTAATCTTCTGTGGAAGCATTATCATCCAGCTGTCTCAAAGATGTCAACGAGCATATCAAGCATGAATAGTGCTCAAAACCAAGTGTACATCTCCACGGTTTCTCCCCAACAGGCATTCAAAGACTTGTCGCTGGAATAGGAGTCTTTCAACCCAAAATTTAATGTCCGAAAAGTTGAAAAGAAAAAAAGAGCTAGCGAAGTGAAGGAGAAACTTTCAACAAGATTCTTTCTTCTCCGGGACATCAAGGACAATGAAAGGCTGAGAGGTGAATTGGACCGTACCACTTTGTCTTTGCAGCGATATGAAGAACACAAAAGGCAAAAGAGAAAAACTAAAAGATCAAGGTATGTTTAACTTTGTTATTTGTGATTTCTTGGTGTTAAAATCTTCATGGTTGCAAGAAATGTAGATTAATTAAGCTGAGAGGAAATGTGTAG

mRNA sequence

ATGAATGTTGATCCTCAGGACTTTTCTGTCCAGCTGTACAATATTGTACTCGAGTACAGGCCTGTAAGTTTCACTCACAGAGACCATGGTGGATTGTTAGCTGAAGCTTTGAAGATAATGTTGTGCGATAATAGACAGCATGCCATGCAGAAGGCAGCTGCATTTATTAAGCGTTTGGCTACTTTCTCATTATGCTTTGAATCTGCAGAGTCGTTGGCAGCATCTTCTTCAGAAAAATGTCAAGTGCCGCAACCTTTTGGAAAACGATTCTTGGGGAGGTTCAGTGTCTGGCTCAATTGCGCTTTGCATGATGTTTATCTCTCCCGTATTTCACGAACACGAGAAAACAAAAAATCACCTAGCTTTTCTGCTATGACATGTTTAGAACGACTTTTTCAACCATATGCTTCTGATCCAACTTTGAGCGGTGCTCTTGCTTCTGTCCTTTGGGAACTTAATCTTCTGTGGAAGCATTATCATCCAGCTGTCTCAAAGATGTCAACGAGCATATCAAGCATGAATAGTGCTCAAAACCAAGTGTACATCTCCACGGAGTCTTTCAACCCAAAATTTAATGTCCGAAAAGTTGAAAAGAAAAAAAGAGCTAGCGAAGTGAAGGAGAAACTTTCAACAAGATTCTTTCTTCTCCGGGACATCAAGGACAATGAAAGGCTGAGAGGTGAATTGGACCGTACCACTTTGTCTTTGCAGCGATATGAAGAACACAAAAGGCAAAAGAGAAAAACTAAAAGATCAAGATTAATTAAGCTGAGAGGAAATGTGTAG

Coding sequence (CDS)

ATGAATGTTGATCCTCAGGACTTTTCTGTCCAGCTGTACAATATTGTACTCGAGTACAGGCCTGTAAGTTTCACTCACAGAGACCATGGTGGATTGTTAGCTGAAGCTTTGAAGATAATGTTGTGCGATAATAGACAGCATGCCATGCAGAAGGCAGCTGCATTTATTAAGCGTTTGGCTACTTTCTCATTATGCTTTGAATCTGCAGAGTCGTTGGCAGCATCTTCTTCAGAAAAATGTCAAGTGCCGCAACCTTTTGGAAAACGATTCTTGGGGAGGTTCAGTGTCTGGCTCAATTGCGCTTTGCATGATGTTTATCTCTCCCGTATTTCACGAACACGAGAAAACAAAAAATCACCTAGCTTTTCTGCTATGACATGTTTAGAACGACTTTTTCAACCATATGCTTCTGATCCAACTTTGAGCGGTGCTCTTGCTTCTGTCCTTTGGGAACTTAATCTTCTGTGGAAGCATTATCATCCAGCTGTCTCAAAGATGTCAACGAGCATATCAAGCATGAATAGTGCTCAAAACCAAGTGTACATCTCCACGGAGTCTTTCAACCCAAAATTTAATGTCCGAAAAGTTGAAAAGAAAAAAAGAGCTAGCGAAGTGAAGGAGAAACTTTCAACAAGATTCTTTCTTCTCCGGGACATCAAGGACAATGAAAGGCTGAGAGGTGAATTGGACCGTACCACTTTGTCTTTGCAGCGATATGAAGAACACAAAAGGCAAAAGAGAAAAACTAAAAGATCAAGATTAATTAAGCTGAGAGGAAATGTGTAG
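
The relationship between the coding sequence and the protein query used in the BLAST alignments can be illustrated with a short translation sketch. It builds the standard genetic code table and translates only the first 24 nucleotides of the CDS shown above; the names `CODON_TABLE` and `translate` are hypothetical, and the sketch assumes the standard (nuclear) genetic code applies to this gene.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt of the CDS listed in this record.
print(translate("ATGAATGTTGATCCTCAGGACTTT"))  # MNVDPQDF
```

The eight residues produced here agree with the start of the query sequence in the alignments (`MNVDPQDF...`).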
BLAST of CmoCh19G011270 vs. TrEMBL
Match: A0A0A0K7H5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G238410 PE=4 SV=1)

HSP 1 Score: 271.6 bits (693), Expect = 1.0e-69
Identity = 168/283 (59.36%), Positives = 186/283 (65.72%), Query Frame = 1

Query: 1   MNVDPQDFSVQLYNIVLEYRPVSFTHRDHGGLLAEALKIMLCDNRQHAMQKAAAFIKRLA 60
           +NVD QDF VQLYNIVL+YRP     RD GGLLAEALKIMLCD+RQH MQKAAAFIKRLA
Sbjct: 569 LNVDLQDFFVQLYNIVLDYRP----GRDQGGLLAEALKIMLCDDRQHDMQKAAAFIKRLA 628

Query: 61  TFSLCFESAESLAASSSEKCQVPQPFGKRFLGRFSVWLNCALHDVYLSRISRTRENKKSP 120
           TFSLCF SAESLAA  + +                   +  L +V    +        S 
Sbjct: 629 TFSLCFGSAESLAALVTVR-------------------HLLLKNVKCRNLLENDAGGGSV 688

Query: 121 SFSAMTCLERLFQPYASDPTLSGALASVLWELNLLWKHYHPAVSKMSTSISSMNSAQNQV 180
           S S        +QPYA+DP LSGALASVLWEL+LLWKHYHPAVS M+  IS+MNSAQNQV
Sbjct: 689 SGSIAK-----YQPYATDPNLSGALASVLWELDLLWKHYHPAVSTMAAGISNMNSAQNQV 748

Query: 181 YIS--------------TESFNPKFNVRKVEKKKRAS----------------EVKEKLS 240
           YIS               ESFNP+FN RK+ K+KR S                EVKEKLS
Sbjct: 749 YISIVSPQQAFKDLSLEQESFNPQFNARKINKRKRGSESSQSTLDTCGTIDENEVKEKLS 808

Query: 241 TRFFLLRDIKDNERLRGELDRTTLSLQRYEEHKRQKRKTKRSR 254
           TRFFLLRDIKDNERLR ELDRTTLSLQ YEE+KRQKRKTK+SR
Sbjct: 809 TRFFLLRDIKDNERLRSELDRTTLSLQLYEEYKRQKRKTKKSR 823
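
The percentages in an HSP summary line are simply matched positions divided by alignment length. As a sketch, the following recomputes them from the counts reported for this hit (168/283 identical, 186/283 positive). The function name `parse_hsp_stats` is hypothetical, and the regular expression is only an assumption about this page's text layout, not a general BLAST parser.

```python
import re

# Matches fragments such as "Identity = 168/283 (59.36%)".
STAT_RE = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def parse_hsp_stats(line: str) -> dict[str, tuple[float, float]]:
    """Map each statistic name to (recomputed %, reported %)."""
    stats = {}
    for name, matched, length, reported in STAT_RE.findall(line):
        recomputed = round(100 * int(matched) / int(length), 2)
        stats[name] = (recomputed, float(reported))
    return stats


line = "Identity = 168/283 (59.36%), Positives = 186/283 (65.72%), Query Frame = 1"
for name, (recomputed, reported) in parse_hsp_stats(line).items():
    print(f"{name}: recomputed {recomputed:.2f}%, reported {reported:.2f}%")
```

Note that the denominator is the alignment length including gap columns (283), not the query length, which is why it differs between hits.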

BLAST of CmoCh19G011270 vs. TrEMBL
Match: A0A061FAC6_THECC (Nucleolar complex protein 3 isoform 1 OS=Theobroma cacao GN=TCM_032975 PE=4 SV=1)

HSP 1 Score: 226.5 bits (576), Expect = 3.8e-56
Identity = 148/287 (51.57%), Positives = 173/287 (60.28%), Query Frame = 1

Query: 1   MNVDPQDFSVQLYNIVLEYRPVSFTHRDHGGLLAEALKIMLCDNRQHAMQKAAAFIKRLA 60
           +NVD QDF VQLYN+VLEYRP     RD GG+LAEALKIMLCD+RQH MQKAAAF KRLA
Sbjct: 570 LNVDLQDFFVQLYNLVLEYRP----GRDQGGVLAEALKIMLCDDRQHDMQKAAAFAKRLA 629

Query: 61  TFSLCFESAESLAASSSEKCQVPQPFGKRFLGRFSVWLNCALHDVYLSRISRTRENKKSP 120
           TFSLCF SAES+AA  + K                   N    +V    +        S 
Sbjct: 630 TFSLCFGSAESMAALVTLK-------------------NLLQKNVKCRNLLENDAGGGSV 689

Query: 121 SFSAMTCLERLFQPYASDPTLSGALASVLWELNLLWKHYHPAVSKMSTSISSMNSAQNQV 180
           S S        +QPYASDP LSGALASVLWELNLL KHYHP VS ++ SIS MN+AQNQV
Sbjct: 690 SGSIAK-----YQPYASDPNLSGALASVLWELNLLSKHYHPTVSTLAASISCMNTAQNQV 749

Query: 181 YIS-------------TESFNPKFNVRKVEKKKR-----------------ASEVKEKLS 240
           Y+S              ESF+PKF+ +K   K++                  +EV +KL 
Sbjct: 750 YLSITPQQAFINLSLEQESFDPKFSTQKSNNKRKRGTGPSTLASINPTSIDENEVSKKLG 809

Query: 241 TRFFLLRDIKDNERLRGELDRTTLSLQRYEEHKRQ----KRKTKRSR 254
             F LLRDIK+NERLRGELDRT  SLQ YEE+K+Q    K KTK+S+
Sbjct: 810 RHFMLLRDIKENERLRGELDRTRSSLQLYEEYKKQRKSLKHKTKKSK 828

BLAST of CmoCh19G011270 vs. TrEMBL
Match: A0A061FB71_THECC (Binding isoform 2 OS=Theobroma cacao GN=TCM_032975 PE=4 SV=1)

HSP 1 Score: 226.5 bits (576), Expect = 3.8e-56
Identity = 148/287 (51.57%), Positives = 173/287 (60.28%), Query Frame = 1

Query: 1   MNVDPQDFSVQLYNIVLEYRPVSFTHRDHGGLLAEALKIMLCDNRQHAMQKAAAFIKRLA 60
           +NVD QDF VQLYN+VLEYRP     RD GG+LAEALKIMLCD+RQH MQKAAAF KRLA
Sbjct: 390 LNVDLQDFFVQLYNLVLEYRP----GRDQGGVLAEALKIMLCDDRQHDMQKAAAFAKRLA 449

Query: 61  TFSLCFESAESLAASSSEKCQVPQPFGKRFLGRFSVWLNCALHDVYLSRISRTRENKKSP 120
           TFSLCF SAES+AA  + K                   N    +V    +        S 
Sbjct: 450 TFSLCFGSAESMAALVTLK-------------------NLLQKNVKCRNLLENDAGGGSV 509

Query: 121 SFSAMTCLERLFQPYASDPTLSGALASVLWELNLLWKHYHPAVSKMSTSISSMNSAQNQV 180
           S S        +QPYASDP LSGALASVLWELNLL KHYHP VS ++ SIS MN+AQNQV
Sbjct: 510 SGSIAK-----YQPYASDPNLSGALASVLWELNLLSKHYHPTVSTLAASISCMNTAQNQV 569

Query: 181 YIS-------------TESFNPKFNVRKVEKKKR-----------------ASEVKEKLS 240
           Y+S              ESF+PKF+ +K   K++                  +EV +KL 
Sbjct: 570 YLSITPQQAFINLSLEQESFDPKFSTQKSNNKRKRGTGPSTLASINPTSIDENEVSKKLG 629

Query: 241 TRFFLLRDIKDNERLRGELDRTTLSLQRYEEHKRQ----KRKTKRSR 254
             F LLRDIK+NERLRGELDRT  SLQ YEE+K+Q    K KTK+S+
Sbjct: 630 RHFMLLRDIKENERLRGELDRTRSSLQLYEEYKKQRKSLKHKTKKSK 648

BLAST of CmoCh19G011270 vs. TrEMBL
Match: A0A061FAQ4_THECC (Nucleolar complex protein 3 isoform 3 OS=Theobroma cacao GN=TCM_032975 PE=4 SV=1)

HSP 1 Score: 226.5 bits (576), Expect = 3.8e-56
Identity = 148/287 (51.57%), Positives = 173/287 (60.28%), Query Frame = 1

Query: 1   MNVDPQDFSVQLYNIVLEYRPVSFTHRDHGGLLAEALKIMLCDNRQHAMQKAAAFIKRLA 60
           +NVD QDF VQLYN+VLEYRP     RD GG+LAEALKIMLCD+RQH MQKAAAF KRLA
Sbjct: 421 LNVDLQDFFVQLYNLVLEYRP----GRDQGGVLAEALKIMLCDDRQHDMQKAAAFAKRLA 480

Query: 61  TFSLCFESAESLAASSSEKCQVPQPFGKRFLGRFSVWLNCALHDVYLSRISRTRENKKSP 120
           TFSLCF SAES+AA  + K                   N    +V    +        S 
Sbjct: 481 TFSLCFGSAESMAALVTLK-------------------NLLQKNVKCRNLLENDAGGGSV 540

Query: 121 SFSAMTCLERLFQPYASDPTLSGALASVLWELNLLWKHYHPAVSKMSTSISSMNSAQNQV 180
           S S        +QPYASDP LSGALASVLWELNLL KHYHP VS ++ SIS MN+AQNQV
Sbjct: 541 SGSIAK-----YQPYASDPNLSGALASVLWELNLLSKHYHPTVSTLAASISCMNTAQNQV 600

Query: 181 YIS-------------TESFNPKFNVRKVEKKKR-----------------ASEVKEKLS 240
           Y+S              ESF+PKF+ +K   K++                  +EV +KL 
Sbjct: 601 YLSITPQQAFINLSLEQESFDPKFSTQKSNNKRKRGTGPSTLASINPTSIDENEVSKKLG 660

Query: 241 TRFFLLRDIKDNERLRGELDRTTLSLQRYEEHKRQ----KRKTKRSR 254
             F LLRDIK+NERLRGELDRT  SLQ YEE+K+Q    K KTK+S+
Sbjct: 661 RHFMLLRDIKENERLRGELDRTRSSLQLYEEYKKQRKSLKHKTKKSK 679

BLAST of CmoCh19G011270 vs. TrEMBL
Match: M5WM32_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa016725mg PE=4 SV=1)

HSP 1 Score: 218.0 bits (554), Expect = 1.4e-53
Identity = 143/292 (48.97%), Positives = 176/292 (60.27%), Query Frame = 1

Query: 1   MNVDPQDFSVQLYNIVLEYRPVSFTHRDHGGLLAEALKIMLCDNRQHAMQKAAAFIKRLA 60
           +NVD QDF VQLYNI+LEYRP     RD G +LAEALKIMLC++RQH MQKAAAF+KRLA
Sbjct: 570 LNVDLQDFFVQLYNIILEYRP----GRDQGEVLAEALKIMLCEDRQHDMQKAAAFVKRLA 629

Query: 61  TFSLCFESAESLAASSSEKCQVPQPFGKRFLGRFSVWLNCALHDVYLSRISRTRENKKSP 120
           TFSLC  SAES+AA  + K                   +  L +V    +        S 
Sbjct: 630 TFSLCSGSAESMAALVTLK-------------------HLLLKNVKCRNLLENDAGGGSV 689

Query: 121 SFSAMTCLERLFQPYASDPTLSGALASVLWELNLLWKHYHPAVSKMSTSISSMNSAQNQV 180
           S S        + PYASDP LSGALASVLWELNLL +HYHPAVS M++SISSMN+A NQV
Sbjct: 690 SGSVAK-----YHPYASDPNLSGALASVLWELNLLTQHYHPAVSSMASSISSMNTAHNQV 749

Query: 181 YIST---------------ESFNPKFNVRKVE-KKKRASE-------------------- 240
           Y+ST               ESF P  +++K   K+KR S+                    
Sbjct: 750 YLSTISPQQAFTDFSLERPESFKPPSDIKKSNNKRKRGSDPSVSAVIETSADTTSIDEDD 809

Query: 241 VKEKLSTRFFLLRDIKDNERLRGELDRTTLSLQRYEEHKRQKRKTKRSRLIK 257
           V++KLS  F LLRDIK+N+RLR ELD TT S+Q YEE+K+QK+K K+ ++ K
Sbjct: 810 VRKKLSAHFMLLRDIKENQRLRAELDGTTSSIQLYEEYKQQKKKAKKPKVKK 833

BLAST of CmoCh19G011270 vs. TAIR10
Match: AT1G79150.1 (AT1G79150.1 binding)

HSP 1 Score: 169.9 bits (429), Expect = 2.1e-42
Identity = 114/282 (40.43%), Positives = 159/282 (56.38%), Query Frame = 1

Query: 1   MNVDPQDFSVQLYNIVLEYRPVSFTHRDHGGLLAEALKIMLCDNRQHAMQKAAAFIKRLA 60
           +NVD QDF VQLYN++LEYRP     RD G +LAE+LKIMLCD+R   MQKAAAF+KRLA
Sbjct: 572 LNVDLQDFFVQLYNLILEYRP----GRDSGVILAESLKIMLCDDRHQDMQKAAAFVKRLA 631

Query: 61  TFSLCFESAESLAASSSEKCQVPQPFGKRFLGRFSVWLNCALHDVYLSRISRTRENKKSP 120
           TF+LCF  AES++A  + K  + +    R         N   +D     +S +       
Sbjct: 632 TFALCFGCAESMSALVTLKTLLQKNVKCR---------NLLENDAGGGSVSGSIAK---- 691

Query: 121 SFSAMTCLERLFQPYASDPTLSGALASVLWELNLLWKHYHPAVSKMSTSISSMNSAQNQV 180
                      +QPYA+DP LSGALA+VLWEL+LL KHYHPA+S M+T++S+MN++Q+Q 
Sbjct: 692 -----------YQPYATDPNLSGALATVLWELSLLSKHYHPAISTMATTVSNMNTSQSQT 751

Query: 181 YIST--------------ESFNPKFNVRKVEKKKRASEVKE---------------KLST 240
           ++S               ESF PK   RK+  K++   + E               KL  
Sbjct: 752 FLSAVTPQQAFADFSLVKESFEPKNESRKLNNKRKRESLPEEAKNVPEIDMVKLSKKLKE 811

Query: 241 RFFLLRDIKDNERLRGELDRTTLSLQRYEEHKRQKRKTKRSR 254
            F +LRDIK++ER+R EL ++       +++   K+K K  +
Sbjct: 812 NFTILRDIKEDERVRMELLQSEEKKPLKKQNNVVKKKLKNPK 825

BLAST of CmoCh19G011270 vs. NCBI nr
Match: gi|659092555|ref|XP_008447119.1| (PREDICTED: nucleolar complex protein 3 homolog [Cucumis melo])

HSP 1 Score: 277.7 bits (709), Expect = 2.1e-71
Identity = 171/283 (60.42%), Positives = 190/283 (67.14%), Query Frame = 1

Query: 1   MNVDPQDFSVQLYNIVLEYRPVSFTHRDHGGLLAEALKIMLCDNRQHAMQKAAAFIKRLA 60
           +NVD QDF VQLYNIVL+YRP     RD GGLLAEALKIMLCD+RQH MQKAAAFIKRLA
Sbjct: 570 LNVDLQDFFVQLYNIVLDYRP----GRDQGGLLAEALKIMLCDDRQHDMQKAAAFIKRLA 629

Query: 61  TFSLCFESAESLAASSSEKCQVPQPFGKRFLGRFSVWLNCALHDVYLSRISRTRENKKSP 120
           TFSLCF SAESLAA  + +                   +  L +V    +        S 
Sbjct: 630 TFSLCFGSAESLAALVTVR-------------------HLLLKNVKCRNLLENDAGGGSV 689

Query: 121 SFSAMTCLERLFQPYASDPTLSGALASVLWELNLLWKHYHPAVSKMSTSISSMNSAQNQV 180
           S S        +QPYA+DP LSGALASVLWEL+LLWKHYHPAVSKM+ SIS+MNSAQNQV
Sbjct: 690 SGSIAK-----YQPYATDPNLSGALASVLWELDLLWKHYHPAVSKMAASISNMNSAQNQV 749

Query: 181 YIST--------------ESFNPKFNVRKVEKKKRAS----------------EVKEKLS 240
           YIST              ESFNP+FN RK+ K+KRAS                EVKEKLS
Sbjct: 750 YISTVSPQQAFKDLSLEQESFNPQFNTRKISKRKRASESSQSTPNTCGTIDENEVKEKLS 809

Query: 241 TRFFLLRDIKDNERLRGELDRTTLSLQRYEEHKRQKRKTKRSR 254
           TRFFLLRDIKDNERLR EL+RTTLSLQ YEE+KRQKRKTK+SR
Sbjct: 810 TRFFLLRDIKDNERLRSELERTTLSLQLYEEYKRQKRKTKKSR 824

BLAST of CmoCh19G011270 vs. NCBI nr
Match: gi|449444134|ref|XP_004139830.1| (PREDICTED: nucleolar complex protein 3 homolog [Cucumis sativus])

HSP 1 Score: 271.6 bits (693), Expect = 1.5e-69
Identity = 168/283 (59.36%), Positives = 186/283 (65.72%), Query Frame = 1

Query: 1   MNVDPQDFSVQLYNIVLEYRPVSFTHRDHGGLLAEALKIMLCDNRQHAMQKAAAFIKRLA 60
           +NVD QDF VQLYNIVL+YRP     RD GGLLAEALKIMLCD+RQH MQKAAAFIKRLA
Sbjct: 569 LNVDLQDFFVQLYNIVLDYRP----GRDQGGLLAEALKIMLCDDRQHDMQKAAAFIKRLA 628

Query: 61  TFSLCFESAESLAASSSEKCQVPQPFGKRFLGRFSVWLNCALHDVYLSRISRTRENKKSP 120
           TFSLCF SAESLAA  + +                   +  L +V    +        S 
Sbjct: 629 TFSLCFGSAESLAALVTVR-------------------HLLLKNVKCRNLLENDAGGGSV 688

Query: 121 SFSAMTCLERLFQPYASDPTLSGALASVLWELNLLWKHYHPAVSKMSTSISSMNSAQNQV 180
           S S        +QPYA+DP LSGALASVLWEL+LLWKHYHPAVS M+  IS+MNSAQNQV
Sbjct: 689 SGSIAK-----YQPYATDPNLSGALASVLWELDLLWKHYHPAVSTMAAGISNMNSAQNQV 748

Query: 181 YIS--------------TESFNPKFNVRKVEKKKRAS----------------EVKEKLS 240
           YIS               ESFNP+FN RK+ K+KR S                EVKEKLS
Sbjct: 749 YISIVSPQQAFKDLSLEQESFNPQFNARKINKRKRGSESSQSTLDTCGTIDENEVKEKLS 808

Query: 241 TRFFLLRDIKDNERLRGELDRTTLSLQRYEEHKRQKRKTKRSR 254
           TRFFLLRDIKDNERLR ELDRTTLSLQ YEE+KRQKRKTK+SR
Sbjct: 809 TRFFLLRDIKDNERLRSELDRTTLSLQLYEEYKRQKRKTKKSR 823

BLAST of CmoCh19G011270 vs. NCBI nr
Match: gi|590612725|ref|XP_007022463.1| (Nucleolar complex protein 3 isoform 3 [Theobroma cacao])

HSP 1 Score: 226.5 bits (576), Expect = 5.5e-56
Identity = 148/287 (51.57%), Positives = 173/287 (60.28%), Query Frame = 1

Query: 1   MNVDPQDFSVQLYNIVLEYRPVSFTHRDHGGLLAEALKIMLCDNRQHAMQKAAAFIKRLA 60
           +NVD QDF VQLYN+VLEYRP     RD GG+LAEALKIMLCD+RQH MQKAAAF KRLA
Sbjct: 421 LNVDLQDFFVQLYNLVLEYRP----GRDQGGVLAEALKIMLCDDRQHDMQKAAAFAKRLA 480

Query: 61  TFSLCFESAESLAASSSEKCQVPQPFGKRFLGRFSVWLNCALHDVYLSRISRTRENKKSP 120
           TFSLCF SAES+AA  + K                   N    +V    +        S 
Sbjct: 481 TFSLCFGSAESMAALVTLK-------------------NLLQKNVKCRNLLENDAGGGSV 540

Query: 121 SFSAMTCLERLFQPYASDPTLSGALASVLWELNLLWKHYHPAVSKMSTSISSMNSAQNQV 180
           S S        +QPYASDP LSGALASVLWELNLL KHYHP VS ++ SIS MN+AQNQV
Sbjct: 541 SGSIAK-----YQPYASDPNLSGALASVLWELNLLSKHYHPTVSTLAASISCMNTAQNQV 600

Query: 181 YIS-------------TESFNPKFNVRKVEKKKR-----------------ASEVKEKLS 240
           Y+S              ESF+PKF+ +K   K++                  +EV +KL 
Sbjct: 601 YLSITPQQAFINLSLEQESFDPKFSTQKSNNKRKRGTGPSTLASINPTSIDENEVSKKLG 660

Query: 241 TRFFLLRDIKDNERLRGELDRTTLSLQRYEEHKRQ----KRKTKRSR 254
             F LLRDIK+NERLRGELDRT  SLQ YEE+K+Q    K KTK+S+
Sbjct: 661 RHFMLLRDIKENERLRGELDRTRSSLQLYEEYKKQRKSLKHKTKKSK 679

BLAST of CmoCh19G011270 vs. NCBI nr
Match: gi|590612722|ref|XP_007022462.1| (Binding isoform 2 [Theobroma cacao])

HSP 1 Score: 226.5 bits (576), Expect = 5.5e-56
Identity = 148/287 (51.57%), Positives = 173/287 (60.28%), Query Frame = 1

Query: 1   MNVDPQDFSVQLYNIVLEYRPVSFTHRDHGGLLAEALKIMLCDNRQHAMQKAAAFIKRLA 60
           +NVD QDF VQLYN+VLEYRP     RD GG+LAEALKIMLCD+RQH MQKAAAF KRLA
Sbjct: 390 LNVDLQDFFVQLYNLVLEYRP----GRDQGGVLAEALKIMLCDDRQHDMQKAAAFAKRLA 449

Query: 61  TFSLCFESAESLAASSSEKCQVPQPFGKRFLGRFSVWLNCALHDVYLSRISRTRENKKSP 120
           TFSLCF SAES+AA  + K                   N    +V    +        S 
Sbjct: 450 TFSLCFGSAESMAALVTLK-------------------NLLQKNVKCRNLLENDAGGGSV 509

Query: 121 SFSAMTCLERLFQPYASDPTLSGALASVLWELNLLWKHYHPAVSKMSTSISSMNSAQNQV 180
           S S        +QPYASDP LSGALASVLWELNLL KHYHP VS ++ SIS MN+AQNQV
Sbjct: 510 SGSIAK-----YQPYASDPNLSGALASVLWELNLLSKHYHPTVSTLAASISCMNTAQNQV 569

Query: 181 YIS-------------TESFNPKFNVRKVEKKKR-----------------ASEVKEKLS 240
           Y+S              ESF+PKF+ +K   K++                  +EV +KL 
Sbjct: 570 YLSITPQQAFINLSLEQESFDPKFSTQKSNNKRKRGTGPSTLASINPTSIDENEVSKKLG 629

Query: 241 TRFFLLRDIKDNERLRGELDRTTLSLQRYEEHKRQ----KRKTKRSR 254
             F LLRDIK+NERLRGELDRT  SLQ YEE+K+Q    K KTK+S+
Sbjct: 630 RHFMLLRDIKENERLRGELDRTRSSLQLYEEYKKQRKSLKHKTKKSK 648

BLAST of CmoCh19G011270 vs. NCBI nr
Match: gi|590612718|ref|XP_007022461.1| (Nucleolar complex protein 3 isoform 1 [Theobroma cacao])

HSP 1 Score: 226.5 bits (576), Expect = 5.5e-56
Identity = 148/287 (51.57%), Positives = 173/287 (60.28%), Query Frame = 1

Query: 1   MNVDPQDFSVQLYNIVLEYRPVSFTHRDHGGLLAEALKIMLCDNRQHAMQKAAAFIKRLA 60
           +NVD QDF VQLYN+VLEYRP     RD GG+LAEALKIMLCD+RQH MQKAAAF KRLA
Sbjct: 570 LNVDLQDFFVQLYNLVLEYRP----GRDQGGVLAEALKIMLCDDRQHDMQKAAAFAKRLA 629

Query: 61  TFSLCFESAESLAASSSEKCQVPQPFGKRFLGRFSVWLNCALHDVYLSRISRTRENKKSP 120
           TFSLCF SAES+AA  + K                   N    +V    +        S 
Sbjct: 630 TFSLCFGSAESMAALVTLK-------------------NLLQKNVKCRNLLENDAGGGSV 689

Query: 121 SFSAMTCLERLFQPYASDPTLSGALASVLWELNLLWKHYHPAVSKMSTSISSMNSAQNQV 180
           S S        +QPYASDP LSGALASVLWELNLL KHYHP VS ++ SIS MN+AQNQV
Sbjct: 690 SGSIAK-----YQPYASDPNLSGALASVLWELNLLSKHYHPTVSTLAASISCMNTAQNQV 749

Query: 181 YIS-------------TESFNPKFNVRKVEKKKR-----------------ASEVKEKLS 240
           Y+S              ESF+PKF+ +K   K++                  +EV +KL 
Sbjct: 750 YLSITPQQAFINLSLEQESFDPKFSTQKSNNKRKRGTGPSTLASINPTSIDENEVSKKLG 809

Query: 241 TRFFLLRDIKDNERLRGELDRTTLSLQRYEEHKRQ----KRKTKRSR 254
             F LLRDIK+NERLRGELDRT  SLQ YEE+K+Q    K KTK+S+
Sbjct: 810 RHFMLLRDIKENERLRGELDRTRSSLQLYEEYKKQRKSLKHKTKKSK 828

The following BLAST results are available for this feature:

Match Name          E-value   Identity   Description
A0A0A0K7H5_CUCSA    1.0e-69   59.36      Uncharacterized protein OS=Cucumis sativus GN=Csa_7G238410 PE=4 SV=1
A0A061FAC6_THECC    3.8e-56   51.57      Nucleolar complex protein 3 isoform 1 OS=Theobroma cacao GN=TCM_032975 PE=4 SV=1
A0A061FB71_THECC    3.8e-56   51.57      Binding isoform 2 OS=Theobroma cacao GN=TCM_032975 PE=4 SV=1
A0A061FAQ4_THECC    3.8e-56   51.57      Nucleolar complex protein 3 isoform 3 OS=Theobroma cacao GN=TCM_032975 PE=4 SV=1
M5WM32_PRUPE        1.4e-53   48.97      Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa016725mg PE=4 S...

Match Name          E-value   Identity   Description
AT1G79150.1         2.1e-42   40.43      binding

Match Name                         E-value   Identity   Description
gi|659092555|ref|XP_008447119.1|   2.1e-71   60.42      PREDICTED: nucleolar complex protein 3 homolog [Cucumis melo]
gi|449444134|ref|XP_004139830.1|   1.5e-69   59.36      PREDICTED: nucleolar complex protein 3 homolog [Cucumis sativus]
gi|590612725|ref|XP_007022463.1|   5.5e-56   51.57      Nucleolar complex protein 3 isoform 3 [Theobroma cacao]
gi|590612722|ref|XP_007022462.1|   5.5e-56   51.57      Binding isoform 2 [Theobroma cacao]
gi|590612718|ref|XP_007022461.1|   5.5e-56   51.57      Nucleolar complex protein 3 isoform 1 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR005612   CCAAT-binding_factor
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0044699 single-organism process
biological_process GO:0009560 embryo sac egg cell differentiation
biological_process GO:0006406 mRNA export from nucleus
biological_process GO:0009220 pyrimidine ribonucleotide biosynthetic process
biological_process GO:0042991 transcription factor import into nucleus
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005622 intracellular
molecular_function GO:0003674 molecular_function
molecular_function GO:0005488 binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmoCh19G011270.1   CmoCh19G011270.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term    IPR Description        Source    Source Term   Source Description            Alignment
IPR005612   CCAAT-binding factor   PFAM      PF03914       CBF                           coord: 8..166; score: 1.5
None        No IPR available       unknown   Coil          Coil                          coord: 222..242; scor
None        No IPR available       PANTHER   PTHR14428     NUCLEOLAR COMPLEX PROTEIN 3   coord: 1..255; score: 7.8

The following gene(s) are paralogous to this gene:

None