CmoCh02G015020 (gene) Cucurbita moschata (Rifu)

Name: CmoCh02G015020
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: RNA-binding NOB1
Location: Cmo_Chr02 : 8767695 .. 8770525 (+)
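
The span implied by the Location field can be worked out directly from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page); `feature_length` is a hypothetical helper name.

```python
import re

def feature_length(location: str) -> int:
    """Return the inclusive length of a 'start .. end (strand)' location string.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", location)
    if match is None:
        raise ValueError(f"no coordinates found in {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1

print(feature_length("Cmo_Chr02 : 8767695 .. 8770525 (+)"))  # 2831
```

Under that assumption the feature covers 2,831 bp on the plus strand of Cmo_Chr02.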
The following sequences are available for this feature:

Gene sequence (with intron)

AAGCTTTCGCGTTCGTTCTCCCTCTACTTCTTGCAGCCATGGAGAGCCCTTATCCGGCTTCATGCTGGAGCAATGTCGTCAAATCTCAACCTGCTCCGAAGCCTCAGCATCAGACTCCTACCTCCACCGTCCAAGTATTCGCCGACAGCTGCAAGTCCAGTCAAGGCGTTGCTGTTGCTGTTGTTGATGCCAACGCCATCATTCAAGGGGGAGAAAAGCTCTCCACCTGCGCGGACAAATTTGTTTCCGTTCCTGAGGTTTTGGATGAGGTTCGCGATCCCGTCTCTCGCCACAGACTCGCCTTCGTCCCCTTCACTCTCGAATCCATGGATCCCTCCCCCGAAGCTCTCAATAAAGGTAGACTGTTGAATTATGGTCCCTGTCTGTTTCTACGCGTTAATTGTGTTGTTAGTTTGAAGTTTAAACGGTTGTTGTTTCTAAGGACCGCTCTGATGTAAGGTTTAAGCGGAACTTCTGTTAGAGTGAGTTTGGGATGGTTTTTTTAGAGTCGTGTTTCCGAAGTGAAAGCCTTTTTGAAATAACAATGATTTAGCGATACCACTATATCAGTGTGCATTCCTTTTTTCTACCTTTCTTATGTATTTTCTGTTGAATCTTACAGTAATCAAGTTTGCAAGGGCAACTGGTGACTTACAGACCCTTTCAGATGTTGATATTAAACTTATTGCCCTCACTTACACGTTGGAGACTCAGATCCATGGGACCAAACATCTCCGTGAGTGTCCTCCCCCTGTCCACATGGTTAATACAAAGAGGTTACCTGAGAAAGACTTGCCTGGGTGGGGCTCTAACGTTCCTAATCTGGAAGAGTGGGAAGCATTAGAGCAAGATGCTGATGCTCCGCTTGATACCACATCAAAGATCCTTCCTTTGCAGGATTTAAACCTGAACATAGTCCCTTCAGATGGCCAATCCGAAGATCTTTCATTAGAGCACAAGGATGAGCATAACTCTGAGCATCCAGATGAAACTGAGAGTGGTTCAAGAAGATCAAGGAGATATCCTCCAAAGAAAAAGGAAATTAATATCGAAGGGAAGAAAATGGTGGCTGATGGAATTGATGCATCTCAGGGACAATATGATGACAATGAGGGTGATTGGACACCTGCTGTCAGTCGAAGTACTCAGAGAAGATATCTTAGGAGGAAAGCCCGGCGTGAATATTATGAGGCCTTAGCTGAAAAGGACAGTCAGCAAGATGTTGAAACCACAGATGGTGATGTCCTAGTGGAAAATAACAGATCAGGTCAATCACAGGATCAAATCTCGGAACCAATTACCGGAAATGGGAATGACTGTCAGATAGCAGAAGGGACAAACAACAATGAAAACCTCTCTGAAATTTTGAATCAGATGAGATTGGAAGAAGATTCATTAAATGCTTTTCACATGGAAGGGCTTGACGCTTCGAAAAAAGAAGAATTGGACATAAGTGAGGTGGAGAATATTGTAGCTGCTGAGGGTATCAACGATACAGCGAAGGATGAGATGGAACATTTAGAATACTCGAGTCAGACTAATGAAAGTGTAGACACATCAAATATAGATGATATTAGCAGTGATCAGAGTTGGATGCTGAGATCCTTGTCTGAATCAAGCGTAGCTTGTGTAACTGGTGACTTTGCAATGCAGAATGTCCTTTTGCAAATGGGTTTACGTTTGTTGGCTCCAGGAGGAATGCAGATCCGCCAACTGCATAGGTATGTATATTTAGTGGTGATTTGCTATTATGTGATCTTTAAAAGTTTGTTAACTCACCAATTACGAATCGTTTTTATAATCAAATTCGATCATCCAAAAGTCGCAACTAAATGATTCACCTTTTTGACAACCTTCTTTCCAGTCGGTGGTTTGGTTGTGTTCATTTTATAGCTCTAGCCAACTTTCTGGTCCGAGTCAGTATAAATTTCCTTGTAATGTTGATTGAGTAGTTTGAGATGAAGCCATGCAATGTAGTGATGAATTATGTCCTCAAATCT
TCATGATGTGTCGTTCAGGTGGATTTTGAAATGTCACGCCTGCTACAATGTCACAGCTGAAATTGGAAAGATCTTTTGTCCCAAGTGTGGAAACGGTGGAACTTTGCGCAAGGTAGCTGTTACAGTTGGCGAGAACGGAGTTGTGTTAGCTTCCCGTAAGCCAAGGATTACTCTGCGTGGCACAAAGGTAGTTTCTCTGATTGTATTGTCCATCAGAAGAATAATTAATTTCATTTTCAAATGTGCCTTTTCCTTTTATGTACAGTTTTCACTTCCTCTACCCCAAGGTGGAAGGGATGCCATCACCAAGAATCTTGTTTTACGTGAAGATCAACTACCGCAGAAGTTTCTTCATCCCAAGACCAAGAAGAAAGTCAATAAACAGGTTATTTCCAATCCAAGTTGCCTCACATTTATCTTTAACCAGTCTGTTGTTTTGGTTTATTTATGATCGGAATTCGTGAAGTTTTCAACTCACCTTCATGACCTTGTAATCTCCATTTTTCATTCTCAGGGAGACGATTTCTTTGCCGTGGATGATTTCTTCAGCCATCATAACACTGATAAGAGAGCTCCTTTGCAGCCTCCCGTAAGGCAAGCTTTGGCAGTTTTTAGCGGGAAGAGAAATCCAAACGATAACCATTACTCTCGGTCTCATCATAAATAGACTATTGTTCATGCATTTGATTTTCAATGCTGCTTTCAGAGTTTCCAATTTTGATGAATATCTTATGTGATTGAGTATAGTTAATGTATTCAAATTAAAGAGCTTACAATGAGTGCAATTTTGCCCTGAATTTTCATCAAAATACCTGTCCGGACTGATTTTAT

mRNA sequence

AAGCTTTCGCGTTCGTTCTCCCTCTACTTCTTGCAGCCATGGAGAGCCCTTATCCGGCTTCATGCTGGAGCAATGTCGTCAAATCTCAACCTGCTCCGAAGCCTCAGCATCAGACTCCTACCTCCACCGTCCAAGTATTCGCCGACAGCTGCAAGTCCAGTCAAGGCGTTGCTGTTGCTGTTGTTGATGCCAACGCCATCATTCAAGGGGGAGAAAAGCTCTCCACCTGCGCGGACAAATTTGTTTCCGTTCCTGAGGTTTTGGATGAGGTTCGCGATCCCGTCTCTCGCCACAGACTCGCCTTCGTCCCCTTCACTCTCGAATCCATGGATCCCTCCCCCGAAGCTCTCAATAAAGTAATCAAGTTTGCAAGGGCAACTGGTGACTTACAGACCCTTTCAGATGTTGATATTAAACTTATTGCCCTCACTTACACGTTGGAGACTCAGATCCATGGGACCAAACATCTCCGTGAGTGTCCTCCCCCTGTCCACATGGTTAATACAAAGAGGTTACCTGAGAAAGACTTGCCTGGGTGGGGCTCTAACGTTCCTAATCTGGAAGAGTGGGAAGCATTAGAGCAAGATGCTGATGCTCCGCTTGATACCACATCAAAGATCCTTCCTTTGCAGGATTTAAACCTGAACATAGTCCCTTCAGATGGCCAATCCGAAGATCTTTCATTAGAGCACAAGGATGAGCATAACTCTGAGCATCCAGATGAAACTGAGAGTGGTTCAAGAAGATCAAGGAGATATCCTCCAAAGAAAAAGGAAATTAATATCGAAGGGAAGAAAATGGTGGCTGATGGAATTGATGCATCTCAGGGACAATATGATGACAATGAGGGTGATTGGACACCTGCTGTCAGTCGAAGTACTCAGAGAAGATATCTTAGGAGGAAAGCCCGGCGTGAATATTATGAGGCCTTAGCTGAAAAGGACAGTCAGCAAGATGTTGAAACCACAGATGGTGATGTCCTAGTGGAAAATAACAGATCAGGTCAATCACAGGATCAAATCTCGGAACCAATTACCGGAAATGGGAATGACTGTCAGATAGCAGAAGGGACAAACAACAATGAAAACCTCTCTGAAATTTTGAATCAGATGAGATTGGAAGAAGATTCATTAAATGCTTTTCACATGGAAGGGCTTGACGCTTCGAAAAAAGAAGAATTGGACATAAGTGAGGTGGAGAATATTGTAGCTGCTGAGGGTATCAACGATACAGCGAAGGATGAGATGGAACATTTAGAATACTCGAGTCAGACTAATGAAAGTGTAGACACATCAAATATAGATGATATTAGCAGTGATCAGAGTTGGATGCTGAGATCCTTGTCTGAATCAAGCGTAGCTTGTGTAACTGGTGACTTTGCAATGCAGAATGTCCTTTTGCAAATGGGTTTACGTTTGTTGGCTCCAGGAGGAATGCAGATCCGCCAACTGCATAGGTGGATTTTGAAATGTCACGCCTGCTACAATGTCACAGCTGAAATTGGAAAGATCTTTTGTCCCAAGTGTGGAAACGGTGGAACTTTGCGCAAGGTAGCTGTTACAGTTGGCGAGAACGGAGTTGTGTTAGCTTCCCGTAAGCCAAGGATTACTCTGCGTGGCACAAAGTTTTCACTTCCTCTACCCCAAGGTGGAAGGGATGCCATCACCAAGAATCTTGTTTTACGTGAAGATCAACTACCGCAGAAGTTTCTTCATCCCAAGACCAAGAAGAAAGTCAATAAACAGGGAGACGATTTCTTTGCCGTGGATGATTTCTTCAGCCATCATAACACTGATAAGAGAGCTCCTTTGCAGCCTCCCGTAAGGCAAGCTTTGGCAGTTTTTAGCGGGAAGAGAAATCCAAACGATAACCATTACTCTCGGTCTCATCATAAATAGACTATTGTTCATGCATTTGATTTTCAATGCTGCTTTCAGAGTTTCCAATTTTGATGAATATCTTATGTGATTGAGTATAGTTAATGTATTCAAATTAAAGAG
CTTACAATGAGTGCAATTTTGCCCTGAATTTTCATCAAAATACCTGTCCGGACTGATTTTAT

Coding sequence (CDS)

ATGGAGAGCCCTTATCCGGCTTCATGCTGGAGCAATGTCGTCAAATCTCAACCTGCTCCGAAGCCTCAGCATCAGACTCCTACCTCCACCGTCCAAGTATTCGCCGACAGCTGCAAGTCCAGTCAAGGCGTTGCTGTTGCTGTTGTTGATGCCAACGCCATCATTCAAGGGGGAGAAAAGCTCTCCACCTGCGCGGACAAATTTGTTTCCGTTCCTGAGGTTTTGGATGAGGTTCGCGATCCCGTCTCTCGCCACAGACTCGCCTTCGTCCCCTTCACTCTCGAATCCATGGATCCCTCCCCCGAAGCTCTCAATAAAGTAATCAAGTTTGCAAGGGCAACTGGTGACTTACAGACCCTTTCAGATGTTGATATTAAACTTATTGCCCTCACTTACACGTTGGAGACTCAGATCCATGGGACCAAACATCTCCGTGAGTGTCCTCCCCCTGTCCACATGGTTAATACAAAGAGGTTACCTGAGAAAGACTTGCCTGGGTGGGGCTCTAACGTTCCTAATCTGGAAGAGTGGGAAGCATTAGAGCAAGATGCTGATGCTCCGCTTGATACCACATCAAAGATCCTTCCTTTGCAGGATTTAAACCTGAACATAGTCCCTTCAGATGGCCAATCCGAAGATCTTTCATTAGAGCACAAGGATGAGCATAACTCTGAGCATCCAGATGAAACTGAGAGTGGTTCAAGAAGATCAAGGAGATATCCTCCAAAGAAAAAGGAAATTAATATCGAAGGGAAGAAAATGGTGGCTGATGGAATTGATGCATCTCAGGGACAATATGATGACAATGAGGGTGATTGGACACCTGCTGTCAGTCGAAGTACTCAGAGAAGATATCTTAGGAGGAAAGCCCGGCGTGAATATTATGAGGCCTTAGCTGAAAAGGACAGTCAGCAAGATGTTGAAACCACAGATGGTGATGTCCTAGTGGAAAATAACAGATCAGGTCAATCACAGGATCAAATCTCGGAACCAATTACCGGAAATGGGAATGACTGTCAGATAGCAGAAGGGACAAACAACAATGAAAACCTCTCTGAAATTTTGAATCAGATGAGATTGGAAGAAGATTCATTAAATGCTTTTCACATGGAAGGGCTTGACGCTTCGAAAAAAGAAGAATTGGACATAAGTGAGGTGGAGAATATTGTAGCTGCTGAGGGTATCAACGATACAGCGAAGGATGAGATGGAACATTTAGAATACTCGAGTCAGACTAATGAAAGTGTAGACACATCAAATATAGATGATATTAGCAGTGATCAGAGTTGGATGCTGAGATCCTTGTCTGAATCAAGCGTAGCTTGTGTAACTGGTGACTTTGCAATGCAGAATGTCCTTTTGCAAATGGGTTTACGTTTGTTGGCTCCAGGAGGAATGCAGATCCGCCAACTGCATAGGTGGATTTTGAAATGTCACGCCTGCTACAATGTCACAGCTGAAATTGGAAAGATCTTTTGTCCCAAGTGTGGAAACGGTGGAACTTTGCGCAAGGTAGCTGTTACAGTTGGCGAGAACGGAGTTGTGTTAGCTTCCCGTAAGCCAAGGATTACTCTGCGTGGCACAAAGTTTTCACTTCCTCTACCCCAAGGTGGAAGGGATGCCATCACCAAGAATCTTGTTTTACGTGAAGATCAACTACCGCAGAAGTTTCTTCATCCCAAGACCAAGAAGAAAGTCAATAAACAGGGAGACGATTTCTTTGCCGTGGATGATTTCTTCAGCCATCATAACACTGATAAGAGAGCTCCTTTGCAGCCTCCCGTAAGGCAAGCTTTGGCAGTTTTTAGCGGGAAGAGAAATCCAAACGATAACCATTACTCTCGGTCTCATCATAAATAG
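
As a quick consistency check between the CDS and the protein used as the BLAST query, the first 60 nucleotides of the CDS can be translated with the standard genetic code. This is a minimal sketch: `translate` and `CODON_TABLE` are illustrative names, and the standard (nuclear) code is assumed.

```python
# Standard genetic code, laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 60 nt of the CDS above.
cds_start = "ATGGAGAGCCCTTATCCGGCTTCATGCTGGAGCAATGTCGTCAAATCTCAACCTGCTCCG"
print(translate(cds_start))  # MESPYPASCWSNVVKSQPAP
```

The result agrees with residues 1-20 of the Query lines in the full-length alignments further down (MESPYPASCWSNVVKSQPAP).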
BLAST of CmoCh02G015020 vs. Swiss-Prot
Match: NOB1_MACFA (RNA-binding protein NOB1 OS=Macaca fascicularis GN=NOB1 PE=2 SV=1)

HSP 1 Score: 125.6 bits (314), Expect = 1.9e-27
Identity = 74/176 (42.05%), Positives = 109/176 (61.93%), Query Frame = 1

Query: 440 VACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWILKCHACYNVTAEIGKIFCPKCGNG 499
           V CVT DFAMQNVLLQMGL +LA  GM IR+   +IL+CH C+  T+++ ++FC  CGN 
Sbjct: 232 VGCVTTDFAMQNVLLQMGLHVLAVNGMLIREARSYILRCHGCFKTTSDMSRVFCAHCGN- 291

Query: 500 GTLRKVAVTVGENGVVLA--SRKPRI-TLRGTKFSLPLPQGGRDAITKNLVLREDQLPQK 559
            TL+KV+VTV ++G +    SR P++   RG ++SLP P+GG+ A+  +L   + + PQ 
Sbjct: 292 KTLKKVSVTVSDDGALHMHFSRNPKVLNPRGLRYSLPTPKGGKYAVNPHLT-EDQRFPQL 351

Query: 560 FLHPKTKKKVNKQGDDFFA-VDDFFSHHNTDKRAPLQPPVRQALAVFSGKRNPNDN 612
            L  K ++K N    D+ A V  F  +  + + A LQ  VR +  + +G+R  N N
Sbjct: 352 RLSRKARQKTNVFAPDYIAGVSPFVENDVSSRSATLQ--VRDS-TLGAGRRRLNPN 402
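
The percentages on each HSP summary line are simply matches divided by alignment length. A small sketch for recomputing them from the text of the line follows; `hsp_percentages` is a hypothetical helper, and the labels are taken from whatever precedes each `matches/length` field.

```python
import re

def hsp_percentages(summary: str) -> dict:
    """Recompute percentages from 'Label = matches/length' fields on an HSP summary line."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", summary):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 74/176 (42.05%), Positives = 109/176 (61.93%), Query Frame = 1"
print(hsp_percentages(line))  # {'Identity': 42.05, 'Positives': 61.93}
```

Both values agree with the figures reported for this HSP.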

BLAST of CmoCh02G015020 vs. Swiss-Prot
Match: NOB1_HUMAN (RNA-binding protein NOB1 OS=Homo sapiens GN=NOB1 PE=1 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 2.5e-27
Identity = 74/176 (42.05%), Positives = 109/176 (61.93%), Query Frame = 1

Query: 440 VACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWILKCHACYNVTAEIGKIFCPKCGNG 499
           V C+T DFAMQNVLLQMGL +LA  GM IR+   +IL+CH C+  T+++ ++FC  CGN 
Sbjct: 232 VGCLTTDFAMQNVLLQMGLHVLAVNGMLIREARSYILRCHGCFKTTSDMSRVFCSHCGN- 291

Query: 500 GTLRKVAVTVGENGVVLA--SRKPRI-TLRGTKFSLPLPQGGRDAITKNLVLREDQLPQK 559
            TL+KV+VTV ++G +    SR P++   RG ++SLP P+GG+ AI  +L   + + PQ 
Sbjct: 292 KTLKKVSVTVSDDGTLHMHFSRNPKVLNPRGLRYSLPTPKGGKYAINPHLT-EDQRFPQL 351

Query: 560 FLHPKTKKKVNKQGDDFFA-VDDFFSHHNTDKRAPLQPPVRQALAVFSGKRNPNDN 612
            L  K ++K N    D+ A V  F  +  + + A LQ  VR +  + +G+R  N N
Sbjct: 352 RLSQKARQKTNVFAPDYIAGVSPFVENDISSRSATLQ--VRDS-TLGAGRRRLNPN 402

BLAST of CmoCh02G015020 vs. Swiss-Prot
Match: NOB1_RAT (RNA-binding protein NOB1 OS=Rattus norvegicus GN=Nob1 PE=2 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 4.8e-26
Identity = 81/217 (37.33%), Positives = 125/217 (57.60%), Query Frame = 1

Query: 400 KDEMEHLEYSSQTNES-VDTSNIDDISSDQSWMLRSLSESSVACVTGDFAMQNVLLQMGL 459
           ++E + LE S       +  SNI  I  + S       +  V CVT DFAMQNVLLQMGL
Sbjct: 190 EEEEDELEDSDDDGGGWITPSNIKQIQHE-SEQCDIPKDVQVGCVTTDFAMQNVLLQMGL 249

Query: 460 RLLAPGGMQIRQLHRWILKCHACYNVTAEIGKIFCPKCGNGGTLRKVAVTVGENGVVLA- 519
            +LA  GM +R+   +IL+CH C+  T+++ ++FC  CGN  TL+KV+VT+ ++G +   
Sbjct: 250 HVLAVNGMLVREARSYILRCHGCFKTTSDMNRVFCGHCGN-KTLKKVSVTINDDGTLHMH 309

Query: 520 -SRKPRI-TLRGTKFSLPLPQGGRDAITKNLVLREDQLPQKFLHPKTKKKVNKQGDDFFA 579
            SR P++   RG ++SLP P+GG+ A+  +L   + + PQ  L  K ++K N    D+ A
Sbjct: 310 FSRNPKVLNPRGLRYSLPTPKGGKYAVNPHLT-EDQRFPQLRLSHKARQKTNVFAPDYIA 369

Query: 580 -VDDFFSHHNTDKRAPLQPPVRQALAVFSGKRNPNDN 612
            V  F  +  + + A LQ  VR ++ + +G+R  N N
Sbjct: 370 GVSPFAENDISSRSAILQ--VRDSM-LGAGRRRLNPN 400

BLAST of CmoCh02G015020 vs. Swiss-Prot
Match: NOB1_PONAB (RNA-binding protein NOB1 OS=Pongo abelii GN=NOB1 PE=2 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 8.2e-26
Identity = 87/246 (35.37%), Positives = 135/246 (54.88%), Query Frame = 1

Query: 385 EVENIVAAEGINDTAKDEMEHLEYSSQTNESVDT-------SNIDDISSDQSWMLRSLSE 444
           E++ ++   G +  + +E E   +  + ++S D        SNI  I  +         +
Sbjct: 170 ELQELLIDRGEDIPSDEEEEENGFEDRRDDSDDDGGGWITPSNIKQIQQELE-QCDVPED 229

Query: 445 SSVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWILKCHACYNVTAEIGKIFCPKCG 504
             V CVT DFAMQNVLLQMGL +LA  GM IR+   +IL+CH C+  T+++ ++FC  CG
Sbjct: 230 VRVGCVTTDFAMQNVLLQMGLHVLAVNGMLIREARSYILRCHGCFKTTSDMSRVFCSHCG 289

Query: 505 NGGTLRKVAVTVGENGVVLA--SRKPRI-TLRGTKFSLPLPQGGRDAITKNLVLREDQLP 564
           N  TL+KV+VTV ++G +    SR P++   RG ++SLP P+GG+ AI  +L   + + P
Sbjct: 290 N-KTLKKVSVTVSDDGTLHMHFSRNPKVLNPRGLRYSLPTPKGGKYAINPHLT-EDQRFP 349

Query: 565 QKFLHPKTKKKVNKQGDDFFA-VDDFFSHHNTDKRAPLQPPVRQALAVFSGKRNPNDNHY 620
           Q  L  K ++K N    D+ A V  F  +  + + A LQ  VR + ++ +G+R  N N  
Sbjct: 350 QLRLSRKARQKTNVFAPDYVAGVSPFVENDISSRSATLQ--VRDS-SLGAGRRRLNPNAS 409

BLAST of CmoCh02G015020 vs. Swiss-Prot
Match: NOB1_BOVIN (RNA-binding protein NOB1 OS=Bos taurus GN=NOB1 PE=2 SV=1)

HSP 1 Score: 119.8 bits (299), Expect = 1.1e-25
Identity = 84/237 (35.44%), Positives = 129/237 (54.43%), Query Frame = 1

Query: 379 EELDISEVENIVAAEGINDTAKDEMEHLEYSSQTNESVDTSNIDDISSDQSWMLRSLSES 438
           +EL +   E++   E   +   DE +  +        +  SNI  I  +         + 
Sbjct: 173 QELLMDGGEDVPNEEEDEENGLDERQDEDSDDDGGGWITPSNIKQIQQEMK-QCAVPKDV 232

Query: 439 SVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWILKCHACYNVTAEIGKIFCPKCGN 498
            V CVT DFAMQNVLLQMGL +LA  GM IR+   +IL+CH C+  T+++ ++FC  CGN
Sbjct: 233 RVGCVTTDFAMQNVLLQMGLHVLAVNGMLIREARSYILRCHGCFKTTSDMSRVFCAHCGN 292

Query: 499 GGTLRKVAVTVGENGVVLA--SRKPRI-TLRGTKFSLPLPQGGRDAITKNLVLREDQLPQ 558
             TL+KV+VTV ++G +    SR P++   RG ++SLP P+GG+ AI  +L   + + PQ
Sbjct: 293 -KTLKKVSVTVSDDGTLHMHFSRNPKVLNPRGLRYSLPTPKGGKYAINPHLT-EDQRFPQ 352

Query: 559 KFLHPKTKKKVNKQGDDFFA-VDDFFSHHNTDKRAPLQPPVRQALAVFSGKRNPNDN 612
             L  K ++K +    D+ A V  F  +  + + A LQ  VR +  + +G+R  N N
Sbjct: 353 LRLSRKARQKTDVFAPDYVAGVSPFAENDISSRSATLQ--VRDS-TLGAGRRRLNPN 403

BLAST of CmoCh02G015020 vs. TrEMBL
Match: A0A0A0KML5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G154760 PE=4 SV=1)

HSP 1 Score: 1085.1 bits (2805), Expect = 0.0e+00
Identity = 554/620 (89.35%), Positives = 582/620 (93.87%), Query Frame = 1

Query: 1   MESPYPASCWSNVVKSQPAPKPQHQTPTSTVQVFADSCKSSQGVAVAVVDANAIIQGGEK 60
           ME+P PASCWSNVVK+QPAPKPQHQTP+S+VQVFADSCKSS+GVAVAVVDANAIIQGG+K
Sbjct: 1   METPSPASCWSNVVKTQPAPKPQHQTPSSSVQVFADSCKSSKGVAVAVVDANAIIQGGDK 60

Query: 61  LSTCADKFVSVPEVLDEVRDPVSRHRLAFVPFTLESMDPSPEALNKVIKFARATGDLQTL 120
           LS+ ADKFVSVPEVLDE+RDPVSRHRLAFVPFTLESMDPSP+ALNKVIKFARATGDLQTL
Sbjct: 61  LSSSADKFVSVPEVLDEIRDPVSRHRLAFVPFTLESMDPSPDALNKVIKFARATGDLQTL 120

Query: 121 SDVDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDLPGWGSNVPNLEEWEAL 180
           SDVDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKD+PGWGSNVPNLEEWEAL
Sbjct: 121 SDVDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDMPGWGSNVPNLEEWEAL 180

Query: 181 EQDADAPLDTTSKILPLQDLNLNIVPSDGQSEDLSLEHKDEHNSEHPDETESGSRRSRRY 240
           EQDAD P   TSKILPLQDLNLNI+PSDGQSEDLSLEHKD+ N EH DETES SRRSRRY
Sbjct: 181 EQDADDPSRLTSKILPLQDLNLNIIPSDGQSEDLSLEHKDDDNLEHLDETESDSRRSRRY 240

Query: 241 PPKKKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRYLRRKARREYYEALAE 300
           PPKKKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRY RRKARREYYE+LAE
Sbjct: 241 PPKKKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRYHRRKARREYYESLAE 300

Query: 301 KDSQQDVETTDGDVLVENNRSGQSQDQISE-PITGNGNDCQIAEGTNNNENLSEILNQMR 360
           KDSQQDVETT+GD+ VE N SGQS+D+ISE P TGNGN+ QI EGTNNNEN+SEIL QMR
Sbjct: 301 KDSQQDVETTNGDIHVEFNGSGQSEDKISELPNTGNGNESQIGEGTNNNENISEILKQMR 360

Query: 361 LEEDSLNAFHMEGLDASKKEELDISEVENIVAAEGINDTAKDEMEHLEYSSQTNESVDTS 420
           LEEDSLNA HM    AS KE  D SE EN VA EG  D  KDEMEH+E +SQTNESVD S
Sbjct: 361 LEEDSLNALHM---SASTKEGSDESEGENAVAVEGTKDAEKDEMEHMEDASQTNESVDMS 420

Query: 421 NIDDISSDQSWMLRSLSESSVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWILKCH 480
           N+DD+SSDQSWMLRSLSESSVACVTGD+AMQNVLLQMGLRLLAPGGMQIRQLHRWILKCH
Sbjct: 421 NVDDVSSDQSWMLRSLSESSVACVTGDYAMQNVLLQMGLRLLAPGGMQIRQLHRWILKCH 480

Query: 481 ACYNVTAEIGKIFCPKCGNGGTLRKVAVTVGENGVVLASRKPRITLRGTKFSLPLPQGGR 540
           ACYNVTAEIG+IFCPKCGNGGTLRKVAVTVGENGVVLA+RKPRITLRGTKFSLPLPQGGR
Sbjct: 481 ACYNVTAEIGRIFCPKCGNGGTLRKVAVTVGENGVVLAARKPRITLRGTKFSLPLPQGGR 540

Query: 541 DAITKNLVLREDQLPQKFLHPKTKKKVNKQGDDFFAVDDFFSHHNTDKRAPLQPPVRQAL 600
           DAITKNLVLREDQLPQKFLHPKTKKKVNKQGD+FFAVDDFFSHHNTDKRAPLQPPVRQAL
Sbjct: 541 DAITKNLVLREDQLPQKFLHPKTKKKVNKQGDEFFAVDDFFSHHNTDKRAPLQPPVRQAL 600

Query: 601 AVFSGKRNPNDNHYSRSHHK 620
           AVFSGKRNPNDNHYSRS H+
Sbjct: 601 AVFSGKRNPNDNHYSRSKHR 617

BLAST of CmoCh02G015020 vs. TrEMBL
Match: A0A061G6A8_THECC (RNA-binding protein nob1, putative isoform 1 OS=Theobroma cacao GN=TCM_016231 PE=4 SV=1)

HSP 1 Score: 832.0 bits (2148), Expect = 4.7e-238
Identity = 437/624 (70.03%), Positives = 515/624 (82.53%), Query Frame = 1

Query: 3   SPYPASCWSNVVKSQPAPKPQHQTPTS-TVQVFADSCKSSQGVAVAVVDANAIIQGGEKL 62
           +P PASCWSNV+KSQP PKPQ Q  T+ T Q+F +SCKS++G+AVAVVDANA+I+GGEKL
Sbjct: 10  NPNPASCWSNVLKSQP-PKPQTQKQTAATTQLFVESCKSTKGIAVAVVDANAVIEGGEKL 69

Query: 63  STCADKFVSVPEVLDEVRDPVSRHRLAFVPFTLESMDPSPEALNKVIKFARATGDLQTLS 122
           +  AD+FV+VPEVL E+RDPVSRHRLAF+PF+++SM+PS +ALNKVIKFARATGDLQTLS
Sbjct: 70  NNSADRFVTVPEVLAEIRDPVSRHRLAFIPFSIDSMEPSSDALNKVIKFARATGDLQTLS 129

Query: 123 DVDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDLPGWGSNVPNLEEWEALE 182
           DVD+KLIALTYTLE QIHGT H+R+ PPPVH+VN KRLPE+DLPGWGSNVPNL+EWEALE
Sbjct: 130 DVDLKLIALTYTLEAQIHGTNHIRDAPPPVHVVNVKRLPERDLPGWGSNVPNLDEWEALE 189

Query: 183 QDADAPLDTTSKILPLQDLNLNIVPSDGQSEDLSLEHKDEHNSEHPDETESGSRRSRRYP 242
           ++A+   ++ S+ILPL+DLN+N +PSD  SED S+E K E +SE+ ++ E G RR RRY 
Sbjct: 190 REAEGGTNSNSRILPLKDLNMNTLPSDNGSEDGSVEIKSETHSENQEDVEHGFRRPRRYL 249

Query: 243 PKKKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRYLRRKARREYYEALAEK 302
           P+KKE+ IEGKKMVADGIDASQGQ DDN  +W PAVSRST RRYLRRKARREYYEAL EK
Sbjct: 250 PQKKEVKIEGKKMVADGIDASQGQIDDNGDNWQPAVSRSTHRRYLRRKARREYYEALVEK 309

Query: 303 DSQQDVETTDGDVLVENNRSGQSQDQISEPITGNG--NDCQIAEGTNNNENLSEILNQMR 362
           D Q+D+E +              ++ + +  +GNG   + + AE    +E+LS IL QMR
Sbjct: 310 DCQEDMEKS------------MDKNNVEDAHSGNGILEETERAEEKKGDEDLSSILKQMR 369

Query: 363 LEEDSLNAFHMEGLDASKKEELDISEVENI-VAAEGIN-DTAKDEMEHLEYSSQTNESVD 422
           LEEDSL A         + EE++I+   N+ ++ EG   D   +E++ LE SSQTNE+VD
Sbjct: 370 LEEDSLEALQ-------EAEEVEITVEANVNLSVEGNKMDLVNEELDQLEMSSQTNETVD 429

Query: 423 TSNIDDISSDQSWMLRSLSESSVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWILK 482
            S  DD+S +QSWMLRSLSESSVACVTGDFAMQNV+LQMGLRLLAPGGMQIRQLHRWILK
Sbjct: 430 ASYTDDVSCEQSWMLRSLSESSVACVTGDFAMQNVILQMGLRLLAPGGMQIRQLHRWILK 489

Query: 483 CHACYNVTAEIGKIFCPKCGNGGTLRKVAVTVGENGVVLASRKPRITLRGTKFSLPLPQG 542
           CHACYNVTAEIG+IFCPKCGNGGTLRKVAVTVGENG+VLAS +PRI+LRGTKFSLPLPQG
Sbjct: 490 CHACYNVTAEIGRIFCPKCGNGGTLRKVAVTVGENGIVLASHRPRISLRGTKFSLPLPQG 549

Query: 543 GRDAITKNLVLREDQLPQKFLHPKTKKKVNKQGDD--FFAVDDFFSHHNTDKRAPLQPPV 602
           GRDAITKNL+LREDQLPQKFL+PKTKKKVNKQGDD  F  VD F   H+TDKRAPLQPPV
Sbjct: 550 GRDAITKNLILREDQLPQKFLYPKTKKKVNKQGDDDLFMGVDTF--THHTDKRAPLQPPV 609

Query: 603 RQALAVFSGKRNPNDNHYSRSHHK 620
           R+ALAVF+GKRNPNDNHYSRS HK
Sbjct: 610 RKALAVFTGKRNPNDNHYSRSKHK 611

BLAST of CmoCh02G015020 vs. TrEMBL
Match: A0A061G5P4_THECC (RNA-binding protein nob1, putative isoform 2 OS=Theobroma cacao GN=TCM_016231 PE=4 SV=1)

HSP 1 Score: 830.5 bits (2144), Expect = 1.4e-237
Identity = 436/623 (69.98%), Positives = 514/623 (82.50%), Query Frame = 1

Query: 3   SPYPASCWSNVVKSQPAPKPQHQTPTS-TVQVFADSCKSSQGVAVAVVDANAIIQGGEKL 62
           +P PASCWSNV+KSQP PKPQ Q  T+ T Q+F +SCKS++G+AVAVVDANA+I+GGEKL
Sbjct: 10  NPNPASCWSNVLKSQP-PKPQTQKQTAATTQLFVESCKSTKGIAVAVVDANAVIEGGEKL 69

Query: 63  STCADKFVSVPEVLDEVRDPVSRHRLAFVPFTLESMDPSPEALNKVIKFARATGDLQTLS 122
           +  AD+FV+VPEVL E+RDPVSRHRLAF+PF+++SM+PS +ALNKVIKFARATGDLQTLS
Sbjct: 70  NNSADRFVTVPEVLAEIRDPVSRHRLAFIPFSIDSMEPSSDALNKVIKFARATGDLQTLS 129

Query: 123 DVDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDLPGWGSNVPNLEEWEALE 182
           DVD+KLIALTYTLE QIHGT H+R+ PPPVH+VN KRLPE+DLPGWGSNVPNL+EWEALE
Sbjct: 130 DVDLKLIALTYTLEAQIHGTNHIRDAPPPVHVVNVKRLPERDLPGWGSNVPNLDEWEALE 189

Query: 183 QDADAPLDTTSKILPLQDLNLNIVPSDGQSEDLSLEHKDEHNSEHPDETESGSRRSRRYP 242
           ++A+   ++ S+ILPL+DLN+N +PSD  SED S+E K E +SE+ ++ E G RR RRY 
Sbjct: 190 REAEGGTNSNSRILPLKDLNMNTLPSDNGSEDGSVEIKSETHSENQEDVEHGFRRPRRYL 249

Query: 243 PKKKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRYLRRKARREYYEALAEK 302
           P+KKE+ IEGKKMVADGIDASQGQ DDN  +W PAVSRST RRYLRRKARREYYEAL EK
Sbjct: 250 PQKKEVKIEGKKMVADGIDASQGQIDDNGDNWQPAVSRSTHRRYLRRKARREYYEALVEK 309

Query: 303 DSQQDVETTDGDVLVENNRSGQSQDQISEPITGNG--NDCQIAEGTNNNENLSEILNQMR 362
           D Q+D+E +              ++ + +  +GNG   + + AE    +E+LS IL QMR
Sbjct: 310 DCQEDMEKS------------MDKNNVEDAHSGNGILEETERAEEKKGDEDLSSILKQMR 369

Query: 363 LEEDSLNAFHMEGLDASKKEELDISEVENI-VAAEGIN-DTAKDEMEHLEYSSQTNESVD 422
           LEEDSL A         + EE++I+   N+ ++ EG   D   +E++ LE SSQTNE+VD
Sbjct: 370 LEEDSLEALQ-------EAEEVEITVEANVNLSVEGNKMDLVNEELDQLEMSSQTNETVD 429

Query: 423 TSNIDDISSDQSWMLRSLSESSVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWILK 482
            S  DD+S +QSWMLRSLSESSVACVTGDFAMQNV+LQMGLRLLAPGGMQIRQLHRWILK
Sbjct: 430 ASYTDDVSCEQSWMLRSLSESSVACVTGDFAMQNVILQMGLRLLAPGGMQIRQLHRWILK 489

Query: 483 CHACYNVTAEIGKIFCPKCGNGGTLRKVAVTVGENGVVLASRKPRITLRGTKFSLPLPQG 542
           CHACYNVTAEIG+IFCPKCGNGGTLRKVAVTVGENG+VLAS +PRI+LRGTKFSLPLPQG
Sbjct: 490 CHACYNVTAEIGRIFCPKCGNGGTLRKVAVTVGENGIVLASHRPRISLRGTKFSLPLPQG 549

Query: 543 GRDAITKNLVLREDQLPQKFLHPKTKKKVNKQGDD--FFAVDDFFSHHNTDKRAPLQPPV 602
           GRDAITKNL+LREDQLPQKFL+PKTKKKVNKQGDD  F  VD F   H+TDKRAPLQPPV
Sbjct: 550 GRDAITKNLILREDQLPQKFLYPKTKKKVNKQGDDDLFMGVDTF--THHTDKRAPLQPPV 609

Query: 603 RQALAVFSGKRNPNDNHYSRSHH 619
           R+ALAVF+GKRNPNDNHYSRS H
Sbjct: 610 RKALAVFTGKRNPNDNHYSRSKH 610

BLAST of CmoCh02G015020 vs. TrEMBL
Match: W9QSA3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_024306 PE=4 SV=1)

HSP 1 Score: 812.4 bits (2097), Expect = 3.8e-232
Identity = 416/618 (67.31%), Positives = 496/618 (80.26%), Query Frame = 1

Query: 4   PYPASCWSNVVKSQPAPKPQHQTPTSTVQVFADSCKSSQGVAVAVVDANAIIQGGEKLST 63
           P PA CWSN++K+Q APKPQ+ +P+ TV VFA+SCKS++G+AVAVVDANAII GGE+LS 
Sbjct: 9   PSPAPCWSNLLKNQTAPKPQNPSPSPTVGVFAESCKSTKGIAVAVVDANAIIDGGERLSQ 68

Query: 64  CADKFVSVPEVLDEVRDPVSRHRLAFVPFTLESMDPSPEALNKVIKFARATGDLQTLSDV 123
           CADKFVSVPEV+DEVRDPVSRHRLAF+PF+++S++PSPE+LNKVIKFARATGDLQTLSDV
Sbjct: 69  CADKFVSVPEVMDEVRDPVSRHRLAFIPFSVQSIEPSPESLNKVIKFARATGDLQTLSDV 128

Query: 124 DIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDLPGWGSNVPNLEEWEALEQD 183
           D+KLIALTYTLE QIHGT+HLRECPPP+H VN KRLPEKD+PGWGSNVPNLEEWEALE  
Sbjct: 129 DLKLIALTYTLEAQIHGTEHLRECPPPIHTVNVKRLPEKDMPGWGSNVPNLEEWEALEHQ 188

Query: 184 ADAPLDTTSKILPLQDLNLNIVPSDGQSEDLSLEHKDEHNSEHPDETESGSRRSRRYPPK 243
           A+   D  S+ILPL+DLNLN+V   G           EH +   D+ E    R RRYPPK
Sbjct: 189 AEDKPDEDSRILPLKDLNLNVVSDAGS----------EHQT---DDGEENVGRPRRYPPK 248

Query: 244 KKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRYLRRKARREYYEALAEKDS 303
           KKEINIEGKKMV DGIDAS+G++D +EGDW PAVSRST RR+LRRKARR+YYE+L+EKD 
Sbjct: 249 KKEINIEGKKMVTDGIDASRGEFDGDEGDWLPAVSRSTHRRFLRRKARRDYYESLSEKDG 308

Query: 304 QQDVETTDGDVLVENNRSGQSQDQISEPITGNGNDCQIAEGTNNNENLSEILNQMRLEED 363
                 ++ + + E+   GQ ++   E   G   + ++ EG N+ ENLS IL+QMRLEED
Sbjct: 309 -----FSEKNEMTEDKNDGQ-ENGTKEEKNGGVEENEVREGKNDEENLSTILHQMRLEED 368

Query: 364 SLNAFHMEGLDASKKEELDISEVENIVAAEGIN-DTAKDEMEHLEYSSQTNESVDTSNI- 423
            +    ++GL             E    AEG   D   +  +HLE SS+ N+ +D SN+ 
Sbjct: 369 LVEEDLVKGLQ------------EGNANAEGDQTDMVSEGHDHLEVSSEINDCIDASNVE 428

Query: 424 DDISSDQSWMLRSLSESSVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWILKCHAC 483
           DD SS+ SWMLRSLSESSVAC+T DFAMQNV+LQMGLRL+APGGMQIRQLHRW+L+CHAC
Sbjct: 429 DDASSEHSWMLRSLSESSVACITSDFAMQNVILQMGLRLVAPGGMQIRQLHRWVLRCHAC 488

Query: 484 YNVTAEIGKIFCPKCGNGGTLRKVAVTVGENGVVLASRKPRITLRGTKFSLPLPQGGRDA 543
           Y VTAEIG+IFCPKCGNGGTLRKVAVTVGENG+ LA+R+PRITLRGTKFSLPLPQGGRDA
Sbjct: 489 YTVTAEIGRIFCPKCGNGGTLRKVAVTVGENGITLAARRPRITLRGTKFSLPLPQGGRDA 548

Query: 544 ITKNLVLREDQLPQKFLHPKTKKKVNKQGDDFFAVDDFFSHHNTDKRAPLQPPVRQALAV 603
           ++KN++LREDQLPQKFL+PKTKKK  KQGDD++  DD FSHH++ K+AP QPPVR+ALAV
Sbjct: 549 VSKNVILREDQLPQKFLYPKTKKKSTKQGDDYYVSDDIFSHHHSHKKAPFQPPVRKALAV 595

Query: 604 FSGKRNPNDNHYSRSHHK 620
           FSGKRNPNDNHY+RS HK
Sbjct: 609 FSGKRNPNDNHYTRSKHK 595

BLAST of CmoCh02G015020 vs. TrEMBL
Match: A0A0B0Q0J6_GOSAR (RNA-binding NOB1 OS=Gossypium arboreum GN=F383_14923 PE=4 SV=1)

HSP 1 Score: 806.2 bits (2081), Expect = 2.8e-230
Identity = 429/625 (68.64%), Positives = 499/625 (79.84%), Query Frame = 1

Query: 6   PASCWSNVVKSQPAPKPQHQTPTS-TVQVFADSCKSSQGVAVAVVDANAIIQGGEKLSTC 65
           P S WSNV+KSQP PKP  Q+ T+ T Q+F +SCKS++G+AVAVVDANA+I+GGEKL+  
Sbjct: 13  PISGWSNVLKSQP-PKPHTQSQTTATSQIFVESCKSTKGIAVAVVDANAVIEGGEKLNNT 72

Query: 66  ADKFVSVPEVLDEVRDPVSRHRLAFVPFTLESMDPSPEALNKVIKFARATGDLQTLSDVD 125
           AD+FV+VPEVL E+RDPVSRHRLAF+PF+++S++PSP+ALNKVIKFARATGDLQTLSDVD
Sbjct: 73  ADRFVTVPEVLAEIRDPVSRHRLAFIPFSIDSVEPSPDALNKVIKFARATGDLQTLSDVD 132

Query: 126 IKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDLPGWGSNVPNLEEWEALEQDA 185
           +KLIALTYTLE+QIHGT HLR+ PPPVH+VN KRLPE+DLPGWGSNVPNLEEWEALE + 
Sbjct: 133 LKLIALTYTLESQIHGTNHLRDAPPPVHVVNVKRLPERDLPGWGSNVPNLEEWEALEPET 192

Query: 186 DAPLDTTSKILPLQDLNLNIVPSDGQSEDLSLEHKDEHNSEHPDETESGSRRSRRYPPKK 245
               ++ S+ILPL+DLN+N V SD  SED+ +E K E +SE+ ++ + G RR RRY P+K
Sbjct: 193 GDGFNSNSRILPLKDLNMNYVSSDNNSEDVLVETKSETHSENQEDIDQGFRRPRRYLPQK 252

Query: 246 KEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRYLRRKARREYYEALAEKDSQ 305
           KE+ IEGKKMVADGIDASQG  DDN  DW PAVSRST RRYLRRKARRE+YEAL EKD Q
Sbjct: 253 KEVKIEGKKMVADGIDASQGHLDDNADDWLPAVSRSTHRRYLRRKARREFYEALVEKDCQ 312

Query: 306 QDVETTDGDVLVENNRSGQS-------QDQISEPITGNG--NDCQIAEGTNNNENLSEIL 365
           +D+E      L ++N    S       Q    E  +GNG   + + AE    + +LS IL
Sbjct: 313 EDMEKG----LEKSNSEDASGCPDRPLQQSAEEVHSGNGISEEAERAEVDKGDCDLSSIL 372

Query: 366 NQMRLEEDSLNAFHMEGLDASKKEELDISEVENIVAAEGINDTAKDEMEHLEYSSQTNES 425
            QMRLEED            +  EE  + +  N+       D   +E++ LE SSQTNE+
Sbjct: 373 KQMRLEEDPARTLGEAKETETTAEEAMLDDSMNLAV-----DGDSEELDQLEMSSQTNET 432

Query: 426 VDTSNIDDISSDQSWMLRSLSESSVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWI 485
           VD S  DD+SS+QSWMLRSLSES+VACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRW+
Sbjct: 433 VDASFTDDVSSEQSWMLRSLSESTVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWV 492

Query: 486 LKCHACYNVTAEIGKIFCPKCGNGGTLRKVAVTVGENGVVLASRKPRITLRGTKFSLPLP 545
           LKCHACY VTAEIG+IFCPKCGNGGTLRKVAVTVGENG+VLAS +PRITLRGTKFSLPLP
Sbjct: 493 LKCHACYTVTAEIGRIFCPKCGNGGTLRKVAVTVGENGIVLASHRPRITLRGTKFSLPLP 552

Query: 546 QGGRDAITKNLVLREDQLPQKFLHPKTKKKVNKQG-DDFFAVDDFFSHHNTDKRAPLQPP 605
           QGGRDAITKNL+LREDQLPQKFL+PKTKKKVNKQG DD F   D F+HH TDKRAPLQPP
Sbjct: 553 QGGRDAITKNLILREDQLPQKFLYPKTKKKVNKQGDDDLFMAGDTFTHH-TDKRAPLQPP 612

Query: 606 VRQALAVFSGKRNPNDNHYSRSHHK 620
           VR+ALAVFSGKRNPNDNHYSRS  K
Sbjct: 613 VRKALAVFSGKRNPNDNHYSRSKQK 626

BLAST of CmoCh02G015020 vs. TAIR10
Match: AT5G41190.1 (AT5G41190.1 Nin one binding (NOB1) Zn-ribbon like (InterPro:IPR014881), D-site 20S pre-rRNA nuclease (InterPro:IPR017117))

HSP 1 Score: 700.3 bits (1806), Expect = 1.1e-201
Identity = 367/623 (58.91%), Positives = 468/623 (75.12%), Query Frame = 1

Query: 4   PYPASCWSNVVKSQPAPKPQ-HQTPTSTVQVFADSCKSSQGVAVAVVDANAIIQGGEKLS 63
           P P S WS++VK  P  KP  +    + +     +CKS++G+++AVVDANAII+G + L+
Sbjct: 3   PKPTSMWSSIVKKDPPSKPPVNDGAPAAILGMVGNCKSTKGISIAVVDANAIIEGRQSLT 62

Query: 64  TCADKFVSVPEVLDEVRDPVSRHRLAFVPFTLESMDPSPEALNKVIKFARATGDLQTLSD 123
             ADKFV+VPEVL E+RDP SR RLAF+PFT+++M+PSPE+L+KVIKFARATGDLQ+LSD
Sbjct: 63  NFADKFVTVPEVLSEIRDPASRRRLAFIPFTIDTMEPSPESLSKVIKFARATGDLQSLSD 122

Query: 124 VDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDLPGWGSNVPNLEEWEALEQ 183
           VD+KLIAL+YTLE Q++GTK+LR+ PPP+  V  KRLPEKDLPGWGSNV NLEEWEALE 
Sbjct: 123 VDLKLIALSYTLEAQVYGTKNLRDVPPPIQTVRVKRLPEKDLPGWGSNVANLEEWEALEN 182

Query: 184 DADAPLDTTSKILPLQDLNLNIVPSDGQSEDLSLEHKDEHNSEHPDETESGSRRSRRYPP 243
           + +   +  SKILPL+DLN+NI+ SD  SE  S+     H   H ++ + G ++ RRYPP
Sbjct: 183 ETEEKSNANSKILPLKDLNMNIIASDNVSEVGSVV---SHTENHEEDVQEGGKKHRRYPP 242

Query: 244 KKKEINIEGKKMVADGIDASQGQYDDNE--GDWTPAVSRSTQRRYLRRKARREYYEALAE 303
           KK EI +EGK MV +G+DASQGQYDD++   DW PAVSRST  +YLRRKAR E+Y ALAE
Sbjct: 243 KKTEIKLEGK-MVVEGVDASQGQYDDDDDASDWRPAVSRSTHSKYLRRKARWEHYNALAE 302

Query: 304 KDSQQDVETTDGDVLVENNRSGQSQDQISEPITGNGNDCQIAEGTNNNENLSEILNQMRL 363
           ++ Q+D                Q  D+     T   N+    +   N E++S IL  MRL
Sbjct: 303 QEIQKD----------------QEADKARH--TKEANETHAKDSGKNGEDISSILKDMRL 362

Query: 364 EEDSLNAFHMEGLDASKKEEL----DISEVENIVAAEGINDTAKDEMEHLEYSSQTNESV 423
           EE+SL A   E  + + +  L    D  + +  V AEGI D A   +E+LE +S+  ++ 
Sbjct: 363 EEESLRALQEETEETNAEATLINGEDDIDHDIEVEAEGI-DVANQALENLEIASEAEDTF 422

Query: 424 DTSNI-DDISSDQSWMLRSLSESSVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWI 483
           + S+I DD SS+QSW LR+LSESSVAC+TGD+AMQNV+LQMGLRLLAPGGMQIRQLHRWI
Sbjct: 423 EASSIGDDGSSEQSWSLRALSESSVACITGDYAMQNVILQMGLRLLAPGGMQIRQLHRWI 482

Query: 484 LKCHACYNVTAEIGKIFCPKCGNGGTLRKVAVTVGENGVVLASRKPRITLRGTKFSLPLP 543
           LKCHACY VT EIG+IFCPKCGNGGTLRKVAVT+G NG ++A+ KPRITLRGT++S+P+P
Sbjct: 483 LKCHACYTVTPEIGRIFCPKCGNGGTLRKVAVTIGANGAIIAACKPRITLRGTQYSIPMP 542

Query: 544 QGGRDAITKNLVLREDQLPQKFLHPKTKKKVNKQGDDFFAVDDFFSHHNTDKRAPLQPPV 603
           +GGR+AITKNL+LREDQLPQK LHP+TKKK +K GD++F  DD F +H++D++APLQPPV
Sbjct: 543 KGGREAITKNLILREDQLPQKLLHPRTKKKASKPGDEYFVSDDVFLNHHSDRKAPLQPPV 602

Query: 604 RQALAVFSGKRNPNDNHYSRSHH 619
           R+A++VFS KRNPNDNHYSRS H
Sbjct: 603 RKAMSVFSQKRNPNDNHYSRSMH 602

BLAST of CmoCh02G015020 vs. NCBI nr
Match: gi|449452344|ref|XP_004143919.1| (PREDICTED: RNA-binding protein NOB1 [Cucumis sativus])

HSP 1 Score: 1085.1 bits (2805), Expect = 0.0e+00
Identity = 554/620 (89.35%), Positives = 582/620 (93.87%), Query Frame = 1

Query: 1   MESPYPASCWSNVVKSQPAPKPQHQTPTSTVQVFADSCKSSQGVAVAVVDANAIIQGGEK 60
           ME+P PASCWSNVVK+QPAPKPQHQTP+S+VQVFADSCKSS+GVAVAVVDANAIIQGG+K
Sbjct: 1   METPSPASCWSNVVKTQPAPKPQHQTPSSSVQVFADSCKSSKGVAVAVVDANAIIQGGDK 60

Query: 61  LSTCADKFVSVPEVLDEVRDPVSRHRLAFVPFTLESMDPSPEALNKVIKFARATGDLQTL 120
           LS+ ADKFVSVPEVLDE+RDPVSRHRLAFVPFTLESMDPSP+ALNKVIKFARATGDLQTL
Sbjct: 61  LSSSADKFVSVPEVLDEIRDPVSRHRLAFVPFTLESMDPSPDALNKVIKFARATGDLQTL 120

Query: 121 SDVDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDLPGWGSNVPNLEEWEAL 180
           SDVDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKD+PGWGSNVPNLEEWEAL
Sbjct: 121 SDVDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDMPGWGSNVPNLEEWEAL 180

Query: 181 EQDADAPLDTTSKILPLQDLNLNIVPSDGQSEDLSLEHKDEHNSEHPDETESGSRRSRRY 240
           EQDAD P   TSKILPLQDLNLNI+PSDGQSEDLSLEHKD+ N EH DETES SRRSRRY
Sbjct: 181 EQDADDPSRLTSKILPLQDLNLNIIPSDGQSEDLSLEHKDDDNLEHLDETESDSRRSRRY 240

Query: 241 PPKKKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRYLRRKARREYYEALAE 300
           PPKKKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRY RRKARREYYE+LAE
Sbjct: 241 PPKKKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRYHRRKARREYYESLAE 300

Query: 301 KDSQQDVETTDGDVLVENNRSGQSQDQISE-PITGNGNDCQIAEGTNNNENLSEILNQMR 360
           KDSQQDVETT+GD+ VE N SGQS+D+ISE P TGNGN+ QI EGTNNNEN+SEIL QMR
Sbjct: 301 KDSQQDVETTNGDIHVEFNGSGQSEDKISELPNTGNGNESQIGEGTNNNENISEILKQMR 360

Query: 361 LEEDSLNAFHMEGLDASKKEELDISEVENIVAAEGINDTAKDEMEHLEYSSQTNESVDTS 420
           LEEDSLNA HM    AS KE  D SE EN VA EG  D  KDEMEH+E +SQTNESVD S
Sbjct: 361 LEEDSLNALHM---SASTKEGSDESEGENAVAVEGTKDAEKDEMEHMEDASQTNESVDMS 420

Query: 421 NIDDISSDQSWMLRSLSESSVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWILKCH 480
           N+DD+SSDQSWMLRSLSESSVACVTGD+AMQNVLLQMGLRLLAPGGMQIRQLHRWILKCH
Sbjct: 421 NVDDVSSDQSWMLRSLSESSVACVTGDYAMQNVLLQMGLRLLAPGGMQIRQLHRWILKCH 480

Query: 481 ACYNVTAEIGKIFCPKCGNGGTLRKVAVTVGENGVVLASRKPRITLRGTKFSLPLPQGGR 540
           ACYNVTAEIG+IFCPKCGNGGTLRKVAVTVGENGVVLA+RKPRITLRGTKFSLPLPQGGR
Sbjct: 481 ACYNVTAEIGRIFCPKCGNGGTLRKVAVTVGENGVVLAARKPRITLRGTKFSLPLPQGGR 540

Query: 541 DAITKNLVLREDQLPQKFLHPKTKKKVNKQGDDFFAVDDFFSHHNTDKRAPLQPPVRQAL 600
           DAITKNLVLREDQLPQKFLHPKTKKKVNKQGD+FFAVDDFFSHHNTDKRAPLQPPVRQAL
Sbjct: 541 DAITKNLVLREDQLPQKFLHPKTKKKVNKQGDEFFAVDDFFSHHNTDKRAPLQPPVRQAL 600

Query: 601 AVFSGKRNPNDNHYSRSHHK 620
           AVFSGKRNPNDNHYSRS H+
Sbjct: 601 AVFSGKRNPNDNHYSRSKHR 617

BLAST of CmoCh02G015020 vs. NCBI nr
Match: gi|659073777|ref|XP_008437247.1| (PREDICTED: RNA-binding protein NOB1 [Cucumis melo])

HSP 1 Score: 1071.2 bits (2769), Expect = 6.7e-310
Identity = 551/620 (88.87%), Positives = 580/620 (93.55%), Query Frame = 1

Query: 1   MESPYPASCWSNVVKSQPAPKPQHQTPTSTVQVFADSCKSSQGVAVAVVDANAIIQGGEK 60
           MESP PASCWSNVVK+QPAPKPQHQ+PTS+VQVFADSCKSS+GVAVAVVDANAIIQGG+K
Sbjct: 1   MESPSPASCWSNVVKTQPAPKPQHQSPTSSVQVFADSCKSSKGVAVAVVDANAIIQGGDK 60

Query: 61  LSTCADKFVSVPEVLDEVRDPVSRHRLAFVPFTLESMDPSPEALNKVIKFARATGDLQTL 120
           LS+ ADKFVSVPEVLDE+RDPVSRHRLAFVPFTLESMDPSP+ALNKVIKFARATGDLQTL
Sbjct: 61  LSSSADKFVSVPEVLDEIRDPVSRHRLAFVPFTLESMDPSPDALNKVIKFARATGDLQTL 120

Query: 121 SDVDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDLPGWGSNVPNLEEWEAL 180
           SDVDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKD+PGWGSNVPNLEEWEAL
Sbjct: 121 SDVDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDMPGWGSNVPNLEEWEAL 180

Query: 181 EQDADAPLDTTSKILPLQDLNLNIVPSDGQSEDLSLEHKDEHNSEHPDETESGSRRSRRY 240
           EQDAD P   TSKILPLQDLNLN +PSDGQSEDLSLEHKD  NSEH DETES SRRSRRY
Sbjct: 181 EQDADDPSSLTSKILPLQDLNLNTIPSDGQSEDLSLEHKDADNSEHLDETESDSRRSRRY 240

Query: 241 PPKKKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRYLRRKARREYYEALAE 300
           PPKKKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRY RRKARREYYE+LAE
Sbjct: 241 PPKKKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRYNRRKARREYYESLAE 300

Query: 301 KDSQQDVETTDGDVLVENNRSGQSQDQISE-PITGNGNDCQIAEGTNNNENLSEILNQMR 360
           KD QQD+ETT+GD+ VE+N SGQS+D+ISE P TGNGN+ QI E   NNENLSEIL QMR
Sbjct: 301 KDIQQDIETTNGDIQVESNGSGQSEDKISELPNTGNGNESQIEEEMFNNENLSEILKQMR 360

Query: 361 LEEDSLNAFHMEGLDASKKEELDISEVENIVAAEGINDTAKDEMEHLEYSSQTNESVDTS 420
           LEEDSLNA HM    AS KE  + SE EN +A EGI D  KDEMEH+E +SQTNESVDTS
Sbjct: 361 LEEDSLNALHM---SASTKEGSE-SEGEN-MAVEGIKDAVKDEMEHMEDASQTNESVDTS 420

Query: 421 NIDDISSDQSWMLRSLSESSVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWILKCH 480
           N+DD+SSDQSWMLRSLSESSVACVTGD+AMQNVLLQMGLRLLAPGGMQIRQLHRWILKCH
Sbjct: 421 NVDDVSSDQSWMLRSLSESSVACVTGDYAMQNVLLQMGLRLLAPGGMQIRQLHRWILKCH 480

Query: 481 ACYNVTAEIGKIFCPKCGNGGTLRKVAVTVGENGVVLASRKPRITLRGTKFSLPLPQGGR 540
           ACYNVTAEIG+IFCPKCGNGGTLRKVAVTVGENGVVLA+RKPRITLRGTKFSLPLPQGGR
Sbjct: 481 ACYNVTAEIGRIFCPKCGNGGTLRKVAVTVGENGVVLAARKPRITLRGTKFSLPLPQGGR 540

Query: 541 DAITKNLVLREDQLPQKFLHPKTKKKVNKQGDDFFAVDDFFSHHNTDKRAPLQPPVRQAL 600
           DAITKNLVLREDQLPQKFLHPKTKKKVNKQGD+FFAVDDFFSHHNTDKRAPLQPPVRQAL
Sbjct: 541 DAITKNLVLREDQLPQKFLHPKTKKKVNKQGDEFFAVDDFFSHHNTDKRAPLQPPVRQAL 600

Query: 601 AVFSGKRNPNDNHYSRSHHK 620
           AVFSGKRNPNDNHYSRS H+
Sbjct: 601 AVFSGKRNPNDNHYSRSKHR 615
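
As a rough illustration of how the Identity and Positives counts relate to the alignment rows, the middle line of each block can be tallied directly: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below is not part of the database output. The function name `tally_midline` is hypothetical, and the sketch assumes the middle line has already been sliced to the same columns as the Query residues. Trailing spaces are often lost in plain-text copies, so the block length is taken from the query string.

```python
def tally_midline(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block."""
    # Pad the midline in case trailing spaces were stripped from the text.
    midline = midline.ljust(len(query))
    # A letter on the midline means query and subject residues are identical.
    identities = sum(ch.isalpha() for ch in midline)
    # Positives are identical residues plus '+' (conservative) columns.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last block of the Cucumis melo alignment (query positions 601..620).
query = "AVFSGKRNPNDNHYSRSHHK"
midline = "AVFSGKRNPNDNHYSRS H+"
ident, pos, length = tally_midline(query, midline)
print(f"Identity = {ident}/{length}, Positives = {pos}/{length}")
# prints "Identity = 18/20, Positives = 19/20"
```

In principle, per-block tallies like this can be summed over an HSP to cross-check the counts on its header line.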

BLAST of CmoCh02G015020 vs. NCBI nr
Match: gi|590678051|ref|XP_007040191.1| (RNA-binding protein nob1, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 832.0 bits (2148), Expect = 6.7e-238
Identity = 437/624 (70.03%), Positives = 515/624 (82.53%), Query Frame = 1

Query: 3   SPYPASCWSNVVKSQPAPKPQHQTPTS-TVQVFADSCKSSQGVAVAVVDANAIIQGGEKL 62
           +P PASCWSNV+KSQP PKPQ Q  T+ T Q+F +SCKS++G+AVAVVDANA+I+GGEKL
Sbjct: 10  NPNPASCWSNVLKSQP-PKPQTQKQTAATTQLFVESCKSTKGIAVAVVDANAVIEGGEKL 69

Query: 63  STCADKFVSVPEVLDEVRDPVSRHRLAFVPFTLESMDPSPEALNKVIKFARATGDLQTLS 122
           +  AD+FV+VPEVL E+RDPVSRHRLAF+PF+++SM+PS +ALNKVIKFARATGDLQTLS
Sbjct: 70  NNSADRFVTVPEVLAEIRDPVSRHRLAFIPFSIDSMEPSSDALNKVIKFARATGDLQTLS 129

Query: 123 DVDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDLPGWGSNVPNLEEWEALE 182
           DVD+KLIALTYTLE QIHGT H+R+ PPPVH+VN KRLPE+DLPGWGSNVPNL+EWEALE
Sbjct: 130 DVDLKLIALTYTLEAQIHGTNHIRDAPPPVHVVNVKRLPERDLPGWGSNVPNLDEWEALE 189

Query: 183 QDADAPLDTTSKILPLQDLNLNIVPSDGQSEDLSLEHKDEHNSEHPDETESGSRRSRRYP 242
           ++A+   ++ S+ILPL+DLN+N +PSD  SED S+E K E +SE+ ++ E G RR RRY 
Sbjct: 190 REAEGGTNSNSRILPLKDLNMNTLPSDNGSEDGSVEIKSETHSENQEDVEHGFRRPRRYL 249

Query: 243 PKKKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRYLRRKARREYYEALAEK 302
           P+KKE+ IEGKKMVADGIDASQGQ DDN  +W PAVSRST RRYLRRKARREYYEAL EK
Sbjct: 250 PQKKEVKIEGKKMVADGIDASQGQIDDNGDNWQPAVSRSTHRRYLRRKARREYYEALVEK 309

Query: 303 DSQQDVETTDGDVLVENNRSGQSQDQISEPITGNG--NDCQIAEGTNNNENLSEILNQMR 362
           D Q+D+E +              ++ + +  +GNG   + + AE    +E+LS IL QMR
Sbjct: 310 DCQEDMEKS------------MDKNNVEDAHSGNGILEETERAEEKKGDEDLSSILKQMR 369

Query: 363 LEEDSLNAFHMEGLDASKKEELDISEVENI-VAAEGIN-DTAKDEMEHLEYSSQTNESVD 422
           LEEDSL A         + EE++I+   N+ ++ EG   D   +E++ LE SSQTNE+VD
Sbjct: 370 LEEDSLEALQ-------EAEEVEITVEANVNLSVEGNKMDLVNEELDQLEMSSQTNETVD 429

Query: 423 TSNIDDISSDQSWMLRSLSESSVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWILK 482
            S  DD+S +QSWMLRSLSESSVACVTGDFAMQNV+LQMGLRLLAPGGMQIRQLHRWILK
Sbjct: 430 ASYTDDVSCEQSWMLRSLSESSVACVTGDFAMQNVILQMGLRLLAPGGMQIRQLHRWILK 489

Query: 483 CHACYNVTAEIGKIFCPKCGNGGTLRKVAVTVGENGVVLASRKPRITLRGTKFSLPLPQG 542
           CHACYNVTAEIG+IFCPKCGNGGTLRKVAVTVGENG+VLAS +PRI+LRGTKFSLPLPQG
Sbjct: 490 CHACYNVTAEIGRIFCPKCGNGGTLRKVAVTVGENGIVLASHRPRISLRGTKFSLPLPQG 549

Query: 543 GRDAITKNLVLREDQLPQKFLHPKTKKKVNKQGDD--FFAVDDFFSHHNTDKRAPLQPPV 602
           GRDAITKNL+LREDQLPQKFL+PKTKKKVNKQGDD  F  VD F   H+TDKRAPLQPPV
Sbjct: 550 GRDAITKNLILREDQLPQKFLYPKTKKKVNKQGDDDLFMGVDTF--THHTDKRAPLQPPV 609

Query: 603 RQALAVFSGKRNPNDNHYSRSHHK 620
           R+ALAVF+GKRNPNDNHYSRS HK
Sbjct: 610 RKALAVFTGKRNPNDNHYSRSKHK 611

BLAST of CmoCh02G015020 vs. NCBI nr
Match: gi|590678054|ref|XP_007040192.1| (RNA-binding protein nob1, putative isoform 2 [Theobroma cacao])

HSP 1 Score: 830.5 bits (2144), Expect = 2.0e-237
Identity = 436/623 (69.98%), Positives = 514/623 (82.50%), Query Frame = 1

Query: 3   SPYPASCWSNVVKSQPAPKPQHQTPTS-TVQVFADSCKSSQGVAVAVVDANAIIQGGEKL 62
           +P PASCWSNV+KSQP PKPQ Q  T+ T Q+F +SCKS++G+AVAVVDANA+I+GGEKL
Sbjct: 10  NPNPASCWSNVLKSQP-PKPQTQKQTAATTQLFVESCKSTKGIAVAVVDANAVIEGGEKL 69

Query: 63  STCADKFVSVPEVLDEVRDPVSRHRLAFVPFTLESMDPSPEALNKVIKFARATGDLQTLS 122
           +  AD+FV+VPEVL E+RDPVSRHRLAF+PF+++SM+PS +ALNKVIKFARATGDLQTLS
Sbjct: 70  NNSADRFVTVPEVLAEIRDPVSRHRLAFIPFSIDSMEPSSDALNKVIKFARATGDLQTLS 129

Query: 123 DVDIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDLPGWGSNVPNLEEWEALE 182
           DVD+KLIALTYTLE QIHGT H+R+ PPPVH+VN KRLPE+DLPGWGSNVPNL+EWEALE
Sbjct: 130 DVDLKLIALTYTLEAQIHGTNHIRDAPPPVHVVNVKRLPERDLPGWGSNVPNLDEWEALE 189

Query: 183 QDADAPLDTTSKILPLQDLNLNIVPSDGQSEDLSLEHKDEHNSEHPDETESGSRRSRRYP 242
           ++A+   ++ S+ILPL+DLN+N +PSD  SED S+E K E +SE+ ++ E G RR RRY 
Sbjct: 190 REAEGGTNSNSRILPLKDLNMNTLPSDNGSEDGSVEIKSETHSENQEDVEHGFRRPRRYL 249

Query: 243 PKKKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRYLRRKARREYYEALAEK 302
           P+KKE+ IEGKKMVADGIDASQGQ DDN  +W PAVSRST RRYLRRKARREYYEAL EK
Sbjct: 250 PQKKEVKIEGKKMVADGIDASQGQIDDNGDNWQPAVSRSTHRRYLRRKARREYYEALVEK 309

Query: 303 DSQQDVETTDGDVLVENNRSGQSQDQISEPITGNG--NDCQIAEGTNNNENLSEILNQMR 362
           D Q+D+E +              ++ + +  +GNG   + + AE    +E+LS IL QMR
Sbjct: 310 DCQEDMEKS------------MDKNNVEDAHSGNGILEETERAEEKKGDEDLSSILKQMR 369

Query: 363 LEEDSLNAFHMEGLDASKKEELDISEVENI-VAAEGIN-DTAKDEMEHLEYSSQTNESVD 422
           LEEDSL A         + EE++I+   N+ ++ EG   D   +E++ LE SSQTNE+VD
Sbjct: 370 LEEDSLEALQ-------EAEEVEITVEANVNLSVEGNKMDLVNEELDQLEMSSQTNETVD 429

Query: 423 TSNIDDISSDQSWMLRSLSESSVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWILK 482
            S  DD+S +QSWMLRSLSESSVACVTGDFAMQNV+LQMGLRLLAPGGMQIRQLHRWILK
Sbjct: 430 ASYTDDVSCEQSWMLRSLSESSVACVTGDFAMQNVILQMGLRLLAPGGMQIRQLHRWILK 489

Query: 483 CHACYNVTAEIGKIFCPKCGNGGTLRKVAVTVGENGVVLASRKPRITLRGTKFSLPLPQG 542
           CHACYNVTAEIG+IFCPKCGNGGTLRKVAVTVGENG+VLAS +PRI+LRGTKFSLPLPQG
Sbjct: 490 CHACYNVTAEIGRIFCPKCGNGGTLRKVAVTVGENGIVLASHRPRISLRGTKFSLPLPQG 549

Query: 543 GRDAITKNLVLREDQLPQKFLHPKTKKKVNKQGDD--FFAVDDFFSHHNTDKRAPLQPPV 602
           GRDAITKNL+LREDQLPQKFL+PKTKKKVNKQGDD  F  VD F   H+TDKRAPLQPPV
Sbjct: 550 GRDAITKNLILREDQLPQKFLYPKTKKKVNKQGDDDLFMGVDTF--THHTDKRAPLQPPV 609

Query: 603 RQALAVFSGKRNPNDNHYSRSHH 619
           R+ALAVF+GKRNPNDNHYSRS H
Sbjct: 610 RKALAVFTGKRNPNDNHYSRSKH 610

BLAST of CmoCh02G015020 vs. NCBI nr
Match: gi|703071846|ref|XP_010089131.1| (hypothetical protein L484_024306 [Morus notabilis])

HSP 1 Score: 812.4 bits (2097), Expect = 5.5e-232
Identity = 416/618 (67.31%), Positives = 496/618 (80.26%), Query Frame = 1

Query: 4   PYPASCWSNVVKSQPAPKPQHQTPTSTVQVFADSCKSSQGVAVAVVDANAIIQGGEKLST 63
           P PA CWSN++K+Q APKPQ+ +P+ TV VFA+SCKS++G+AVAVVDANAII GGE+LS 
Sbjct: 9   PSPAPCWSNLLKNQTAPKPQNPSPSPTVGVFAESCKSTKGIAVAVVDANAIIDGGERLSQ 68

Query: 64  CADKFVSVPEVLDEVRDPVSRHRLAFVPFTLESMDPSPEALNKVIKFARATGDLQTLSDV 123
           CADKFVSVPEV+DEVRDPVSRHRLAF+PF+++S++PSPE+LNKVIKFARATGDLQTLSDV
Sbjct: 69  CADKFVSVPEVMDEVRDPVSRHRLAFIPFSVQSIEPSPESLNKVIKFARATGDLQTLSDV 128

Query: 124 DIKLIALTYTLETQIHGTKHLRECPPPVHMVNTKRLPEKDLPGWGSNVPNLEEWEALEQD 183
           D+KLIALTYTLE QIHGT+HLRECPPP+H VN KRLPEKD+PGWGSNVPNLEEWEALE  
Sbjct: 129 DLKLIALTYTLEAQIHGTEHLRECPPPIHTVNVKRLPEKDMPGWGSNVPNLEEWEALEHQ 188

Query: 184 ADAPLDTTSKILPLQDLNLNIVPSDGQSEDLSLEHKDEHNSEHPDETESGSRRSRRYPPK 243
           A+   D  S+ILPL+DLNLN+V   G           EH +   D+ E    R RRYPPK
Sbjct: 189 AEDKPDEDSRILPLKDLNLNVVSDAGS----------EHQT---DDGEENVGRPRRYPPK 248

Query: 244 KKEINIEGKKMVADGIDASQGQYDDNEGDWTPAVSRSTQRRYLRRKARREYYEALAEKDS 303
           KKEINIEGKKMV DGIDAS+G++D +EGDW PAVSRST RR+LRRKARR+YYE+L+EKD 
Sbjct: 249 KKEINIEGKKMVTDGIDASRGEFDGDEGDWLPAVSRSTHRRFLRRKARRDYYESLSEKDG 308

Query: 304 QQDVETTDGDVLVENNRSGQSQDQISEPITGNGNDCQIAEGTNNNENLSEILNQMRLEED 363
                 ++ + + E+   GQ ++   E   G   + ++ EG N+ ENLS IL+QMRLEED
Sbjct: 309 -----FSEKNEMTEDKNDGQ-ENGTKEEKNGGVEENEVREGKNDEENLSTILHQMRLEED 368

Query: 364 SLNAFHMEGLDASKKEELDISEVENIVAAEGIN-DTAKDEMEHLEYSSQTNESVDTSNI- 423
            +    ++GL             E    AEG   D   +  +HLE SS+ N+ +D SN+ 
Sbjct: 369 LVEEDLVKGLQ------------EGNANAEGDQTDMVSEGHDHLEVSSEINDCIDASNVE 428

Query: 424 DDISSDQSWMLRSLSESSVACVTGDFAMQNVLLQMGLRLLAPGGMQIRQLHRWILKCHAC 483
           DD SS+ SWMLRSLSESSVAC+T DFAMQNV+LQMGLRL+APGGMQIRQLHRW+L+CHAC
Sbjct: 429 DDASSEHSWMLRSLSESSVACITSDFAMQNVILQMGLRLVAPGGMQIRQLHRWVLRCHAC 488

Query: 484 YNVTAEIGKIFCPKCGNGGTLRKVAVTVGENGVVLASRKPRITLRGTKFSLPLPQGGRDA 543
           Y VTAEIG+IFCPKCGNGGTLRKVAVTVGENG+ LA+R+PRITLRGTKFSLPLPQGGRDA
Sbjct: 489 YTVTAEIGRIFCPKCGNGGTLRKVAVTVGENGITLAARRPRITLRGTKFSLPLPQGGRDA 548

Query: 544 ITKNLVLREDQLPQKFLHPKTKKKVNKQGDDFFAVDDFFSHHNTDKRAPLQPPVRQALAV 603
           ++KN++LREDQLPQKFL+PKTKKK  KQGDD++  DD FSHH++ K+AP QPPVR+ALAV
Sbjct: 549 VSKNVILREDQLPQKFLYPKTKKKSTKQGDDYYVSDDIFSHHHSHKKAPFQPPVRKALAV 595

Query: 604 FSGKRNPNDNHYSRSHHK 620
           FSGKRNPNDNHY+RS HK
Sbjct: 609 FSGKRNPNDNHYTRSKHK 595

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
NOB1_MACFA    1.9e-27    42.05       RNA-binding protein NOB1 OS=Macaca fascicularis GN=NOB1 PE=2 SV=1
NOB1_HUMAN    2.5e-27    42.05       RNA-binding protein NOB1 OS=Homo sapiens GN=NOB1 PE=1 SV=1
NOB1_RAT      4.8e-26    37.33       RNA-binding protein NOB1 OS=Rattus norvegicus GN=Nob1 PE=2 SV=1
NOB1_PONAB    8.2e-26    35.37       RNA-binding protein NOB1 OS=Pongo abelii GN=NOB1 PE=2 SV=1
NOB1_BOVIN    1.1e-25    35.44       RNA-binding protein NOB1 OS=Bos taurus GN=NOB1 PE=2 SV=1

Match Name          E-value     Identity    Description
A0A0A0KML5_CUCSA    0.0e+00     89.35       Uncharacterized protein OS=Cucumis sativus GN=Csa_5G154760 PE=4 SV=1
A0A061G6A8_THECC    4.7e-238    70.03       RNA-binding protein nob1, putative isoform 1 OS=Theobroma cacao GN=TCM_016231 PE...
A0A061G5P4_THECC    1.4e-237    69.98       RNA-binding protein nob1, putative isoform 2 OS=Theobroma cacao GN=TCM_016231 PE...
W9QSA3_9ROSA        3.8e-232    67.31       Uncharacterized protein OS=Morus notabilis GN=L484_024306 PE=4 SV=1
A0A0B0Q0J6_GOSAR    2.8e-230    68.64       RNA-binding NOB1 OS=Gossypium arboreum GN=F383_14923 PE=4 SV=1

Match Name     E-value     Identity    Description
AT5G41190.1    1.1e-201    58.91       Nin one binding (NOB1) Zn-ribbon like (InterPro:IPR014881), D-site 2...

Match Name                          E-value     Identity    Description
gi|449452344|ref|XP_004143919.1|    0.0e+00     89.35       PREDICTED: RNA-binding protein NOB1 [Cucumis sativus]
gi|659073777|ref|XP_008437247.1|    6.7e-310    88.87       PREDICTED: RNA-binding protein NOB1 [Cucumis melo]
gi|590678051|ref|XP_007040191.1|    6.7e-238    70.03       RNA-binding protein nob1, putative isoform 1 [Theobroma cacao]
gi|590678054|ref|XP_007040192.1|    2.0e-237    69.98       RNA-binding protein nob1, putative isoform 2 [Theobroma cacao]
gi|703071846|ref|XP_010089131.1|    5.5e-232    67.31       hypothetical protein L484_024306 [Morus notabilis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR014881    NOB1_Zn-bd
IPR017117    Nob1_euk
Vocabulary: Biological Process
Term          Definition
GO:0000469    cleavage involved in rRNA processing
GO:0042274    ribosomal small subunit biogenesis
Vocabulary: Molecular Function
Term          Definition
GO:0004521    endoribonuclease activity
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0000469        cleavage involved in rRNA processing
biological_process    GO:0009553        embryo sac development
biological_process    GO:0009555        pollen development
biological_process    GO:0051252        regulation of RNA metabolic process
biological_process    GO:0042274        ribosomal small subunit biogenesis
biological_process    GO:0090502        RNA phosphodiester bond hydrolysis, endonucleolytic
biological_process    GO:0008150        biological_process
cellular_component    GO:0005737        cytoplasm
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0004521        endoribonuclease activity
molecular_function    GO:0046872        metal ion binding
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmoCh02G015020.1    CmoCh02G015020.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                         Source   Source Term  Source Description        Alignment
IPR014881  Nin one binding (NOB1) Zn-ribbon-like   PFAM     PF08772      NOB1_Zn_bind              coord: 468..538, score: 3.0
IPR014881  Nin one binding (NOB1) Zn-ribbon-like   unknown  SSF144206    NOB1 zinc finger-like     coord: 468..531, score: 1.28
IPR017117  Ribonuclease Nob1, eukaryote            PIR      PIRSF037125  Nob1                      coord: 26..619, score: 3.7E
None       No IPR available                        unknown  Coil         Coil                      coord: 341..361, scor
None       No IPR available                        PANTHER  PTHR12814    RNA-BINDING PROTEIN NOB1  coord: 1..179, score: 4.5E-153; coord: 254..619, score: 4.5E