CmaCh02G005760 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh02G005760
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Heterogeneous nuclear ribonucleoprotein
Location: Cma_Chr02: 3311271 .. 3319019 (-)
RNA-Seq Expression: CmaCh02G005760
Synteny: CmaCh02G005760
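
The gene span can be derived from the Location field. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state explicitly). `parse_location` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
    return {"chrom": m.group(1), "start": start, "end": end,
            "strand": m.group(4), "length": end - start + 1}

loc = parse_location("Cma_Chr02: 3311271 .. 3319019 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # Cma_Chr02 - 7749
```

Under that assumption the gene, introns included, spans 7749 bp on the minus strand.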
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGAGCGAGCGAGCGAGCGAGCGAGGAAGAACAGGAGGAGAGTGAGATATTCTTTTTTCACCTTTAGGGTTCAATTTCATTGCGTTACCTGCCCAAATTCATCATGGATCGGAAGCTTGTGGTTAGTTTTTTCCTTTTCATTGTTTCTTCTATCAAGTAATTTGGTAAAGGGTTCGATTCGTTTTACTTATTTATTTCGTGAAATTTAGGTTCTCGGCATTCCATGGGACATCGATACAGAAGGGCTAAGAGACTACATGAGCAAATTTGGAGAGTTGGAGGATTGTATTGTCATGAAGGTTTGCCTTTTCCCCTATTCGTATGATGCATTCGTACTATCCGTTCTTTCTTATGGGTTTTTAAGAAAGCATGCCTTTGTCATGAATTCAGTGATTTTAAAAGTGTGAGGTGCACTGGAGCGTAATAACTTCAAAATAAAAAAGATTTGTGAACACTTTATATAATACATGAAAATGAAGATTGTTATGAATAAGAACAGAATGAAAATTTGAAGAATTAAAAGCTTTTATTAGAGATTGTGGTGGAAAAATGAAGGTGTTAAGAATGCTTTTATTTTCTTCCTAAAACATTATTAAAGAAATTAGGGCTTCAAGCATCGAGGTGCCACCTCGGCTATTGATGCTCGTGCCTCAAGCTGGCCTTATCAAGGGTGCCTAGCTGTTTTGAAGCAATACACCGAGTGTGGGCTTTTCTGTGTGCCATGGCATGCTTGATCTATTGGCGTTTAGGTTGTAGGAATGTATGTGCTGTTTATGTTTGGATGTGCAGAAATCCATATCTTGCCATCTCTTGTTCAGACCCAAGCACAAACCAAAAATATCTTACAATCTTCATTAGGTACTCCAAGTAAAAAGACAACTTCAGAAAGTCGGATCCAAATCTTTTGATGTTTGCACACAAAAACACACTGTATATGGCCTAGTGTGTCTCTTTTGGTACTCCAAGTAAAAAGACAACCCCAGAAAGTATGATCCAAATATTTTGATGCTTGGACACGAAAACACACAGTAGATGGCCTAATGTGTCTCTTTAGGTCGTTTATAAGAAATTTTGTACTATTCGGTAGGCCTTGTTCTCTTTGATTGAAGCCCTGTTTTGTATAGCTTGTTTGGATTTGAATTGTTCTCTTGTTGGGCTATGGTTTTAGACACCCCTCTTACATTCTTGTTTGTTTTGTCAACGAAAGTTGGGTTTCAATAATTTTTTCTAATATATTAATTTATCACCTTTGTTTTCCCTGGTGGATTGTACATATGAAGACAGTGGATGCTTGTTGTGGCTAATTTTGTATAGATACATTTTTATTTTCAAGTCCTTTTTTGGTAGTCATATGCTTTTTTTTTTTTTTTTTTTTCCCTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAGTCATATGTTTTTTTTTTTTTTTTTTTTCCCTTATATGTCAGGAGAGGTCAACTGGTCGGTCTCGTGGTTTTGGGTATGTGACCTTTGCAACCGATGAAGATGCCAAGGTAAGACTATCTTCAATACCTTCACGTTTCGTTAGACTTGATGCCTGATGCCTTACACCTCTTTGTTGCAGAATGCGCTATCTAATGAGCACTTCCTTGGCAATAGAATGCTGGAAGTTAAAGTTGCCACACCAAAGGTTAGTTTCTCTTTTTCTCTCTTTCCCTTTTTTAT
TTGCCTTCCTTGCATACATTTTCCTTGTCTAGAATTCCCTTCATTTTTATCTTCTTTTTGAATAAATAAATCATTAAAAGTGTGAGTACTGTGTTTCGATTTGTATGCATACATATTTGTATCATATGCTTCGAATTGAAAAGCATTTCCTGGCAATGATCATATTTTCAGGAAATATATTGTATCATATTTGTATCATATTTTCTGTTGACAGGAAGAGATGAGAGCACCTCCAAAGAGAGTTACGAGGATATTTGTAGCAAGGATTTCACAATCTGTGACTGAGGCTGCCTTCCGCAGGTAGTTTGGTGTGATTTTATGCCAGATTCTTGTGCACAATCAATATATATCAGACATTAGTTGAACACGATGGCTATACGTTGGACACTTCTTAGGCACATGTTAAATCCTTGTTAAGCATAATAAATAGGTCTTATTTCACTTGATAAGGGCTGTTCCTTTTGTGGATTCGGTTTTATTATATGCCTTTTTATTCTTTCATTTTCTCTCAATGAAAGCTTGATTCTTTATTAAAACAAACGTTCTTAAAAAAGAAAAGAAAAGAAAAGAAAAATAGTAATATAGGACAAAACAATTTACTGAGTTAATTTATCAATTTTTTTTTCCAAGCATACACATAGGCTTACTGACTTGAGATTTGCTGATGGTATACAATATTCATTTAAAAAGTGTATATTTTAATTAGTGTATTCTTGTCACGTTTGTGTCCTGAACTTTTTAGAAAATTCCGTCTTACTGTGTCCATGTTGTGTTGTGTGCGTCTCTCCTGTCCATGCTAGAAATTTGATTGGGATTTATCGGTGAAGGCTGAAGAGGAATTGGGAAGAGAACAGGCAACAAAGCTGGAAGTAGGAAAAAAGATTGAATCTGGCCTAAAGAATTCAGGCTATAGATCATTTAAGGAATTCTGTGGAGAACTGAGAAAGTACAAACCTCTTAATGGAACTTTGTCGCCAGAAACTTTTTGAGCTTCAAATCTGACGTGGAATTATCAGTGAATAGGAGGCAGAGACTAATTTAGAAGAGGATGGGAAAAAAGATGAAAGAAGAAAAATATTGAATCACACCAAGAGTATGCAGGTTATAGAACGTCGAAGGAAGTTGTGGAGGCATGAGAGAAATTGAGAACTTCTCAATGAAACTTCGTAGCAACTTGAGAGATAAAGAACTTTGAGGTGGCAGTTGAATGACTATTTGTTAATTTTCTATTCACATCATTACATGTGGGAATGACAGAATGATAATCCATAAAGTCTAAGGAATGTATGCGTGAGGAAAGAGTGAGGATTGATTTTTTAATTAGTTAGTTAAAGGGGGATGCATCGATGTGTTACGAATTTGTGGGGCTGACATAATAGGAAAGAGGGAGTGCTCAAGGGAAAGAAGGGAGCTAGGAGAGGTACGCTTGCTTGGTCGGCAAGTTGGGTGTACTTTGGAGCGAGTTCGCTTGATTGTGTAACATATTGATAAATAAAGCTAGCTCCCATGCTTGTATAGCTTATCAAGTATCGAGCAAGAAGCTACACTTTATGGCTTATGTTTTCCCGTCCTCTGACCCTTCCTCCTTGCTGTCCTTGAGTTTTTTCCGCAATTTCTCTAAAAGAGAGGCTTTGGATGTTGTGCGCCTTCTTTTGATTCCTTAGGGGCGATATGTTCTTCGCGGGGGGAGGGACACTAGGTTCTAGTCCCTGGATCCATGTAGGGGCTTCTCTTGTAGCTCTTTCTATTGGTTCAGTCTGGCCTGTTCTCCCTCTTTCTGTGTTTGCTTTCTCCCTGGTCTGGAAAGTGCAAGTCGCTTAGAGGGTAAAATATTTGTGTGGCAGATTTTACACGAGAGAATCAACACTTGGACCTGTTCCTAGGTGCATTCTATATAGGTGTCAGACTGAGTATTTGTATCATCTGTTGTGGGTTGCTCGTTTGTCCGAATTTTTTGGGCTCATTAGTTAGATTCTTATAATTTATGTTGGGTTTGACAG
AGGAGGTTCTCCTGGGCTTACTTTTTCGTGAGAAGGACACTGTTTTGTGGCATGCCAGTTTCTTTTGCCATGCTATGGGGGGTTTGGCTGGAGAGGAATTTTAGAATTTTTAGAGAGGTGTGGGTGTCGGTCACTTGGCCTTTTTGTAATTATGATCTTGGTTTGATTTTGTTGGATTGAAGTCCCTTTGCATAGTTTATGAGACGTTCTTTTGTGGGTCTTATTTTATCTATACCCATATATTTTCTTTTATTTCTCAAAGAAAGTGCGGTTTCTTCTACAAAAAATAAATAAATCCATCTCATATCACAAACTTTTGTGTGTCCTTTCTGTTTCAAATGGGGATGTTGCAGGGACAAACATTTTATAATTTAATTTGGGGATGATCCGTGCTTGCATCTGCCCCCCAAATGTGTCCCAAATTATTACCATTATTCTATTGACCTATTAACCTTTTCCATTGACAGACGAATAGGAAGTTTCATATTTTCATTAAATGAATTATTTAAACTAATAAATAATAAAATTAATTTTATTGCCTAGAAAAAGTAGGTACCTGCCAGGAAACCTGTTCCTTCAGGGTTTTTAGACATCTCTTAAGAGTCAGAGTCTGTTGACTCAATCCACAAGGACTTTGGTAAAGCTTTCAAGCTATATACCCAAATGTAACTTTGTAAGAATTTCATAATCTGATGGTTATCGGAGAGAAGATTGTTGGTATCCTTCGGATGGCACTAAGGCCTTTTAAGAATCCACTAAGACACTATTCAAGCATCTCGTAGAAGAGTTGCAACGTTTGTCTTAAGAAGTTGATTCATGTTTTTTATTTATGACTTGATGATCTTTTTTTGTTATTTGGTTTACAAATTCTTGATGGAAATGAACTTCATGTTTGTTGCAGTCATTTTGAGAAATATGGAGAAATAACAGATCTTTACATGCCGAAGGTACGTTTTCCTCTCACTAAACTAGGAAATTAGTTCTGTGGTTCTACAACCAAGTGTAAATCAGGAAGGGGATTCTCTATTCCTATTTTCACTTATCAGGTGTACATGTACTACGTGAATTGGAAAATATGGGTCACCCATGTTTTTGTGCAGTGTATATTAACATCTTTTAACATTTGTTAATTATTCAGGACCAGGGCTCAAAAATCCATCGTGGAATAGGTTTCATCACTTTTGCAAGTTCAGGTTGGATCATCTTTTTATATTGATTTAAGATAATGATTGCAGAGTTCAAGTTGGCGACTTTTGGCTTGTTTCTTCAATTTCTGAAAATATTCCGTCCAAATACGGATCCTTTTAATTGTTTACTAGTGCAGTAGTTGAGTTTTTCTACACTGTTCAAAGATTTCCTGATACATAAGTACAAAGATTGGTTTATGGGAAAACATTTTCTCTCTGCTACCACTCTCAAACAACTCACTCTCTTTCATATCATCAACAAGCATAGGTAACACATCACAAAGAATCTCATTTTTATACTTTTCCAAAATAAAAGATATTAAAGCTTGAGAAACTATCTTCATTTTTCCACTATATTGAGTCATTGAAACTTATAAGGTTTAGGATGGTGTTGGGTTGGAACTTGCAACATTTTGTGGTTGCCGATTTCCAGGCACTAAATGTAGCAACTACTATCTTGTAGACTCTGTAGAGAATTTGATGGCTGAAACTCATGAATTAGAAGGTTCTGCAGTGGTTGTTGACAGAGCGACCCCCAAGGCAAGATTTTGCTGTTATCTTTATAACACCATATTTACCTTTTGATGGATTCTTTTTTATTTTTTTTTTTTTTGAAGGCCAAAGGATTCTTTTTAGTTTGTAATGATTTTGTTTGTCTAATTGAGGTCAATATACTATTTTTCCAGGACGATGATTTCAGGCCCATAGGGAAAATGCCGCGAGGGGGTCGTGGTGGTGGTGGTGGTGGTGGTGGATATGGTGCATACAATGCTTATATTTCTGCAGCAACAAGATATGCAGCACTTGGTGCTC
CTACCTTATACGACCATCCAGTCTCAGTTTATGGTGGGAGTATGTATCTCTGATAGTATCCTTGTTCATGTGATGCTTGGTCTTGGTGAATATTGTTATGATGATTCCCTGTGTCATTGGATCAGGAAGGGAATTTCGAGGGATGGGAAAAAAGATTTTTATTGGTAGGCTCCCACAAGAAGCATCTGCTGATGATCTCCGCCAATACTTTGGTAGATTTGGCCGTATATTGGATGTCTATGTTCCCAAGGTAAAAGTAGATTTAGTTAATGCTCTACTTTTTATGAGTGACTGAATTGAGATAGCATAACGGTTCAGTTGTTAATGACGAATGTAGGATCCTAAAAGATCTGGTCATCGAGGTTTTGGTTTTGTAACTTTTGCTGAAGATGGTGTAGCAGATCGTGTATCTCGAAGATCTCACGAGATTTGTGGACAACAGGTATTCTACTGCATGTCACTTTCCTTACTAGAAGGGCATTGCCAGCGACTGTTCTTTCTGCCTTTTGCGAGTAGTGGTTACTAGTTATTAGTTATATGATTATTACGCAAGCTTCAATCTTGGGTTATGAATCTTATCTCACAATTGGAATCTGCAGGTTGCTATCGATTCAGCCACTCCTCTTGATGATGCTGGAGCAGCTGGAGCAAGTGGAACTTTTGTGATGAATAGCGCGGCAGAATCTTTTGGGAGCTATGGTGGACCTATGAGGACTTATGGAAGGATGTATGGAAGCCTGGACTTTGATGATGTAAGTGGCTCTGTTTCTATTTGAATTATATTCAAGTTGGTTAAAATTTAGGTAGTGATTGATAGGTTGTCCACCACATTTAGCGAGCTTTACATTTACTGGTGCCTGCCTCTCAGACCCTCCCAAGGGTCCTACAATTAGGATTATAAACATTTTGATTAAAGATTCTTTAGAAATTATTTTAAGGTTTGGTTTCTGAACTTTGAAAGTTTTTTATGATAAGCATTTTTTGGCAAAGTACTTTTGAGTGCTTTTGATGAGTCAAAAGTCAAAAGCCAAAAGCATTTAAAAAGGTTACTTCAACATGAGTTGTTTTCTCTTTTCAGCTTTTTAAAGATTAATGGGTTTATATTGTGCAGTGGGGTTATGGAGTTGGTGGTGGAAGACCATCTAGAGCAGACTGGAGGTATCGGCCATACTGATGAGAGACAAACCAGCATTGAATTAGTAGTGATGGCTGATGGCTTGATAAATCAACGGCATGAATTGCCATATTTATGGGGAAGATATTTGAGGTTTCAGCCTTTGTAGAGTCGATATCTTTCAATGTCTGATTATTTGGTTTCTTTTGGTTTATTGTTTTTCAACTGTCTCAATACCATTCCTTAGCTGAAACATGGATCAATTGGGGTCGTCGAGGTGTGGAGTCGGGGTGTGGAATCGGGATCGGGACCAGGAGGGTATGTCGGCTGTATGAATGTTTGGTTAGAGTAGTTCTTAGTGAGTTATATTCTTCTCTAACTCTACTTTAAGAATATCCATTTTATAACTTCTCTAGGTTAGTTTGGTTATCATCGAACAATTGATTGACTACTGAACGCTGGATGGCCTTTTTTTAGTTATGCGTTCTTTTTCTTTGCATCAGTCCTACAATGTTATCGGATCCTTCCAATTAGACTAAATCCATGGTCAAATAAGAACACTCCCTCAAGAACAAAATTGTATTGCAATAGAAGTGGCTGATTCACTTAAACCTATTACGTCATTTAATTTCAAC

mRNA sequence

CGAGCGAGCGAGCGAGCGAGCGAGGAAGAACAGGAGGAGAGTGAGATATTCTTTTTTCACCTTTAGGGTTCAATTTCATTGCGTTACCTGCCCAAATTCATCATGGATCGGAAGCTTGTGGTTCTCGGCATTCCATGGGACATCGATACAGAAGGGCTAAGAGACTACATGAGCAAATTTGGAGAGTTGGAGGATTGTATTGTCATGAAGGAGAGGTCAACTGGTCGGTCTCGTGGTTTTGGGTATGTGACCTTTGCAACCGATGAAGATGCCAAGAATGCGCTATCTAATGAGCACTTCCTTGGCAATAGAATGCTGGAAGTTAAAGTTGCCACACCAAAGGCAACAAAGCTGGAAGTAGGAAAAAAGATTGAATCTGGCCTAAAGAATTCAGGCTATAGATCATTTAAGGAATTCTGTGGAGAACTGAGAAATCATTTTGAGAAATATGGAGAAATAACAGATCTTTACATGCCGAAGGACCAGGGCTCAAAAATCCATCGTGGAATAGGTTTCATCACTTTTGCAAGTTCAGAGAATTTGATGGCTGAAACTCATGAATTAGAAGGTTCTGCAGTGGACGATGATTTCAGGCCCATAGGGAAAATGCCGCGAGGGGGTCGTGGTGGTGGTGGTGGTGGTGGTGGATATGGTGCATACAATGCTTATATTTCTGCAGCAACAAGATATGCAGCACTTGGTGCTCCTACCTTATACGACCATCCAGTCTCAGTTTATGGTGGGAGAAGGGAATTTCGAGGGATGGGAAAAAAGATTTTTATTGGTAGGCTCCCACAAGAAGCATCTGCTGATGATCTCCGCCAATACTTTGGTAGATTTGGCCGTATATTGGATGTCTATGTTCCCAAGGATCCTAAAAGATCTGGTCATCGAGGTTTTGGTTTTGTAACTTTTGCTGAAGATGGTGTAGCAGATCGTGTATCTCGAAGATCTCACGAGATTTGTGGACAACAGGTTGCTATCGATTCAGCCACTCCTCTTGATGATGCTGGAGCAGCTGGAGCAAGTGGAACTTTTGTGATGAATAGCGCGGCAGAATCTTTTGGGAGCTATGGTGGACCTATGAGGACTTATGGAAGGATGTATGGAAGCCTGGACTTTGATGATTGGGGTTATGGAGTTGGTGGTGGAAGACCATCTAGAGCAGACTGGAGGTATCGGCCATACTGATGAGAGACAAACCAGCATTGAATTAGTAGTGATGGCTGATGGCTTGATAAATCAACGGCATGAATTGCCATATTTATGGGGAAGATATTTGAGGTTTCAGCCTTTGTAGAGTCGATATCTTTCAATGTCTGATTATTTGGTTTCTTTTGGTTTATTGTTTTTCAACTGTCTCAATACCATTCCTTAGCTGAAACATGGATCAATTGGGGTCGTCGAGGTGTGGAGTCGGGGTGTGGAATCGGGATCGGGACCAGGAGGGTATGTCGGCTGTATGAATGTTTGGTTAGAGTAGTTCTTAGTGAGTTATATTCTTCTCTAACTCTACTTTAAGAATATCCATTTTATAACTTCTCTAGGTTAGTTTGGTTATCATCGAACAATTGATTGACTACTGAACGCTGGATGGCCTTTTTTTAGTTATGCGTTCTTTTTCTTTGCATCAGTCCTACAATGTTATCGGATCCTTCCAATTAGACTAAATCCATGGTCAAATAAGAACACTCCCTCAAGAACAAAATTGTATTGCAATAGAAGTGGCTGATTCACTTAAACCTATTACGTCATTTAATTTCAAC

Coding sequence (CDS)

ATGGATCGGAAGCTTGTGGTTCTCGGCATTCCATGGGACATCGATACAGAAGGGCTAAGAGACTACATGAGCAAATTTGGAGAGTTGGAGGATTGTATTGTCATGAAGGAGAGGTCAACTGGTCGGTCTCGTGGTTTTGGGTATGTGACCTTTGCAACCGATGAAGATGCCAAGAATGCGCTATCTAATGAGCACTTCCTTGGCAATAGAATGCTGGAAGTTAAAGTTGCCACACCAAAGGCAACAAAGCTGGAAGTAGGAAAAAAGATTGAATCTGGCCTAAAGAATTCAGGCTATAGATCATTTAAGGAATTCTGTGGAGAACTGAGAAATCATTTTGAGAAATATGGAGAAATAACAGATCTTTACATGCCGAAGGACCAGGGCTCAAAAATCCATCGTGGAATAGGTTTCATCACTTTTGCAAGTTCAGAGAATTTGATGGCTGAAACTCATGAATTAGAAGGTTCTGCAGTGGACGATGATTTCAGGCCCATAGGGAAAATGCCGCGAGGGGGTCGTGGTGGTGGTGGTGGTGGTGGTGGATATGGTGCATACAATGCTTATATTTCTGCAGCAACAAGATATGCAGCACTTGGTGCTCCTACCTTATACGACCATCCAGTCTCAGTTTATGGTGGGAGAAGGGAATTTCGAGGGATGGGAAAAAAGATTTTTATTGGTAGGCTCCCACAAGAAGCATCTGCTGATGATCTCCGCCAATACTTTGGTAGATTTGGCCGTATATTGGATGTCTATGTTCCCAAGGATCCTAAAAGATCTGGTCATCGAGGTTTTGGTTTTGTAACTTTTGCTGAAGATGGTGTAGCAGATCGTGTATCTCGAAGATCTCACGAGATTTGTGGACAACAGGTTGCTATCGATTCAGCCACTCCTCTTGATGATGCTGGAGCAGCTGGAGCAAGTGGAACTTTTGTGATGAATAGCGCGGCAGAATCTTTTGGGAGCTATGGTGGACCTATGAGGACTTATGGAAGGATGTATGGAAGCCTGGACTTTGATGATTGGGGTTATGGAGTTGGTGGTGGAAGACCATCTAGAGCAGACTGGAGGTATCGGCCATACTGA

Protein sequence

MDRKLVVLGIPWDIDTEGLRDYMSKFGELEDCIVMKERSTGRSRGFGYVTFATDEDAKNALSNEHFLGNRMLEVKVATPKATKLEVGKKIESGLKNSGYRSFKEFCGELRNHFEKYGEITDLYMPKDQGSKIHRGIGFITFASSENLMAETHELEGSAVDDDFRPIGKMPRGGRGGGGGGGGYGAYNAYISAATRYAALGAPTLYDHPVSVYGGRREFRGMGKKIFIGRLPQEASADDLRQYFGRFGRILDVYVPKDPKRSGHRGFGFVTFAEDGVADRVSRRSHEICGQQVAIDSATPLDDAGAAGASGTFVMNSAAESFGSYGGPMRTYGRMYGSLDFDDWGYGVGGGRPSRADWRYRPY
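
The protein sequence above corresponds to a translation of the CDS. The sketch below assumes the standard genetic code (the page does not say which table was used) and translates only the first ten and last five codons of the CDS, copied from the record. `translate` is an illustrative helper, not a database function.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGATCGGAAGCTTGTGGTTCTCGGCATT"))  # MDRKLVVLGI
print(translate("TATCGGCCATACTGA"))                 # YRPY
```

Both fragments agree with the start (MDRKLVVLGI) and the end (YRPY) of the protein sequence listed above.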
Homology
BLAST of CmaCh02G005760 vs. ExPASy Swiss-Prot
Match: O94432 (Uncharacterized RNA-binding protein C660.15 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPBC660.15 PE=4 SV=1)

HSP 1 Score: 99.4 bits (246), Expect = 9.0e-20
Identity = 47/148 (31.76%), Positives = 79/148 (53.38%), Query Frame = 0

Query: 2   DRKLVVLGIPWDIDTEGLRDYMSKFGELEDCIVMKERSTGRSRGFGYVTFATDEDAKNAL 61
           D K+ + G+ W+   + LRDY  +FGE+ DC VM++ +TGRSRGFG++TF   +     +
Sbjct: 162 DGKMFIGGLNWETTDDSLRDYFEQFGEVLDCTVMRDSTTGRSRGFGFLTFKNPKCVNEVM 221

Query: 62  SNEHFLGNRMLEVKVATPKATKLEVGKKIESGLKNSGYRSFKEFCGELRNHFEKYGEITD 121
           S EH L  ++++ K A P+  + +  K    G+             E RN F ++G + D
Sbjct: 222 SKEHHLDGKIIDPKRAIPREEQEKTAKMFVGGVPGDCTEE------EFRNFFNQFGRVLD 281

Query: 122 LYMPKDQGSKIHRGIGFITFASSENLMA 150
             +  D+ +   RG GF+T+ +   + A
Sbjct: 282 ATLMMDKDTGRPRGFGFVTYENESAVEA 303
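
The percentages in an HSP header are the match counts divided by the alignment length. The sketch below recomputes them from the header text. `hsp_percentages` is a hypothetical helper, and its regular expression is an assumption about this page's header layout rather than a general BLAST parser.

```python
import re

def hsp_percentages(header):
    """Return (identity %, positives %) recomputed from the match counts."""
    # 'Pos\w*' tolerates spelling variants of the 'Positives' label.
    m = re.search(r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)", header)
    if m is None:
        raise ValueError("no Identity/Positives counts found")
    ident, n_ident, pos, n_pos = map(int, m.groups())
    return round(100 * ident / n_ident, 2), round(100 * pos / n_pos, 2)

print(hsp_percentages("Identity = 47/148 (31.76%), Positives = 79/148 (53.38%)"))
# (31.76, 53.38)
```

For the O94432 hit this reproduces the reported values: 47 of 148 aligned positions are identical and 79 of 148 are scored as positive.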

BLAST of CmaCh02G005760 vs. ExPASy Swiss-Prot
Match: Q8W034 (Heterogeneous nuclear ribonucleoprotein 1 OS=Arabidopsis thaliana OX=3702 GN=RNP1 PE=1 SV=1)

HSP 1 Score: 97.1 bits (240), Expect = 4.5e-19
Identity = 101/374 (27.01%), Positives = 148/374 (39.57%), Query Frame = 0

Query: 4   KLVVLGIPWDIDTEGLRDYMSKFGELEDCIVMKERSTGRSRGFGYVTFATDEDAKNALSN 63
           KL V GI W+ D + LR++ + +GE+   IVM+++ TGR RGFG+V F+        L  
Sbjct: 7   KLFVGGISWETDEDKLREHFTNYGEVSQAIVMRDKLTGRPRGFGFVIFSDPSVLDRVLQE 66

Query: 64  EHFLGNRMLEVKVATPKATKLEVGK--KIESGLKNSG---YRSFKEFCG---------EL 123
           +H +  R ++VK A  +  +   G+   + +   + G    ++ K F G         E 
Sbjct: 67  KHSIDTREVDVKRAMSREEQQVSGRTGNLNTSRSSGGDAYNKTKKIFVGGLPPTLTDEEF 126

Query: 124 RNHFEKYGEITDLYMPKDQGSKIHRGIGFITFASS---ENLMAET-HELEGSAVDDDFRP 183
           R +FE YG +TD+ +  DQ +   RG GF++F S    ++++ +T H+L G  V+     
Sbjct: 127 RQYFEVYGPVTDVAIMYDQATNRPRGFGFVSFDSEDAVDSVLHKTFHDLSGKQVEVKRAL 186

Query: 184 IGKMPRGGRGGGGGGGGYGAYNAYISAATRYAALGAPTLYDHPVSVYGGRREFRGMGKKI 243
                 GG G   GGGG G Y  Y    + Y        +    SV  G   +   G   
Sbjct: 187 PKDANPGGGGRSMGGGGSGGYQGYGGNESSYDGRMDSNRFLQHQSVGNGLPSYGSSGYGA 246

Query: 244 FIGRLPQEASADDLRQYFGRFGRILDVYVPKDPKRSGHRGFGFVTFAEDGVADRVSRRSH 303
             G     A       Y G  G                 G+G       G          
Sbjct: 247 GYGNGSNGAGYGAYGGYTGSAGGY---------GAGATAGYGATNIPGAGYGSSTGVAPR 306

Query: 304 EICGQQVAIDSATPLDDAGAAGASGTFVMNSAAESFGSYGGPMRTYGRMYGSLDFDDWG- 354
                  +     P   +GAA  SG  V  +A  +    G   + YG  YG     D G 
Sbjct: 307 NSWDTPASSGYGNPGYGSGAA-HSGYGVPGAAPPTQSPSGYSNQGYG--YGGYSGSDSGY 366

BLAST of CmaCh02G005760 vs. ExPASy Swiss-Prot
Match: Q96DH6 (RNA-binding protein Musashi homolog 2 OS=Homo sapiens OX=9606 GN=MSI2 PE=1 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 5.9e-19
Identity = 47/143 (32.87%), Positives = 82/143 (57.34%), Query Frame = 0

Query: 4   KLVVLGIPWDIDTEGLRDYMSKFGELEDCIVMKERSTGRSRGFGYVTFATDEDAKNALSN 63
           K+ + G+ W    + LRDY SKFGE+ +C+VM++ +T RSRGFG+VTFA        L  
Sbjct: 22  KMFIGGLSWQTSPDSLRDYFSKFGEIRECMVMRDPTTKRSRGFGFVTFADPASVDKVLGQ 81

Query: 64  EHF-LGNRMLEVKVATPKATKLEVGKKIESGLKNSGYRSFKEFCGELRNHFEKYGEITDL 123
            H  L ++ ++ KVA P+  + ++  + +      G  S      +++ +FE++G++ D 
Sbjct: 82  PHHELDSKTIDPKVAFPRRAQPKMVTRTKKIF--VGGLSANTVVEDVKQYFEQFGKVEDA 141

Query: 124 YMPKDQGSKIHRGIGFITFASSE 146
            +  D+ +  HRG GF+TF + +
Sbjct: 142 MLMFDKTTNRHRGFGFVTFENED 162

BLAST of CmaCh02G005760 vs. ExPASy Swiss-Prot
Match: Q920Q6 (RNA-binding protein Musashi homolog 2 OS=Mus musculus OX=10090 GN=Msi2 PE=1 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 5.9e-19
Identity = 47/143 (32.87%), Positives = 82/143 (57.34%), Query Frame = 0

Query: 4   KLVVLGIPWDIDTEGLRDYMSKFGELEDCIVMKERSTGRSRGFGYVTFATDEDAKNALSN 63
           K+ + G+ W    + LRDY SKFGE+ +C+VM++ +T RSRGFG+VTFA        L  
Sbjct: 22  KMFIGGLSWQTSPDSLRDYFSKFGEIRECMVMRDPTTKRSRGFGFVTFADPASVDKVLGQ 81

Query: 64  EHF-LGNRMLEVKVATPKATKLEVGKKIESGLKNSGYRSFKEFCGELRNHFEKYGEITDL 123
            H  L ++ ++ KVA P+  + ++  + +      G  S      +++ +FE++G++ D 
Sbjct: 82  PHHELDSKTIDPKVAFPRRAQPKMVTRTKKIF--VGGLSANTVVEDVKQYFEQFGKVEDA 141

Query: 124 YMPKDQGSKIHRGIGFITFASSE 146
            +  D+ +  HRG GF+TF + +
Sbjct: 142 MLMFDKTTNRHRGFGFVTFENED 162

BLAST of CmaCh02G005760 vs. ExPASy Swiss-Prot
Match: P48809 (Heterogeneous nuclear ribonucleoprotein 27C OS=Drosophila melanogaster OX=7227 GN=Hrb27C PE=1 SV=2)

HSP 1 Score: 95.9 bits (237), Expect = 1.0e-18
Identity = 69/201 (34.33%), Positives = 97/201 (48.26%), Query Frame = 0

Query: 4   KLVVLGIPWDIDTEGLRDYMSKFGELEDCIVMKERSTGRSRGFGYVTFATDEDAKNALSN 63
           KL V G+ W+   E L  Y  +FG++ DC+VMK   +GRSRGFG+VTFA   +  + L N
Sbjct: 8   KLFVGGLSWETTQENLSRYFCRFGDIIDCVVMKNNESGRSRGFGFVTFADPTNVNHVLQN 67

Query: 64  -EHFLGNRMLEVKVATPKATKLEVGKKIESGLKNSGYRSFKEFCG---------ELRNHF 123
             H L  R ++ K   P+         ++   K  GY   K F G         +LR  F
Sbjct: 68  GPHTLDGRTIDPKPCNPRT--------LQKPKKGGGY---KVFLGGLPSNVTETDLRTFF 127

Query: 124 EKYGEITDLYMPKDQGSKIHRGIGFITFASSENLMAETHE----LEGSAVDDDFRPIGKM 183
            +YG++T++ +  DQ  K  RG GF++F    ++   T+E    L G  V+     I K 
Sbjct: 128 NRYGKVTEVVIMYDQEKKKSRGFGFLSFEEESSVEHVTNERYINLNGKQVE-----IKKA 187

Query: 184 -PRGGRGGGGG-----GGGYG 185
            PR G GG        GG YG
Sbjct: 188 EPRDGSGGQNSNNSTVGGAYG 192

BLAST of CmaCh02G005760 vs. TAIR 10
Match: AT4G36960.1 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 486.5 bits (1251), Expect = 1.9e-137
Identity = 249/381 (65.35%), Positives = 289/381 (75.85%), Query Frame = 0

Query: 1   MDRKLVVLGIPWDIDTEGLRDYMSKFGELEDCIVMKERSTGRSRGFGYVTFATDEDAKNA 60
           M+RKLVVLGIPWDID++GL+DYMSKFG+LEDCIVMK+RSTGRSRGFGYVTFA+ EDAKNA
Sbjct: 1   MERKLVVLGIPWDIDSDGLKDYMSKFGDLEDCIVMKDRSTGRSRGFGYVTFASAEDAKNA 60

Query: 61  LSNEHFLGNRMLEVKVATPKATKLEVGKKIESGLKNSGYRSFKEFCGELRNHFEKYGEIT 120
           L  EHFLGNR+LEVKVATPK    +  KK+          S  E   + R+HFE+YGEIT
Sbjct: 61  LKGEHFLGNRILEVKVATPKEEMRQPAKKVTRIFVARIPSSVSE--SDFRSHFERYGEIT 120

Query: 121 DLYMPKDQGSKIHRGIGFITFASS---ENLMAETHELEGSAV--------DDDFRP---- 180
           DLYMPKD  SK HRGIGFITF+S+   E+LM +TH+L G+ V        +DD  P    
Sbjct: 121 DLYMPKDYNSKQHRGIGFITFSSADSVEDLMEDTHDLGGTTVAVDRATPKEDDHPPRPPP 180

Query: 181 ---IGKMPRGGRGGGGGGGGYGAYNAYISAATRYAALGAPTLYDHPVSVYG-GRREFRGM 240
              + + P    GG G  GGYGAY+AYISAATRYAALGAPTLYD+P + YG G    RG+
Sbjct: 181 VARMSRPPVAIAGGFGAPGGYGAYDAYISAATRYAALGAPTLYDNPATFYGRGEPTTRGI 240

Query: 241 GKKIFIGRLPQEASADDLRQYFGRFGRILDVYVPKDPKRSGHRGFGFVTFAEDGVADRVS 300
           G KIF+GRLPQEAS DDLR YFGRFG I D Y+PKDPKRSGHRGFGFVTFAE+GVADRV+
Sbjct: 241 GNKIFVGRLPQEASVDDLRDYFGRFGHIQDAYIPKDPKRSGHRGFGFVTFAENGVADRVA 300

Query: 301 RRSHEICGQQVAIDSATPLDDAGAAGASGTFVMNSAAESFGSYGGPMRTYGRMYGSLDFD 360
           RRSHEICGQ+VAIDSATPLD+AG +  + + + +S  E FG YGGPMR +GRMYG +  D
Sbjct: 301 RRSHEICGQEVAIDSATPLDEAGPSAGASSMLSSSRPEYFGGYGGPMRAFGRMYGGMSLD 360

Query: 361 DWGYGVGGGRPSRADWRYRPY 363
           DWGYG+   RPSR D RYRPY
Sbjct: 361 DWGYGMPNARPSRPDRRYRPY 379

BLAST of CmaCh02G005760 vs. TAIR 10
Match: AT4G36960.2 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 486.5 bits (1251), Expect = 1.9e-137
Identity = 249/381 (65.35%), Positives = 289/381 (75.85%), Query Frame = 0

Query: 1   MDRKLVVLGIPWDIDTEGLRDYMSKFGELEDCIVMKERSTGRSRGFGYVTFATDEDAKNA 60
           M+RKLVVLGIPWDID++GL+DYMSKFG+LEDCIVMK+RSTGRSRGFGYVTFA+ EDAKNA
Sbjct: 1   MERKLVVLGIPWDIDSDGLKDYMSKFGDLEDCIVMKDRSTGRSRGFGYVTFASAEDAKNA 60

Query: 61  LSNEHFLGNRMLEVKVATPKATKLEVGKKIESGLKNSGYRSFKEFCGELRNHFEKYGEIT 120
           L  EHFLGNR+LEVKVATPK    +  KK+          S  E   + R+HFE+YGEIT
Sbjct: 61  LKGEHFLGNRILEVKVATPKEEMRQPAKKVTRIFVARIPSSVSE--SDFRSHFERYGEIT 120

Query: 121 DLYMPKDQGSKIHRGIGFITFASS---ENLMAETHELEGSAV--------DDDFRP---- 180
           DLYMPKD  SK HRGIGFITF+S+   E+LM +TH+L G+ V        +DD  P    
Sbjct: 121 DLYMPKDYNSKQHRGIGFITFSSADSVEDLMEDTHDLGGTTVAVDRATPKEDDHPPRPPP 180

Query: 181 ---IGKMPRGGRGGGGGGGGYGAYNAYISAATRYAALGAPTLYDHPVSVYG-GRREFRGM 240
              + + P    GG G  GGYGAY+AYISAATRYAALGAPTLYD+P + YG G    RG+
Sbjct: 181 VARMSRPPVAIAGGFGAPGGYGAYDAYISAATRYAALGAPTLYDNPATFYGRGEPTTRGI 240

Query: 241 GKKIFIGRLPQEASADDLRQYFGRFGRILDVYVPKDPKRSGHRGFGFVTFAEDGVADRVS 300
           G KIF+GRLPQEAS DDLR YFGRFG I D Y+PKDPKRSGHRGFGFVTFAE+GVADRV+
Sbjct: 241 GNKIFVGRLPQEASVDDLRDYFGRFGHIQDAYIPKDPKRSGHRGFGFVTFAENGVADRVA 300

Query: 301 RRSHEICGQQVAIDSATPLDDAGAAGASGTFVMNSAAESFGSYGGPMRTYGRMYGSLDFD 360
           RRSHEICGQ+VAIDSATPLD+AG +  + + + +S  E FG YGGPMR +GRMYG +  D
Sbjct: 301 RRSHEICGQEVAIDSATPLDEAGPSAGASSMLSSSRPEYFGGYGGPMRAFGRMYGGMSLD 360

Query: 361 DWGYGVGGGRPSRADWRYRPY 363
           DWGYG+   RPSR D RYRPY
Sbjct: 361 DWGYGMPNARPSRPDRRYRPY 379

BLAST of CmaCh02G005760 vs. TAIR 10
Match: AT4G26650.1 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 109.4 bits (272), Expect = 6.2e-24
Identity = 62/179 (34.64%), Positives = 98/179 (54.75%), Query Frame = 0

Query: 4   KLVVLGIPWDIDTEGLRDYMSKFGELEDCIVMKERSTGRSRGFGYVTFATDEDAKNALSN 63
           KL + GI WD D E L++Y  K+G+L + ++M++R+TGR+RGFG++ FA    A+  + +
Sbjct: 16  KLFIGGISWDTDEERLQEYFGKYGDLVEAVIMRDRTTGRARGFGFIVFADPSVAERVIMD 75

Query: 64  EHFLGNRMLEVKVATPKATKLEVGKKIES---------GLKNSGYRSFKEFCG------- 123
           +H +  R +E K A P+  + +V K+  S         G    G R+ K F G       
Sbjct: 76  KHIIDGRTVEAKKAVPRDDQ-QVLKRHASPMHLISPSHGGNGGGARTKKIFVGGLPSSIT 135

Query: 124 --ELRNHFEKYGEITDLYMPKDQGSKIHRGIGFITFASSEN----LMAETHELEGSAVD 161
             E +N+F+++G I D+ +  D  ++  RG GFITF S E+    L    HEL G  V+
Sbjct: 136 EAEFKNYFDQFGTIADVVVMYDHNTQRPRGFGFITFDSEESVDMVLHKTFHELNGKMVE 193

BLAST of CmaCh02G005760 vs. TAIR 10
Match: AT4G26650.2 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 109.4 bits (272), Expect = 6.2e-24
Identity = 62/179 (34.64%), Positives = 98/179 (54.75%), Query Frame = 0

Query: 4   KLVVLGIPWDIDTEGLRDYMSKFGELEDCIVMKERSTGRSRGFGYVTFATDEDAKNALSN 63
           KL + GI WD D E L++Y  K+G+L + ++M++R+TGR+RGFG++ FA    A+  + +
Sbjct: 13  KLFIGGISWDTDEERLQEYFGKYGDLVEAVIMRDRTTGRARGFGFIVFADPSVAERVIMD 72

Query: 64  EHFLGNRMLEVKVATPKATKLEVGKKIES---------GLKNSGYRSFKEFCG------- 123
           +H +  R +E K A P+  + +V K+  S         G    G R+ K F G       
Sbjct: 73  KHIIDGRTVEAKKAVPRDDQ-QVLKRHASPMHLISPSHGGNGGGARTKKIFVGGLPSSIT 132

Query: 124 --ELRNHFEKYGEITDLYMPKDQGSKIHRGIGFITFASSEN----LMAETHELEGSAVD 161
             E +N+F+++G I D+ +  D  ++  RG GFITF S E+    L    HEL G  V+
Sbjct: 133 EAEFKNYFDQFGTIADVVVMYDHNTQRPRGFGFITFDSEESVDMVLHKTFHELNGKMVE 190

BLAST of CmaCh02G005760 vs. TAIR 10
Match: AT5G55550.1 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 101.7 bits (252), Expect = 1.3e-21
Identity = 66/214 (30.84%), Positives = 108/214 (50.47%), Query Frame = 0

Query: 4   KLVVLGIPWDIDTEGLRDYMSKFGELEDCIVMKERSTGRSRGFGYVTFATDEDAKNALSN 63
           KL + GI WD D E LRDY S +G++ + ++M++R+TGR+RGFG++ FA    ++  + +
Sbjct: 7   KLFIGGISWDTDEERLRDYFSNYGDVVEAVIMRDRATGRARGFGFIVFADPCVSERVIMD 66

Query: 64  EHFLGNRMLEVKVATPKATKLEVGK-----KIESGLKNSGYRSFKEFCG---------EL 123
           +H +  R +E K A P+  +  + +      + S +   G R+ K F G         E 
Sbjct: 67  KHIIDGRTVEAKKAVPRDDQQVLKRHASPIHLMSPVHGGGGRTKKIFVGGLPSSITEEEF 126

Query: 124 RNHFEKYGEITDLYMPKDQGSKIHRGIGFITFASSEN----LMAETHELEGS------AV 183
           +N+F+++G I D+ +  D  ++  RG GFITF S +     L    HEL G       AV
Sbjct: 127 KNYFDQFGTIADVVVMYDHNTQRPRGFGFITFDSDDAVDRVLHKTFHELNGKLVEVKRAV 186

Query: 184 DDDFRPIG--KMPRGGRGGGGGGGGYGAYNAYIS 192
             +  P+   + P       GGG      N+Y +
Sbjct: 187 PKEISPVSNIRSPLASGVNYGGGSNRMPANSYFN 220

The following BLAST results are available for this feature:

vs. ExPASy Swiss-Prot
Match Name   E-value   Identity  Description
O94432       9.0e-20   31.76     Uncharacterized RNA-binding protein C660.15 OS=Schizosaccharomyces pombe (strain...
Q8W034       4.5e-19   27.01     Heterogeneous nuclear ribonucleoprotein 1 OS=Arabidopsis thaliana OX=3702 GN=RNP...
Q96DH6       5.9e-19   32.87     RNA-binding protein Musashi homolog 2 OS=Homo sapiens OX=9606 GN=MSI2 PE=1 SV=1
Q920Q6       5.9e-19   32.87     RNA-binding protein Musashi homolog 2 OS=Mus musculus OX=10090 GN=Msi2 PE=1 SV=1
P48809       1.0e-18   34.33     Heterogeneous nuclear ribonucleoprotein 27C OS=Drosophila melanogaster OX=7227 G...

vs. TAIR 10
Match Name   E-value   Identity  Description
AT4G36960.1  1.9e-137  65.35     RNA-binding (RRM/RBD/RNP motifs) family protein
AT4G36960.2  1.9e-137  65.35     RNA-binding (RRM/RBD/RNP motifs) family protein
AT4G26650.1  6.2e-24   34.64     RNA-binding (RRM/RBD/RNP motifs) family protein
AT4G26650.2  6.2e-24   34.64     RNA-binding (RRM/RBD/RNP motifs) family protein
AT5G55550.1  1.3e-21   30.84     RNA-binding (RRM/RBD/RNP motifs) family protein

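
The summary rows can be ranked by E-value to pick out the closest homologs. The sketch below uses the values listed on this page. The cutoff of 1e-20 is an arbitrary illustrative choice, and `best_hits` is a hypothetical helper.

```python
# (match name, E-value as printed, percent identity), copied from the summary rows.
hits = [
    ("O94432", "9.0e-20", 31.76), ("Q8W034", "4.5e-19", 27.01),
    ("Q96DH6", "5.9e-19", 32.87), ("Q920Q6", "5.9e-19", 32.87),
    ("P48809", "1.0e-18", 34.33),
    ("AT4G36960.1", "1.9e-137", 65.35), ("AT4G36960.2", "1.9e-137", 65.35),
    ("AT4G26650.1", "6.2e-24", 34.64), ("AT4G26650.2", "6.2e-24", 34.64),
    ("AT5G55550.1", "1.3e-21", 30.84),
]

def best_hits(rows, max_evalue=1e-20):
    """Keep hits at or below the E-value cutoff, most significant first."""
    kept = [(name, float(e), ident) for name, e, ident in rows
            if float(e) <= max_evalue]
    return sorted(kept, key=lambda row: row[1])  # stable sort keeps input order on ties

for name, evalue, ident in best_hits(hits):
    print(name, evalue, ident)
```

With this cutoff only the five TAIR 10 hits remain, led by AT4G36960.1 and AT4G36960.2.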
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                                         Source       Source Term     Source Description                                   Alignment
IPR000504  RNA recognition motif domain                            SMART        SM00360         rrm1_1                                               coord: 83..169, e-value: 0.19, score: 12.9
IPR000504  RNA recognition motif domain                            SMART        SM00360         rrm1_1                                               coord: 4..75, e-value: 8.9E-22, score: 88.3
IPR000504  RNA recognition motif domain                            SMART        SM00360         rrm1_1                                               coord: 224..295, e-value: 1.1E-15, score: 68.2
IPR000504  RNA recognition motif domain                            PFAM         PF00076         RRM_1                                                coord: 225..279, e-value: 1.4E-12, score: 47.3
IPR000504  RNA recognition motif domain                            PFAM         PF00076         RRM_1                                                coord: 108..147, e-value: 1.9E-6, score: 27.6
IPR000504  RNA recognition motif domain                            PFAM         PF00076         RRM_1                                                coord: 5..70, e-value: 9.3E-15, score: 54.2
IPR000504  RNA recognition motif domain                            PROSITE      PS50102         RRM                                                  coord: 223..299, score: 16.005526
IPR000504  RNA recognition motif domain                            PROSITE      PS50102         RRM                                                  coord: 108..166, score: 9.967915
IPR000504  RNA recognition motif domain                            PROSITE      PS50102         RRM                                                  coord: 3..79, score: 16.517189
IPR012677  Nucleotide-binding alpha-beta plait domain superfamily  GENE3D       3.30.70.330                                                          coord: 107..193, e-value: 5.0E-11, score: 44.7
IPR012677  Nucleotide-binding alpha-beta plait domain superfamily  GENE3D       3.30.70.330                                                          coord: 221..303, e-value: 3.3E-20, score: 74.0
IPR012677  Nucleotide-binding alpha-beta plait domain superfamily  GENE3D       3.30.70.330                                                          coord: 1..86, e-value: 5.7E-24, score: 86.2
None       No IPR available                                        PANTHER      PTHR48027:SF15  OS01G0945800 PROTEIN                                 coord: 1..362
None       No IPR available                                        PANTHER      PTHR48027       HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN 87F-RELATED  coord: 1..362
None       No IPR available                                        CDD          cd12322         RRM2_TDP43                                           coord: 223..298, e-value: 9.88555E-30, score: 106.968
IPR035979  RNA-binding domain superfamily                          SUPERFAMILY  54928           RNA-binding domain, RBD                              coord: 3..157
IPR035979  RNA-binding domain superfamily                          SUPERFAMILY  54928           RNA-binding domain, RBD                              coord: 223..304
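
Each `coord` entry gives the residue range of a domain hit on the protein. The sketch below computes the length of each PROSITE RRM hit, assuming the ranges are 1-based and inclusive (an assumption, as the page does not state the convention). `coord_length` is a hypothetical helper.

```python
import re

def coord_length(text):
    """Length of a 'start..end' range, treating both ends as inclusive."""
    m = re.search(r"(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinate range in {text!r}")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1  # assumption: 1-based, inclusive residue positions

# PROSITE PS50102 (RRM) hits from the InterPro annotations, ordered along the protein.
prosite_rrm = ["coord: 3..79", "coord: 108..166", "coord: 223..299"]
print([coord_length(c) for c in prosite_rrm])  # [77, 59, 77]
```

The middle hit (108..166) is the shortest and also has the lowest PROSITE score of the three (9.967915).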

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh02G005760.1  CmaCh02G005760.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003723      RNA binding
molecular_function  GO:0003676      nucleic acid binding