Cp4.1LG09g06540 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG09g06540
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: RNA-binding protein, putative
Location: Cp4.1LG09 : 4942778 .. 4947289 (+)
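The location value can be split into its parts and used to compute the span of the gene on the chromosome. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cp4.1LG09 : 4942778 .. 4947289 (+)")

# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the feature covers 4512 positions on the plus strand of Cp4.1LG09.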
The following sequences are available for this feature:

Gene sequence (with intron)

AGTGGGCTCATGGATCCTGGCAAGCTTTTTATTGGGGGTATTTCGTGGGACACAGATGAAGATCGACTTAAACAGTATTTTCAAACCTATGGAGAAGTGGTGGAGGTGATGATCATGAGGGATCGGAACACTGGCCGTGCTCGTGGCTTCGGTTTCGTCGTGTTTTCTGACCCTGTTGTTGCAGCTAGAGTTGTATTGGAGAAGCATGTTATTGATGGAAGAACTGTAAGTTTTTTGAATTCTTCTTTTGATTTTGCTAATCAATATGCTCATGGAAGGATCATGGTTTTCTTGATTGTTCTTTGGTGTTCATCATGATTGAAACTTGTCAGAACTCTTCTTACATTTACATTTTTTTGAGTTCCACAACTCGGGGATGAGAATTTGAATATCTTGTTGAGGATTGTTGGGAGGGAGTCGTACATTGGCTAATTTAGGGAATGATACTGAGTTTATAATTAAGGAATACATCTCTATTAGTATGAGGCCGTTTTGGGAAGCCAAAAGCAAACTAACGAGAGCTTATACTCAAAGTGGACAATATCATACCATTGTGGAGGTCCGGGATTCCTAACATGGTATCAAAGTCATGCCCTTAACTTAGCCATGTGAATAGAATTCTCAAATGTCGAATAAAGAAGTTGTGAGCCTCGAAGGTGTAGTTAAAAGTGACTCAAGTGTTGAACAAAGGGTGTGTCCTACATTGGCTAATTTAGGGGTTTATAAGTAAGGAATACATCCCCATTGGTATGAGGCCTTACGGGGAAGCCCAAAGCAAAGCCATGAGAGCTTATACTCAAAGTGGACAATATCATACCATTGTGGAGGTCTGTGATTCCTAACATGGTATCAGAGTCATGCCCTTAACTTAGCCATATCAATAGAAAGCCTCGAAGGTGTAGTCAAAAGTGACTCAAGTGGTGTGTCCCACATTGGCTAATTTAGGGGTATATAAGTAACGAATACATCTCCATTAGTATGAGACCTTTTGGGGAAACCAAACAGTACACAATCATACCATTGTGGAGAGTCGTGGTTCCTAACACTAGTTAACGTCGAATCACTTTTGTGTTGAAAAACATATAAACTATAATTTTCCTTTGGAGCTTTGGATGGGAAGATTTGGCTGTCTGCATTTTATCAGGGTGAATTTTGTTTCATGTGGCTGTTCATATTTTTGTGGAAGTACAATGCCGCCTGCATTTGCCCCGAATTTCAATCGTTTCGGGAACCGATTGTTTATACATGTTCTAGAAAAGGTAATAAGCACGATGACAGTAAAGGAATGGTTTGAAACTAGTCCAAGTGAGCATAACTCAATGGTAACTAGCTATACCCTCGACCGAGAAAGGACATCGGAAGAATTGAAACTCAGATTTGTGAAAATTCGGGGGCTCGATATCTTATGGAGTTCGTGGATGCACGGACATTAGGATTTACATGTGATGTTCACTGCTTATATTAAAATGATACATTTTGTTGCGTCAAGATATATCTATTTGTATCGTAATTAGAGTAAAGTTTATCATGTTCATGTTTCGATATTGATTTTTTTGTTTTAGGTTGAAGCAAAGAAGGCTGTTCCTCGAGACGATCAGAACATTTTGAGCAGGAACAATACCGGTATCCTTGGGTCACCTGGCCCGACTCGCACAAAGAAGATATTTGTAGGGGGTTTGGCATCAACGGTTACGGAGAGTGACTTCAAAAAATATTTTGATCAGTTTGGAACGATCACAGATGCCGTGGTGATGTATGATCACAATACTCAGAGACCGAGAGGCTTTGGATTTATCACGTACGAGTCGGAGGAATCGGTGGAGAAAGTGTTATACAAAACATTTCATGAACTCAATGGTAAAATGGTTGAGGTTAAGAGAGCTGTTCCAAAAGAATTATCACCAGTGCCAAATCGAAATCAATTAGGTGCATACCCTTACAGTTTTGGTAGAGTTGGTAGCTATTTAAGTGGTTATAATCAAGGATATAACACAACCTCA
GTTGGGGGATATGGACTGAGATCTGATGGTAGGTTTAGTCCTGTTACGGTCGGTCGGGGTGGGCTGTCTCCAATTAGTCCTGGTTATGGAATAGGTCTAAATCTCGACGCAGGGTTGAACCTGAACTATGGGACCGGTCCCAATGGCAGTTCTAACCTCATTTACGGACGGGTAATGAGTTCTTCCTATGGTGGAAATTTAAATAGGTATGGTAGTCCAAGCCCCATGGTATATGGCGGAGGCAGTGGAGGTAATGGTTCTATTTTAAGCTCGTCGGTTCAGAATCTGTGGGGAAATGTCGGTAACTCGGCTGGTACGAATGCCTCGCACCTGAGGACGTTTCCTGGCTCTGGTGGTGTGCATACGGGAACGAGTTCATTGAACGGTATTGGAGGGCTTTGGGGTGTAGGTCATGGCGAAAACGCAGGTTCTTCTCCATTCAATGGTGCTGGTAATGTCGAGTTTGGAAACGGAGGAAGCGTTGTAGGTTATGGTAGAAGCATTAGAAGCAATGTTTCTTCAGCTTCTTTGTACTCTGCACCAAATATTTATGATGAAGTTCATGGGAATAATGAAGAAGGAAACTCATTTTATGGGCATTCAAGTTGGCAGTCATTGCCCACAGAGCTTGAGGATTCTTCCTCAATTGGGTTTGGGCTTGGCAATGCTGCTTCAGATGTTATTAGTAGAAACAATGCTTCTGGTTATACTGTTGGATATGGTGTTTCTAATACACAGTCGAATAGAGGTGAATCTCTCTCTATCTCTTTCGTTCTTTTATCGTCGTTACATGTGAGATCCCACATCGGTTGGGGAGGAGAACGAAACACTCTTTATAAGAGTGTGGAAACCTCTCCCCAGTAGACGTGTTTTCAAAACCTTGAGGGTAAGTTCGAAGGGGAAATCCTAAGGAGGACAATATCTGCTAGCGGTGGGGTTGGGTTGTTACAACTGATATCAGAGCTAGACACTAGGCGATGTGCCAATGGGGAGGTTGTTCCTTGAAGGGGGTAGACACAAGGCGATGTGCCAGTAAAGACGCTAGGCCCTGAAGGGGGGTGGATTTGGTAGGAGTCCCACATCGATTGGAGAAAGGAATGAGTGCCAGCGATAACGTTGGGCCCTGAAGGGGGTGGATTGTTTGAGATCCCACATCAGTTGGAGAGGAGAACAAAACACCCTTTATAAGAGTGTGGAAACCTCTCGCTAGTAGATGCGTTTTAAAATCCTTGAGGGTAAGCCCGAAAGGGAAAACCCAAAGAGGACAATATCTACTAGCGGTGGGAACGGGCTATTACAAATGGTATCAAAGCCAAACATCGGGCAATGTGCCAGCTAAGAGGCTGTTCTCCGAAGGGGGTAGACACGAGGCGGTGTGCCAGTAAGGACGTTGGACCCCGAAGGGGGTGGATTTGGTAGGGGTCCCACATCGATTGAAGAAAGGAATGAGTGTCAGCGAGGACGCTGGACCCTGAAGGGGGTGGATTATGAGATCCCACATCAGTTGGGGGGGAGGAGAACGAAACACTCTCTACAGGGTGTGGAAACCTCTCCCTAGTAGTAGACGTGTTTTAAAATCTTTGAGGGTAAACTCGAAAGGGAAAGCTCAAAGAGGACAATATCTGCTAGCGGTGCATTACATCTCTACAAACATTAGATCTTCGAGCTTATTGATATCAGTTTAAGCATGATTAAAACGCAACGACGTTACTTGAACAAGATCTGTTTATGAATGGTGAAATGTTTGTTTGATTCAGGAATTGCTGCTNCGAGGGCTTGGGAGTTAGCAAACTCTCTATAACAGTATTGACACTCGTATTTCCTCTCACCGGATGCGGCGGCGGCGCCTCCTCCTCCTCCTCCTCCACCTCCGCCTCCTCCTCCTCCTCCCCCTCCTCCCATATGATAATATGATACTGTGAGAGCTAATTTGTTTTCTGTTTTGAGAAAGTGTCCCATAAAAGAAAGCAGGTTCCAGAGGGTTTGGTTTTTGGATTGAG
CTTGAAAATATAGGTATTTTCTTCCTTTCTTTACTTGTAATTTGATTTGATTGATATGTATAATTGGATTGTGGGTAAGGTCTGATCATACGTTACTCCGACAAAGAAACGGTGTCACACGAACGAATCGACATCATATTTTTTCTTCTTTTCTTTGGTTTCGGGTTCGTTTTAGAACAACTCATTCAGGCTTTCTTCTTGTTGTTTCATCTCTTGTTATTATTATTAGCGGTTCTGGGTTTTTACTTCTCATGTGATTGATTGTATGAAACTGAGGTTGATTTATTTAGACATAGATCATACTCTCCATCTACCTTCCTTCCAATTCTTTCTTACTTTACAGCAAAAACGGATCTACAAATTATCTGCGATTACTTTATGCATGTGCTATGGTTTTTATTTACAATGCTACCATCAACTGGGTGCAATGGTGCGTTGTCTTTACCTAAAACAAAAGAGTAGCATATGCTTTGAATATTTTAACGGTTACAGGGTGCATTGCCTTTATCTAA

mRNA sequence

AGTGGGCTCATGGATCCTGGCAAGCTTTTTATTGGGGGTATTTCGTGGGACACAGATGAAGATCGACTTAAACAGTATTTTCAAACCTATGGAGAAGTGGTGGAGGTGATGATCATGAGGGATCGGAACACTGGCCGTGCTCGTGGCTTCGGTTTCGTCGTGTTTTCTGACCCTGTTGTTGCAGCTAGAGTTGTATTGGAGAAGCATGTTATTGATGGAAGAACTGTTGAAGCAAAGAAGGCTGTTCCTCGAGACGATCAGAACATTTTGAGCAGGAACAATACCGGTATCCTTGGGTCACCTGGCCCGACTCGCACAAAGAAGATATTTGTAGGGGGTTTGGCATCAACGGTTACGGAGAGTGACTTCAAAAAATATTTTGATCAGTTTGGAACGATCACAGATGCCGTGGTGATGTATGATCACAATACTCAGAGACCGAGAGGCTTTGGATTTATCACGTACGAGTCGGAGGAATCGGTGGAGAAAGTGTTATACAAAACATTTCATGAACTCAATGGTAAAATGGTTGAGGTTAAGAGAGCTGTTCCAAAAGAATTATCACCAGTGCCAAATCGAAATCAATTAGGTGCATACCCTTACAGTTTTGGTAGAGTTGGTAGCTATTTAAGTGGTTATAATCAAGGATATAACACAACCTCAGTTGGGGGATATGGACTGAGATCTGATGGTAGGTTTAGTCCTGTTACGGTCGGTCGGGGTGGGCTGTCTCCAATTAGTCCTGGTTATGGAATAGGTCTAAATCTCGACGCAGGGTTGAACCTGAACTATGGGACCGGTCCCAATGGCAGTTCTAACCTCATTTACGGACGGGTAATGAGTTCTTCCTATGGTGGAAATTTAAATAGGTATGGTAGTCCAAGCCCCATGGTATATGGCGGAGGCAGTGGAGGTAATGGTTCTATTTTAAGCTCGTCGGTTCAGAATCTGTGGGGAAATGTCGGTAACTCGGCTGGTACGAATGCCTCGCACCTGAGGACGTTTCCTGGCTCTGGTGGTGTGCATACGGGAACGAGTTCATTGAACGGTATTGGAGGGCTTTGGGGTGTAGGTCATGGCGAAAACGCAGGTTCTTCTCCATTCAATGGTGCTGGTAATGTCGAGTTTGGAAACGGAGGAAGCGTTGTAGGTTATGGTAGAAGCATTAGAAGCAATGTTTCTTCAGCTTCTTTGTACTCTGCACCAAATATTTATGATGAAGTTCATGGGAATAATGAAGAAGGAAACTCATTTTATGGGCATTCAAGTTGGCAGTCATTGCCCACAGAGCTTGAGGATTCTTCCTCAATTGGGTTTGGGCTTGGCAATGCTGCTTCAGATGTTATTAGTAGAAACAATGCTTCTGGTTATACTGTTGGATATGGGTGCATTGCCTTTATCTAA

Coding sequence (CDS)

AGTGGGCTCATGGATCCTGGCAAGCTTTTTATTGGGGGTATTTCGTGGGACACAGATGAAGATCGACTTAAACAGTATTTTCAAACCTATGGAGAAGTGGTGGAGGTGATGATCATGAGGGATCGGAACACTGGCCGTGCTCGTGGCTTCGGTTTCGTCGTGTTTTCTGACCCTGTTGTTGCAGCTAGAGTTGTATTGGAGAAGCATGTTATTGATGGAAGAACTGTTGAAGCAAAGAAGGCTGTTCCTCGAGACGATCAGAACATTTTGAGCAGGAACAATACCGGTATCCTTGGGTCACCTGGCCCGACTCGCACAAAGAAGATATTTGTAGGGGGTTTGGCATCAACGGTTACGGAGAGTGACTTCAAAAAATATTTTGATCAGTTTGGAACGATCACAGATGCCGTGGTGATGTATGATCACAATACTCAGAGACCGAGAGGCTTTGGATTTATCACGTACGAGTCGGAGGAATCGGTGGAGAAAGTGTTATACAAAACATTTCATGAACTCAATGGTAAAATGGTTGAGGTTAAGAGAGCTGTTCCAAAAGAATTATCACCAGTGCCAAATCGAAATCAATTAGGTGCATACCCTTACAGTTTTGGTAGAGTTGGTAGCTATTTAAGTGGTTATAATCAAGGATATAACACAACCTCAGTTGGGGGATATGGACTGAGATCTGATGGTAGGTTTAGTCCTGTTACGGTCGGTCGGGGTGGGCTGTCTCCAATTAGTCCTGGTTATGGAATAGGTCTAAATCTCGACGCAGGGTTGAACCTGAACTATGGGACCGGTCCCAATGGCAGTTCTAACCTCATTTACGGACGGGTAATGAGTTCTTCCTATGGTGGAAATTTAAATAGGTATGGTAGTCCAAGCCCCATGGTATATGGCGGAGGCAGTGGAGGTAATGGTTCTATTTTAAGCTCGTCGGTTCAGAATCTGTGGGGAAATGTCGGTAACTCGGCTGGTACGAATGCCTCGCACCTGAGGACGTTTCCTGGCTCTGGTGGTGTGCATACGGGAACGAGTTCATTGAACGGTATTGGAGGGCTTTGGGGTGTAGGTCATGGCGAAAACGCAGGTTCTTCTCCATTCAATGGTGCTGGTAATGTCGAGTTTGGAAACGGAGGAAGCGTTGTAGGTTATGGTAGAAGCATTAGAAGCAATGTTTCTTCAGCTTCTTTGTACTCTGCACCAAATATTTATGATGAAGTTCATGGGAATAATGAAGAAGGAAACTCATTTTATGGGCATTCAAGTTGGCAGTCATTGCCCACAGAGCTTGAGGATTCTTCCTCAATTGGGTTTGGGCTTGGCAATGCTGCTTCAGATGTTATTAGTAGAAACAATGCTTCTGGTTATACTGTTGGATATGGGTGCATTGCCTTTATCTAA

Protein sequence

SGLMDPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAARVVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDFKKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAVPKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRFSPVTVGRGGLSPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVYGGGSGGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLWGVGHGENAGSSPFNGAGNVEFGNGGSVVGYGRSIRSNVSSASLYSAPNIYDEVHGNNEEGNSFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNASGYTVGYGCIAFI
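The relationship between the coding sequence and the protein sequence can be checked by translating codons in frame 1. The following is a minimal sketch, assuming the standard genetic code; the helper `translate` and the variable names are hypothetical, and only the first 30 nucleotides of the CDS listed above are used.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(nucleotides):
    """Translate a DNA string in frame 1; unknown codons (e.g. with N) become 'X'."""
    seq = nucleotides.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )

cds_start = "AGTGGGCTCATGGATCCTGGCAAGCTTTTT"  # first 30 nt of the CDS above
protein_start = translate(cds_start)
print(protein_start)
```

The result matches the first ten residues of the protein sequence (SGLMDPGKLF). The same function applied to the last six codons of the CDS, TGCATTGCCTTTATCTAA, gives CIAFI followed by a stop.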
BLAST of Cp4.1LG09g06540 vs. Swiss-Prot
Match: RNP1_ARATH (Heterogeneous nuclear ribonucleoprotein 1 OS=Arabidopsis thaliana GN=RNP1 PE=1 SV=1)

HSP 1 Score: 238.0 bits (606), Expect = 2.0e-61
Identity = 172/442 (38.91%), Positives = 235/442 (53.17%), Query Frame = 1

Query: 5   DPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAARV 64
           D GKLF+GGISW+TDED+L+++F  YGEV + ++MRD+ TGR RGFGFV+FSDP V  RV
Sbjct: 4   DQGKLFVGGISWETDEDKLREHFTNYGEVSQAIVMRDKLTGRPRGFGFVIFSDPSVLDRV 63

Query: 65  VLEKHVIDGRTVEAKKAVPRDDQNILSR----NNTGILGSPGPTRTKKIFVGGLASTVTE 124
           + EKH ID R V+ K+A+ R++Q +  R    N +   G     +TKKIFVGGL  T+T+
Sbjct: 64  LQEKHSIDTREVDVKRAMSREEQQVSGRTGNLNTSRSSGGDAYNKTKKIFVGGLPPTLTD 123

Query: 125 SDFKKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVK 184
            +F++YF+ +G +TD  +MYD  T RPRGFGF++++SE++V+ VL+KTFH+L+GK VEVK
Sbjct: 124 EEFRQYFEVYGPVTDVAIMYDQATNRPRGFGFVSFDSEDAVDSVLHKTFHDLSGKQVEVK 183

Query: 185 RAVPKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRF-SPVTVG 244
           RA+PK+ +P       G    S G  GS   GY QGY        G     RF    +VG
Sbjct: 184 RALPKDANP-------GGGGRSMGGGGS--GGY-QGYGGNESSYDGRMDSNRFLQHQSVG 243

Query: 245 RGGLSPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVY 304
            G  S  S GYG G          YG G NG+    YG      Y G+   YG+ +   Y
Sbjct: 244 NGLPSYGSSGYGAG----------YGNGSNGAGYGAYG-----GYTGSAGGYGAGATAGY 303

Query: 305 GGGS---GGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLWG 364
           G  +    G GS    + +N W    +S   N  +     GSG  H+G          +G
Sbjct: 304 GATNIPGAGYGSSTGVAPRNSWDTPASSGYGNPGY-----GSGAAHSG----------YG 363

Query: 365 VGHGENAGSSPFNGAGNVEFGNG---GSVVGYGRSIRSNVSSASLYSAPNIYDEVHGNNE 424
           V        SP +G  N  +G G   GS  GYG        + + Y          G+N 
Sbjct: 364 VPGAAPPTQSP-SGYSNQGYGYGGYSGSDSGYG--------NQAAYGVVGGRPSGGGSNN 396

Query: 425 EGN-----SFYGHSSWQSLPTE 431
            G+       YG  SW+S P++
Sbjct: 424 PGSGGYMGGGYGDGSWRSDPSQ 396

BLAST of Cp4.1LG09g06540 vs. Swiss-Prot
Match: MSI2H_HUMAN (RNA-binding protein Musashi homolog 2 OS=Homo sapiens GN=MSI2 PE=1 SV=1)

HSP 1 Score: 184.5 bits (467), Expect = 2.7e-45
Identity = 123/323 (38.08%), Positives = 173/323 (53.56%), Query Frame = 1

Query: 5   DPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAARV 64
           DPGK+FIGG+SW T  D L+ YF  +GE+ E M+MRD  T R+RGFGFV F+DP    +V
Sbjct: 19  DPGKMFIGGLSWQTSPDSLRDYFSKFGEIRECMVMRDPTTKRSRGFGFVTFADPASVDKV 78

Query: 65  VLEKH-VIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 124
           + + H  +D +T++ K A PR  Q  +             TRTKKIFVGGL++     D 
Sbjct: 79  LGQPHHELDSKTIDPKVAFPRRAQPKMV------------TRTKKIFVGGLSANTVVEDV 138

Query: 125 KKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 184
           K+YF+QFG + DA++M+D  T R RGFGF+T+E+E+ VEKV    FHE+N KMVE K+A 
Sbjct: 139 KQYFEQFGKVEDAMLMFDKTTNRHRGFGFVTFENEDVVEKVCEIHFHEINNKMVECKKAQ 198

Query: 185 PKE-LSPVPNRNQLGAYPYS-------FGRVG--SYLSGYNQGYNTTSVGGYGLRSDGRF 244
           PKE + P   R +    PY+        G +G  ++++ Y +GY       YG +  G F
Sbjct: 199 PKEVMFPPGTRGRARGLPYTMDAFMLGMGMLGYPNFVATYGRGY-PGFAPSYGYQFPG-F 258

Query: 245 SPVTVGRGGLSPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGS 304
                G    + ++   G G N            P   ++L YG     S  GN     S
Sbjct: 259 PAAAYGPVAAAAVAAARGSGSNPARPGGFPGANSPGPVADL-YGPASQDSGVGNYISAAS 318

Query: 305 PSPMVYGGGSGGNGSILSSSVQN 317
           P P   G G G  G +++++  N
Sbjct: 319 PQP-GSGFGHGIAGPLIATAFTN 325

BLAST of Cp4.1LG09g06540 vs. Swiss-Prot
Match: MSIR6_DROME (RNA-binding protein Musashi homolog Rbp6 OS=Drosophila melanogaster GN=Rbp6 PE=2 SV=3)

HSP 1 Score: 181.8 bits (460), Expect = 1.7e-44
Identity = 92/184 (50.00%), Positives = 124/184 (67.39%), Query Frame = 1

Query: 5   DPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAARV 64
           DPGK+FIGG+SW T  + L+ YF  YG++ E M+M+D  T R+RGFGFV FSDP    +V
Sbjct: 27  DPGKMFIGGLSWQTSPESLRDYFGRYGDISEAMVMKDPTTRRSRGFGFVTFSDPNSVDKV 86

Query: 65  VLE-KHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 124
           + +  H +DG+ V+ K A PR       R +  ++     TRTKKIFVGGL++  T  D 
Sbjct: 87  LTQGTHELDGKKVDPKVAFPR-------RAHPKMV-----TRTKKIFVGGLSAPTTLEDV 146

Query: 125 KKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 184
           K YF+QFG I DA++M+D  T R RGFGF+T++SE+ V+KV    FHE+N KMVE K+A 
Sbjct: 147 KSYFEQFGPIEDAMLMFDKQTNRHRGFGFVTFQSEDVVDKVCEIHFHEINNKMVECKKAQ 198

Query: 185 PKEL 188
           PKE+
Sbjct: 207 PKEV 198

BLAST of Cp4.1LG09g06540 vs. Swiss-Prot
Match: RBP1_ARATH (RNA-binding protein 1 OS=Arabidopsis thaliana GN=RBP1 PE=2 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 6.6e-44
Identity = 101/228 (44.30%), Positives = 143/228 (62.72%), Query Frame = 1

Query: 5   DPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAARV 64
           D  KLF+GGI+ +T E+ LKQYF  YG V+E ++ +++ TG+ RGFGFV F++     + 
Sbjct: 4   DRYKLFVGGIAKETSEEALKQYFSRYGAVLEAVVAKEKVTGKPRGFGFVRFANDCDVVKA 63

Query: 65  VLEKHVIDGRTVEAKKAVPRDD-----------QNILSRNNTGI--LGSPG-PTRTKKIF 124
           + + H I G+ V+ +KA+ + +           +  + + N G+  + S G  +RTKKIF
Sbjct: 64  LRDTHFILGKPVDVRKAIRKHELYQQPFSMQFLERKVQQMNGGLREMSSNGVTSRTKKIF 123

Query: 125 VGGLASTVTESDFKKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFH 184
           VGGL+S  TE +FK YF++FG  TD VVM+D  T RPRGFGF+TY+SE+SVE V+   FH
Sbjct: 124 VGGLSSNTTEEEFKSYFERFGRTTDVVVMHDGVTNRPRGFGFVTYDSEDSVEVVMQSNFH 183

Query: 185 ELNGKMVEVKRAVPKELSPVPNRNQLGAYP-YSFGRVGSYLSGYNQGY 218
           EL+ K VEVKRA+PKE     N N +   P YS  +   Y+   N GY
Sbjct: 184 ELSDKRVEVKRAIPKEGIQSNNGNAVNIPPSYSSFQATPYVPEQN-GY 230

BLAST of Cp4.1LG09g06540 vs. Swiss-Prot
Match: MSI1H_HUMAN (RNA-binding protein Musashi homolog 1 OS=Homo sapiens GN=MSI1 PE=1 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 1.5e-43
Identity = 103/236 (43.64%), Positives = 144/236 (61.02%), Query Frame = 1

Query: 5   DPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAARV 64
           DP K+FIGG+SW T ++ L++YF  +GEV E ++MRD  T R+RGFGFV F D     +V
Sbjct: 18  DPCKMFIGGLSWQTTQEGLREYFGQFGEVKECLVMRDPLTKRSRGFGFVTFMDQAGVDKV 77

Query: 65  VLE-KHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 124
           + + +H +D +T++ K A PR  Q  +             TRTKKIFVGGL+   T  D 
Sbjct: 78  LAQSRHELDSKTIDPKVAFPRRAQPKMV------------TRTKKIFVGGLSVNTTVEDV 137

Query: 125 KKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 184
           K+YF+QFG + DA++M+D  T R RGFGF+T+ESE+ VEKV    FHE+N KMVE K+A 
Sbjct: 138 KQYFEQFGKVDDAMLMFDKTTNRHRGFGFVTFESEDIVEKVCEIHFHEINNKMVECKKAQ 197

Query: 185 PKE-LSPVPN---RNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRFSP 236
           PKE +SP  +   R+++  Y      +G  + GY  G+  T+   Y  RS    +P
Sbjct: 198 PKEVMSPTGSARGRSRVMPYGMDAFMLGIGMLGY-PGFQATT---YASRSYTGLAP 237

BLAST of Cp4.1LG09g06540 vs. TrEMBL
Match: A0A0A0LTE8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G181500 PE=4 SV=1)

HSP 1 Score: 791.6 bits (2043), Expect = 5.3e-226
Identity = 411/469 (87.63%), Positives = 431/469 (91.90%), Query Frame = 1

Query: 4   MDPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAAR 63
           MDPGKLFIGGISWDTDEDRL++YF+ +GEVVEVMIMRDR TGRARGFGFVVF+DPV AAR
Sbjct: 1   MDPGKLFIGGISWDTDEDRLREYFRNFGEVVEVMIMRDRATGRARGFGFVVFADPVAAAR 60

Query: 64  VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 123
           VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF
Sbjct: 61  VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 120

Query: 124 KKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 183
           KKYFDQFGTI D VVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV
Sbjct: 121 KKYFDQFGTIVDVVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 180

Query: 184 PKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRFSPVTVGRGGL 243
           PKE SPVPNRNQL  YP++FGRVGSYL+GYNQGYN T+VGGYGLRSDGRFSPVTVGRGGL
Sbjct: 181 PKESSPVPNRNQLAGYPFNFGRVGSYLNGYNQGYNPTAVGGYGLRSDGRFSPVTVGRGGL 240

Query: 244 SPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVYGGGS 303
           SPISPGYG+GLNL+ GLN NYGTGPN SSNL YGRVMS SY GNLNRYGSP+PMVY GG 
Sbjct: 241 SPISPGYGMGLNLETGLNPNYGTGPNVSSNLSYGRVMSPSYSGNLNRYGSPNPMVYSGG- 300

Query: 304 GGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLWG----VGH 363
           GGNGSILSSSVQNLWGNV  SAGTN+SHLRTFPGSGGVHTGTSSLN IGGLWG    +GH
Sbjct: 301 GGNGSILSSSVQNLWGNVSTSAGTNSSHLRTFPGSGGVHTGTSSLNNIGGLWGASASLGH 360

Query: 364 GENAGSSPFNGAGNVEFGNG------GSVVGYGRSIRSNVSSASLYSAPNIYDEVHGNNE 423
           GENAGSS FN   N++FGNG      G+ VGY RSI +NVSSASLYSAPNIYDEVHGNN+
Sbjct: 361 GENAGSS-FNTV-NLDFGNGDASFTSGTTVGYARSIGTNVSSASLYSAPNIYDEVHGNND 420

Query: 424 EGNSFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNASGYTVGYG 463
           EGN+FYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNA GYTVGYG
Sbjct: 421 EGNTFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNA-GYTVGYG 465

BLAST of Cp4.1LG09g06540 vs. TrEMBL
Match: A0A061GGB0_THECC (RNA-binding family protein isoform 3 OS=Theobroma cacao GN=TCM_030384 PE=4 SV=1)

HSP 1 Score: 549.7 bits (1415), Expect = 3.5e-153
Identity = 295/467 (63.17%), Positives = 359/467 (76.87%), Query Frame = 1

Query: 4   MDPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAAR 63
           M+ GKLFIGGISWDT+EDRL++YFQ +G+VVE +IMRDR TGRARGFGFVVF+DP +A R
Sbjct: 3   MELGKLFIGGISWDTNEDRLREYFQAFGDVVEAVIMRDRATGRARGFGFVVFADPAIAER 62

Query: 64  VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 123
           VV+EKH+IDGRTVEAKKAVPRDDQNIL+++N  I GSPGP RTKKIFVGGLASTVTESDF
Sbjct: 63  VVMEKHMIDGRTVEAKKAVPRDDQNILNKSNVSIHGSPGPARTKKIFVGGLASTVTESDF 122

Query: 124 KKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 183
           K+YFDQFGTITD VVMYDHNTQRPRGFGFITY+SEE+V+KVL +TFHELNGKMVEVKRAV
Sbjct: 123 KRYFDQFGTITDVVVMYDHNTQRPRGFGFITYDSEEAVDKVLQRTFHELNGKMVEVKRAV 182

Query: 184 PKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRFSPVTVGRGGL 243
           PKE SP P+RNQLG Y +   RV S+L+GY QGYNT+SVGGYG R +GRFSPVT GR G 
Sbjct: 183 PKESSPGPSRNQLGGYNFGLSRVNSFLNGYMQGYNTSSVGGYGFRMEGRFSPVTAGRSGF 242

Query: 244 SPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVYGGGS 303
            P+SPGYG+GLN ++ L+ +YG   N  SNL YGR + +S+ GN NR+G  SP  YGGGS
Sbjct: 243 PPLSPGYGMGLNFESNLSPSYGGSSNLGSNLSYGRGLYTSFNGNSNRFG--SPFGYGGGS 302

Query: 304 GGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLW----GVGH 363
           GGN SIL+S+ +N+WGN   +  TN++      GSG  ++G SS   IG LW     +G 
Sbjct: 303 GGNSSILNSAGRNMWGNGSFNYATNSTTSSAIVGSGSGNSGVSSFGSIGALWDSSPSLGQ 362

Query: 364 GENAGSSPFNGA----GNVEFGNGGSVVGYGRSIRSNVSSASLYSAPN-IYDEVHGNNEE 423
           G  A S+ +NG     G+ +FG G   +GYGR+  SNV+  S + A N  YD  + N  E
Sbjct: 363 GGGAASA-YNGGNLRYGSGDFGVGSGGIGYGRN--SNVAQLSTHGASNGGYDGAYANIYE 422

Query: 424 GNSFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNASGYTVGY 462
             SFYG S+W+S P++LE SSS GFGLG+A+SDV++ NN++ Y  GY
Sbjct: 423 NGSFYGDSTWRSSPSDLERSSSFGFGLGDASSDVMT-NNSADYIGGY 463

BLAST of Cp4.1LG09g06540 vs. TrEMBL
Match: A0A061GIB9_THECC (RNA-binding family protein isoform 5 OS=Theobroma cacao GN=TCM_030384 PE=4 SV=1)

HSP 1 Score: 549.7 bits (1415), Expect = 3.5e-153
Identity = 295/467 (63.17%), Positives = 359/467 (76.87%), Query Frame = 1

Query: 4   MDPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAAR 63
           M+ GKLFIGGISWDT+EDRL++YFQ +G+VVE +IMRDR TGRARGFGFVVF+DP +A R
Sbjct: 3   MELGKLFIGGISWDTNEDRLREYFQAFGDVVEAVIMRDRATGRARGFGFVVFADPAIAER 62

Query: 64  VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 123
           VV+EKH+IDGRTVEAKKAVPRDDQNIL+++N  I GSPGP RTKKIFVGGLASTVTESDF
Sbjct: 63  VVMEKHMIDGRTVEAKKAVPRDDQNILNKSNVSIHGSPGPARTKKIFVGGLASTVTESDF 122

Query: 124 KKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 183
           K+YFDQFGTITD VVMYDHNTQRPRGFGFITY+SEE+V+KVL +TFHELNGKMVEVKRAV
Sbjct: 123 KRYFDQFGTITDVVVMYDHNTQRPRGFGFITYDSEEAVDKVLQRTFHELNGKMVEVKRAV 182

Query: 184 PKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRFSPVTVGRGGL 243
           PKE SP P+RNQLG Y +   RV S+L+GY QGYNT+SVGGYG R +GRFSPVT GR G 
Sbjct: 183 PKESSPGPSRNQLGGYNFGLSRVNSFLNGYMQGYNTSSVGGYGFRMEGRFSPVTAGRSGF 242

Query: 244 SPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVYGGGS 303
            P+SPGYG+GLN ++ L+ +YG   N  SNL YGR + +S+ GN NR+G  SP  YGGGS
Sbjct: 243 PPLSPGYGMGLNFESNLSPSYGGSSNLGSNLSYGRGLYTSFNGNSNRFG--SPFGYGGGS 302

Query: 304 GGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLW----GVGH 363
           GGN SIL+S+ +N+WGN   +  TN++      GSG  ++G SS   IG LW     +G 
Sbjct: 303 GGNSSILNSAGRNMWGNGSFNYATNSTTSSAIVGSGSGNSGVSSFGSIGALWDSSPSLGQ 362

Query: 364 GENAGSSPFNGA----GNVEFGNGGSVVGYGRSIRSNVSSASLYSAPN-IYDEVHGNNEE 423
           G  A S+ +NG     G+ +FG G   +GYGR+  SNV+  S + A N  YD  + N  E
Sbjct: 363 GGGAASA-YNGGNLRYGSGDFGVGSGGIGYGRN--SNVAQLSTHGASNGGYDGAYANIYE 422

Query: 424 GNSFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNASGYTVGY 462
             SFYG S+W+S P++LE SSS GFGLG+A+SDV++ NN++ Y  GY
Sbjct: 423 NGSFYGDSTWRSSPSDLERSSSFGFGLGDASSDVMT-NNSADYIGGY 463

BLAST of Cp4.1LG09g06540 vs. TrEMBL
Match: A0A061GHM1_THECC (RNA-binding family protein isoform 1 OS=Theobroma cacao GN=TCM_030384 PE=4 SV=1)

HSP 1 Score: 549.7 bits (1415), Expect = 3.5e-153
Identity = 295/467 (63.17%), Positives = 359/467 (76.87%), Query Frame = 1

Query: 4   MDPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAAR 63
           M+ GKLFIGGISWDT+EDRL++YFQ +G+VVE +IMRDR TGRARGFGFVVF+DP +A R
Sbjct: 3   MELGKLFIGGISWDTNEDRLREYFQAFGDVVEAVIMRDRATGRARGFGFVVFADPAIAER 62

Query: 64  VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 123
           VV+EKH+IDGRTVEAKKAVPRDDQNIL+++N  I GSPGP RTKKIFVGGLASTVTESDF
Sbjct: 63  VVMEKHMIDGRTVEAKKAVPRDDQNILNKSNVSIHGSPGPARTKKIFVGGLASTVTESDF 122

Query: 124 KKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 183
           K+YFDQFGTITD VVMYDHNTQRPRGFGFITY+SEE+V+KVL +TFHELNGKMVEVKRAV
Sbjct: 123 KRYFDQFGTITDVVVMYDHNTQRPRGFGFITYDSEEAVDKVLQRTFHELNGKMVEVKRAV 182

Query: 184 PKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRFSPVTVGRGGL 243
           PKE SP P+RNQLG Y +   RV S+L+GY QGYNT+SVGGYG R +GRFSPVT GR G 
Sbjct: 183 PKESSPGPSRNQLGGYNFGLSRVNSFLNGYMQGYNTSSVGGYGFRMEGRFSPVTAGRSGF 242

Query: 244 SPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVYGGGS 303
            P+SPGYG+GLN ++ L+ +YG   N  SNL YGR + +S+ GN NR+G  SP  YGGGS
Sbjct: 243 PPLSPGYGMGLNFESNLSPSYGGSSNLGSNLSYGRGLYTSFNGNSNRFG--SPFGYGGGS 302

Query: 304 GGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLW----GVGH 363
           GGN SIL+S+ +N+WGN   +  TN++      GSG  ++G SS   IG LW     +G 
Sbjct: 303 GGNSSILNSAGRNMWGNGSFNYATNSTTSSAIVGSGSGNSGVSSFGSIGALWDSSPSLGQ 362

Query: 364 GENAGSSPFNGA----GNVEFGNGGSVVGYGRSIRSNVSSASLYSAPN-IYDEVHGNNEE 423
           G  A S+ +NG     G+ +FG G   +GYGR+  SNV+  S + A N  YD  + N  E
Sbjct: 363 GGGAASA-YNGGNLRYGSGDFGVGSGGIGYGRN--SNVAQLSTHGASNGGYDGAYANIYE 422

Query: 424 GNSFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNASGYTVGY 462
             SFYG S+W+S P++LE SSS GFGLG+A+SDV++ NN++ Y  GY
Sbjct: 423 NGSFYGDSTWRSSPSDLERSSSFGFGLGDASSDVMT-NNSADYIGGY 463

BLAST of Cp4.1LG09g06540 vs. TrEMBL
Match: W9QWV7_9ROSA (RNA-binding protein Musashi-2-like protein OS=Morus notabilis GN=L484_004264 PE=4 SV=1)

HSP 1 Score: 541.2 bits (1393), Expect = 1.2e-150
Identity = 295/472 (62.50%), Positives = 351/472 (74.36%), Query Frame = 1

Query: 4   MDPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAAR 63
           M+PGKLFIGGISWDT EDRL++YFQ++G+VVE +IM+DR TGRARGFGF+VF+DP VA +
Sbjct: 1   MEPGKLFIGGISWDTTEDRLREYFQSFGDVVEAVIMKDRATGRARGFGFIVFADPTVAEK 60

Query: 64  VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 123
           VV EKHVID R VEAKKAVPRDDQNIL+RNN+ I GSPGP RTKKIFVGGLASTVTESDF
Sbjct: 61  VVTEKHVIDSRFVEAKKAVPRDDQNILNRNNSSIQGSPGPARTKKIFVGGLASTVTESDF 120

Query: 124 KKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 183
           KKYFDQFGTITD VVMYDHNTQRPRGFGFITY+SE++V+KVLYKTFHELNGKMVEVKRAV
Sbjct: 121 KKYFDQFGTITDVVVMYDHNTQRPRGFGFITYDSEDAVDKVLYKTFHELNGKMVEVKRAV 180

Query: 184 PKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRFSPVTVGRGGL 243
           PKELSP P+R QLG Y +  GRV S+L+GY+QGYN  SVGGY     GR SPVTVGR G 
Sbjct: 181 PKELSPGPSRGQLGVYNHGLGRVSSFLNGYSQGYNLNSVGGY-----GRLSPVTVGRNGF 240

Query: 244 SPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVYGGGS 303
           SP  PGYG+G+N + GL+ +YG   N SSNL +GR +S SY  + NR+GS     YGGG+
Sbjct: 241 SPFGPGYGVGVNFETGLSPSYGVNANLSSNLNFGRGLSPSYSSSSNRFGSTPG--YGGGN 300

Query: 304 GGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLW-------- 363
            GN SILS++ +N WGN   S  TN+++   F GS G +TG  SL  IG LW        
Sbjct: 301 EGNSSILSTTGRNFWGNGNLSYPTNSANSSAFIGSAGGNTGVGSLGSIGALWGSSPNPSQ 360

Query: 364 --GVGHGENAGSSPFNGAGNVEFGNGGSVVGYGRSIRSNVSSASLYSAPNIYDEVHGNNE 423
             G G   N GS  + G+G+V F +GG  +GYGR+   +V+  S Y A   YD  H +  
Sbjct: 361 GGGAGSNYNTGSLVY-GSGDVRFRSGG--LGYGRN-GGSVAPGSSYGANGGYDGAHSDIY 420

Query: 424 EGNSFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNASGYTVGYGCIA 466
           +G + YG S+W+S  +EL+ S S GFGLGN ASDV+S+N  SGY  GYG  A
Sbjct: 421 DGGALYGDSTWKSSQSELDGSGSFGFGLGNGASDVMSKN--SGYIGGYGVTA 459

BLAST of Cp4.1LG09g06540 vs. TAIR10
Match: AT3G07810.2 (AT3G07810.2 RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 431.8 bits (1109), Expect = 5.4e-121
Identity = 264/486 (54.32%), Positives = 324/486 (66.67%), Query Frame = 1

Query: 5   DPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAARV 64
           D GKLFIGGISWDT+E+RLK+YF ++GEV+E +I++DR TGRARGFGFVVF+DP VA  V
Sbjct: 4   DNGKLFIGGISWDTNEERLKEYFSSFGEVIEAVILKDRTTGRARGFGFVVFADPAVAEIV 63

Query: 65  VLEKHVIDGRTVEAKKAVPRDDQNILSRNNTG-ILGSPG-PTRTKKIFVGGLASTVTESD 124
           + EKH IDGR VEAKKAVPRDDQN+++R+N+  I GSPG P RT+KIFVGGL S+VTESD
Sbjct: 64  ITEKHNIDGRLVEAKKAVPRDDQNMVNRSNSSSIQGSPGGPGRTRKIFVGGLPSSVTESD 123

Query: 125 FKKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRA 184
           FK YF+QFGT TD VVMYDHNTQRPRGFGFITY+SEE+VEKVL KTFHELNGKMVEVKRA
Sbjct: 124 FKTYFEQFGTTTDVVVMYDHNTQRPRGFGFITYDSEEAVEKVLLKTFHELNGKMVEVKRA 183

Query: 185 VPKELSPVPNRNQLGA-YPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRFSPVTVGRG 244
           VPKELSP P+R+ LGA Y Y   RV + L+GY QG+N  +VGGYGLR DGRFSPV  GR 
Sbjct: 184 VPKELSPGPSRSPLGAGYSYGVNRVNNLLNGYAQGFNPAAVGGYGLRMDGRFSPVGAGRS 243

Query: 245 GLSPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVYGG 304
           G +  S GYG+ +N D GL   +  G N + N+ YGR MS  Y GN NR+G P+    GG
Sbjct: 244 GFANYSSGYGMNVNFDQGLPTGFTGGTNYNGNVDYGRGMSPYYIGNTNRFG-PAVGYEGG 303

Query: 305 GSGGNGSILSSSVQNLWGNVG----NSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLWGV 364
             GGN S  SS  +NLWGN G    N+  TN S+  T+   GG  +G ++L+G  G  GV
Sbjct: 304 NGGGNSSFFSSVTRNLWGNNGGLNYNNNNTN-SNSNTY--MGGSSSGNNTLSGPFGNSGV 363

Query: 365 GHGENAGSSPFNGAGNVEFGNGGS--------VVGY-----GRSIRSNVSSASLYSAPNI 424
             G   G +      NV+FG GG+          GY     G +  +  SS S  SA N 
Sbjct: 364 NWGAPGGGNNAVSNENVKFGYGGNGESGFGLGTGGYAARNPGANKAAPSSSFSSASATNN 423

Query: 425 --YD-----EVHGNNEEGNSFYGHSSWQSLPTELEDSSSIGFGLGNA--ASDVISRNNAS 462
             YD     E +GN     + Y   +W+S   E E  +   +G+G    +SDV +R+++ 
Sbjct: 424 TGYDTAGLAEFYGN----GAVYSDPTWRSPTPETEGPAPFSYGIGGGVPSSDVSARSSSP 481

BLAST of Cp4.1LG09g06540 vs. TAIR10
Match: AT5G47620.4 (AT5G47620.4 RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 362.8 bits (930), Expect = 3.1e-100
Identity = 225/478 (47.07%), Positives = 290/478 (60.67%), Query Frame = 1

Query: 4   MDPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAAR 63
           M+  KLFIGGISW+T EDRL+ YF ++GEV+E +IM+DR TGRARGFGFVVF+DP VA R
Sbjct: 3   MESCKLFIGGISWETSEDRLRDYFHSFGEVLEAVIMKDRATGRARGFGFVVFADPNVAER 62

Query: 64  VVLEKHVIDGR----------------------TVEAKKAVPRDDQNILSRNNTGILGSP 123
           VVL KH+IDG+                       VEAKKAVPRDD  + +++N+ + GSP
Sbjct: 63  VVLLKHIIDGKILVDSIVYNQLCRSDKCISLSEVVEAKKAVPRDDHVVFNKSNSSLQGSP 122

Query: 124 GPTRTKKIFVGGLASTVTESDFKKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESV 183
           GP+ +KKIFVGGLAS+VTE++FKKYF QFG ITD VVMYDH TQRPRGFGFI+Y+SEE+V
Sbjct: 123 GPSNSKKIFVGGLASSVTEAEFKKYFAQFGMITDVVVMYDHRTQRPRGFGFISYDSEEAV 182

Query: 184 EKVLYKTFHELNGKMVEVKRAVPKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTS 243
           +KVL KTFHELNGKMVEVK AVPK+++    RNQ+    +   R+ S L+ Y QG++ + 
Sbjct: 183 DKVLQKTFHELNGKMVEVKLAVPKDMALNTMRNQMNVNSFGTSRISSLLNEYTQGFSPSP 242

Query: 244 VGGYGLRSDGRFSPVTVGRGGLSPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMS 303
           + GYG++ + R+SP    RGG SP   GYGI LN +     NYG+G +G     +GR  S
Sbjct: 243 ISGYGVKPEVRYSPAVGNRGGFSPFGHGYGIELNFEPNQTQNYGSGSSGG----FGRPFS 302

Query: 304 SSYGGNLNRYGSPSPMVYGGGSGGNGSILSSSVQN-LWGNVGNSAGTNASHLR-TFPGSG 363
             Y  +L R+G  S M  GG S GNGS+L+++ +N LWGN G    +N+   R +F G+ 
Sbjct: 303 PGYAASLGRFG--SQMESGGASVGNGSVLNAAPKNHLWGNGGLGYMSNSPISRSSFSGNS 362

Query: 364 GVHTGTSSLNGIGGLWGVGHGENAGSSPFNGAGNVEFGNGGSVVGYGRSIRSNVSSASLY 423
           G+    SSL  IG  WG      +      G   +E   G  V GY        S +S+ 
Sbjct: 363 GM----SSLGSIGDNWGTVARARSSYHGERGGVGLEAMRGVHVGGYS-------SGSSIL 422

Query: 424 SAPNIYDEVHGNNEEGNSFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNASGY 458
                         E +S Y  S W SLP + E+      GLG    D +SR  A GY
Sbjct: 423 --------------EADSLYSDSMWLSLPAKAEE------GLGMGPLDFMSRGPA-GY 442

BLAST of Cp4.1LG09g06540 vs. TAIR10
Match: AT4G26650.1 (AT4G26650.1 RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 340.9 bits (873), Expect = 1.3e-93
Identity = 229/482 (47.51%), Positives = 286/482 (59.34%), Query Frame = 1

Query: 5   DPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAARV 64
           D GKLFIGGISWDTDE+RL++YF  YG++VE +IMRDR TGRARGFGF+VF+DP VA RV
Sbjct: 13  DLGKLFIGGISWDTDEERLQEYFGKYGDLVEAVIMRDRTTGRARGFGFIVFADPSVAERV 72

Query: 65  VLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGIL-------GSPGPTRTKKIFVGGLAST 124
           +++KH+IDGRTVEAKKAVPRDDQ +L R+ + +        G+ G  RTKKIFVGGL S+
Sbjct: 73  IMDKHIIDGRTVEAKKAVPRDDQQVLKRHASPMHLISPSHGGNGGGARTKKIFVGGLPSS 132

Query: 125 VTESDFKKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMV 184
           +TE++FK YFDQFGTI D VVMYDHNTQRPRGFGFIT++SEESV+ VL+KTFHELNGKMV
Sbjct: 133 ITEAEFKNYFDQFGTIADVVVMYDHNTQRPRGFGFITFDSEESVDMVLHKTFHELNGKMV 192

Query: 185 EVKRAVPKEL-SPVPNRNQLGAYPYSFGRV------GSYLSGYNQGYNTTSVGGYGLRSD 244
           EVKRAVPKEL S  PNR+ L  Y  ++G V       SY + +  GYN  ++G     S 
Sbjct: 193 EVKRAVPKELSSTTPNRSPLIGYGNNYGVVPNRSSANSYFNSFPPGYNNNNLG-----SA 252

Query: 245 GRFSPVTVGRGGLSPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSY--GGNL 304
           GRFSP+  GR   S     +G+GLN +  LN N+       + L Y R+  + Y    + 
Sbjct: 253 GRFSPIGSGRNAFS----SFGLGLNQELNLNSNF-----DGNTLGYSRIPGNQYFNSASP 312

Query: 305 NRYGSPSPMVYGGGSGGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSL 364
           NRY SP       G     S  + S ++LWGN  +S+G                      
Sbjct: 313 NRYNSPI------GYNRGDSAYNPSNRDLWGNRSDSSGP--------------------- 372

Query: 365 NGIGGLWGVGHGENAGSSPFNGAGNVEFGNGGSVVGYGRS--IRSNVSSASLYSAPNIYD 424
              G   GV  G N G+    G  +V   N     GYGRS    S +S  S     N +D
Sbjct: 373 ---GWNLGVSVGNNRGNW---GLSSVVSDNN----GYGRSYGAGSGLSGLSFAGNTNGFD 432

Query: 425 EVHGNNEEGNSFYGHSSW-QSLP-----TELED-SSSIGFGLGNAASDVISRNNASGYTV 462
              G    G+S Y  S+W QS+P      EL+  S S GFG+ N  SD  S N + GY+ 
Sbjct: 433 GSIGELYRGSSVYSDSTWQQSMPHHQSSNELDGLSRSYGFGIDNVGSDP-SANASEGYSG 442

BLAST of Cp4.1LG09g06540 vs. TAIR10
Match: AT5G55550.3 (AT5G55550.3 RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 312.0 bits (798), Expect = 6.2e-85
Identity = 211/419 (50.36%), Positives = 271/419 (64.68%), Query Frame = 1

Query: 5   DPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAARV 64
           D GKLFIGGISWDTDE+RL+ YF  YG+VVE +IMRDR TGRARGFGF+VF+DP V+ RV
Sbjct: 4   DLGKLFIGGISWDTDEERLRDYFSNYGDVVEAVIMRDRATGRARGFGFIVFADPCVSERV 63

Query: 65  VLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGI-LGSP---GPTRTKKIFVGGLASTVTE 124
           +++KH+IDGRTVEAKKAVPRDDQ +L R+ + I L SP   G  RTKKIFVGGL S++TE
Sbjct: 64  IMDKHIIDGRTVEAKKAVPRDDQQVLKRHASPIHLMSPVHGGGGRTKKIFVGGLPSSITE 123

Query: 125 SDFKKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVK 184
            +FK YFDQFGTI D VVMYDHNTQRPRGFGFIT++S+++V++VL+KTFHELNGK+VEVK
Sbjct: 124 EEFKNYFDQFGTIADVVVMYDHNTQRPRGFGFITFDSDDAVDRVLHKTFHELNGKLVEVK 183

Query: 185 RAVPKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDG----RFSPV 244
           RAVPKE+SPV N     A   ++G  GS     N  +N  + G     S G    RFSPV
Sbjct: 184 RAVPKEISPVSNIRSPLASGVNYGG-GSNRMPANSYFNNFAPGPGFYNSLGPVGRRFSPV 243

Query: 245 T-VGRGGLSPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSY--GGNLNRYGS 304
              GR  +S     +G+GLN D  LNLN  +    SS   Y R+ S+ Y  G + NRY S
Sbjct: 244 IGSGRNAVS----AFGLGLNHDLSLNLN-PSCDGTSSTFGYNRIPSNPYFNGASPNRYTS 303

Query: 305 PSPMVYGGGSGGNGSILSSSVQNLWGNVGNSAG----TNASHLRTFPGSGGVHTGTSSLN 364
           P       G     S  +S+ ++LWGN  ++AG     N S+     G+ G+ + ++  N
Sbjct: 304 PI------GHNRTESPYNSNNRDLWGNRTDTAGPGWNLNVSNGNN-RGNWGLPSSSAVSN 363

Query: 365 GIGGLWGVGHGENAG--SSPFNG-AGNV-EFGNGGSVVGYGRSIRSNVSSASLYSAPNI 405
              G +G  +G ++G  SSPFNG  G++ E   GGSV       +  + S S +   N+
Sbjct: 364 DNNG-FGRNYGTSSGLSSSPFNGFEGSIGELYRGGSVYSDSTWQQQQLPSQSSHELDNL 408
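
The HSP header lines above follow a regular pattern, so the reported percentages can be recomputed from the raw counts. The following is a minimal sketch, assuming the two header lines are available as plain strings in the layout shown on this page; the function name `parse_hsp` and the returned dictionary keys are hypothetical, not part of any BLAST tool. The label before the positives count is matched loosely so that spelling variants are accepted.

```python
import re

# Patterns for the two HSP header lines as they appear on this page.
HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
HSP_IDENT = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),"
    r"\s*Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp(score_line, ident_line):
    """Extract HSP statistics and recompute percentages from the counts."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(ident_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    ident, length = int(i.group(1)), int(i.group(2))
    pos, pos_length = int(i.group(4)), int(i.group(5))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": float(s.group(3)),
        "aligned_length": length,
        # Percentages recomputed from counts, rounded as on the page.
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / pos_length, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 312.0 bits (798), Expect = 6.2e-85",
    "Identity = 211/419 (50.36%), Positives = 271/419 (64.68%), Query Frame = 1",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

For the AT5G55550.3 hit, the recomputed values agree with the percentages printed in the header (211/419 and 271/419).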

BLAST of Cp4.1LG09g06540 vs. TAIR10
Match: AT4G14300.1 (AT4G14300.1 RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 238.0 bits (606), Expect = 1.1e-62
Identity = 172/442 (38.91%), Positives = 235/442 (53.17%), Query Frame = 1

Query: 5   DPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAARV 64
           D GKLF+GGISW+TDED+L+++F  YGEV + ++MRD+ TGR RGFGFV+FSDP V  RV
Sbjct: 4   DQGKLFVGGISWETDEDKLREHFTNYGEVSQAIVMRDKLTGRPRGFGFVIFSDPSVLDRV 63

Query: 65  VLEKHVIDGRTVEAKKAVPRDDQNILSR----NNTGILGSPGPTRTKKIFVGGLASTVTE 124
           + EKH ID R V+ K+A+ R++Q +  R    N +   G     +TKKIFVGGL  T+T+
Sbjct: 64  LQEKHSIDTREVDVKRAMSREEQQVSGRTGNLNTSRSSGGDAYNKTKKIFVGGLPPTLTD 123

Query: 125 SDFKKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVK 184
            +F++YF+ +G +TD  +MYD  T RPRGFGF++++SE++V+ VL+KTFH+L+GK VEVK
Sbjct: 124 EEFRQYFEVYGPVTDVAIMYDQATNRPRGFGFVSFDSEDAVDSVLHKTFHDLSGKQVEVK 183

Query: 185 RAVPKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRF-SPVTVG 244
           RA+PK+ +P       G    S G  GS   GY QGY        G     RF    +VG
Sbjct: 184 RALPKDANP-------GGGGRSMGGGGS--GGY-QGYGGNESSYDGRMDSNRFLQHQSVG 243

Query: 245 RGGLSPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVY 304
            G  S  S GYG G          YG G NG+    YG      Y G+   YG+ +   Y
Sbjct: 244 NGLPSYGSSGYGAG----------YGNGSNGAGYGAYG-----GYTGSAGGYGAGATAGY 303

Query: 305 GGGS---GGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLWG 364
           G  +    G GS    + +N W    +S   N  +     GSG  H+G          +G
Sbjct: 304 GATNIPGAGYGSSTGVAPRNSWDTPASSGYGNPGY-----GSGAAHSG----------YG 363

Query: 365 VGHGENAGSSPFNGAGNVEFGNG---GSVVGYGRSIRSNVSSASLYSAPNIYDEVHGNNE 424
           V        SP +G  N  +G G   GS  GYG        + + Y          G+N 
Sbjct: 364 VPGAAPPTQSP-SGYSNQGYGYGGYSGSDSGYG--------NQAAYGVVGGRPSGGGSNN 396

Query: 425 EGN-----SFYGHSSWQSLPTE 431
            G+       YG  SW+S P++
Sbjct: 424 PGSGGYMGGGYGDGSWRSDPSQ 396

BLAST of Cp4.1LG09g06540 vs. NCBI nr
Match: gi|659127261|ref|XP_008463611.1| (PREDICTED: heterogeneous nuclear ribonucleoprotein 1-like [Cucumis melo])

HSP 1 Score: 802.0 bits (2070), Expect = 5.6e-229
Identity = 414/469 (88.27%), Positives = 433/469 (92.32%), Query Frame = 1

Query: 4   MDPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAAR 63
           MDPGKLFIGGISWDTDEDRL++YF+ +GEVVEVMIMRDR TGRARGFGFVVF+DPV AAR
Sbjct: 1   MDPGKLFIGGISWDTDEDRLREYFRNFGEVVEVMIMRDRTTGRARGFGFVVFADPVAAAR 60

Query: 64  VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 123
           VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF
Sbjct: 61  VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 120

Query: 124 KKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 183
           KKYFDQFGTI D VVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV
Sbjct: 121 KKYFDQFGTIVDVVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 180

Query: 184 PKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRFSPVTVGRGGL 243
           PKE SPVPNRNQL  YPY+FGRVGSYL+GYNQGYN TSVGGYGLRSDGRFSPVTVGRGGL
Sbjct: 181 PKESSPVPNRNQLAGYPYNFGRVGSYLNGYNQGYNPTSVGGYGLRSDGRFSPVTVGRGGL 240

Query: 244 SPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVYGGGS 303
           SPISPGYG+G+NLD GLN NYGTGP  SSNL YGRVMS SYGGNLNRYGSP+PMVYGGG 
Sbjct: 241 SPISPGYGMGINLDTGLNPNYGTGP--SSNLSYGRVMSPSYGGNLNRYGSPNPMVYGGGG 300

Query: 304 GGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLWG----VGH 363
           GGNGSILSSSVQNLWGNV NSAGTN+SHLRTFPGSGGVHTGT+SLN IGGLWG    +GH
Sbjct: 301 GGNGSILSSSVQNLWGNVSNSAGTNSSHLRTFPGSGGVHTGTNSLNNIGGLWGTSASLGH 360

Query: 364 GENAGSSPFNGAGNVEFGNG------GSVVGYGRSIRSNVSSASLYSAPNIYDEVHGNNE 423
           GEN G SPFN A N++FGNG      G+ VGY RSI +NVSSASLYSAPNIYDEVHGNN+
Sbjct: 361 GEN-GGSPFNTA-NLDFGNGDASFTSGTTVGYARSIGTNVSSASLYSAPNIYDEVHGNND 420

Query: 424 EGNSFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNASGYTVGYG 463
           EGN+FYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNA GYTVGYG
Sbjct: 421 EGNTFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNA-GYTVGYG 464

BLAST of Cp4.1LG09g06540 vs. NCBI nr
Match: gi|449443534|ref|XP_004139532.1| (PREDICTED: heterogeneous nuclear ribonucleoprotein 1 [Cucumis sativus])

HSP 1 Score: 791.6 bits (2043), Expect = 7.6e-226
Identity = 411/469 (87.63%), Positives = 431/469 (91.90%), Query Frame = 1

Query: 4   MDPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAAR 63
           MDPGKLFIGGISWDTDEDRL++YF+ +GEVVEVMIMRDR TGRARGFGFVVF+DPV AAR
Sbjct: 1   MDPGKLFIGGISWDTDEDRLREYFRNFGEVVEVMIMRDRATGRARGFGFVVFADPVAAAR 60

Query: 64  VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 123
           VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF
Sbjct: 61  VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 120

Query: 124 KKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 183
           KKYFDQFGTI D VVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV
Sbjct: 121 KKYFDQFGTIVDVVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 180

Query: 184 PKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRFSPVTVGRGGL 243
           PKE SPVPNRNQL  YP++FGRVGSYL+GYNQGYN T+VGGYGLRSDGRFSPVTVGRGGL
Sbjct: 181 PKESSPVPNRNQLAGYPFNFGRVGSYLNGYNQGYNPTAVGGYGLRSDGRFSPVTVGRGGL 240

Query: 244 SPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVYGGGS 303
           SPISPGYG+GLNL+ GLN NYGTGPN SSNL YGRVMS SY GNLNRYGSP+PMVY GG 
Sbjct: 241 SPISPGYGMGLNLETGLNPNYGTGPNVSSNLSYGRVMSPSYSGNLNRYGSPNPMVYSGG- 300

Query: 304 GGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLWG----VGH 363
           GGNGSILSSSVQNLWGNV  SAGTN+SHLRTFPGSGGVHTGTSSLN IGGLWG    +GH
Sbjct: 301 GGNGSILSSSVQNLWGNVSTSAGTNSSHLRTFPGSGGVHTGTSSLNNIGGLWGASASLGH 360

Query: 364 GENAGSSPFNGAGNVEFGNG------GSVVGYGRSIRSNVSSASLYSAPNIYDEVHGNNE 423
           GENAGSS FN   N++FGNG      G+ VGY RSI +NVSSASLYSAPNIYDEVHGNN+
Sbjct: 361 GENAGSS-FNTV-NLDFGNGDASFTSGTTVGYARSIGTNVSSASLYSAPNIYDEVHGNND 420

Query: 424 EGNSFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNASGYTVGYG 463
           EGN+FYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNA GYTVGYG
Sbjct: 421 EGNTFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNA-GYTVGYG 465

BLAST of Cp4.1LG09g06540 vs. NCBI nr
Match: gi|590626877|ref|XP_007026293.1| (RNA-binding family protein isoform 3 [Theobroma cacao])

HSP 1 Score: 549.7 bits (1415), Expect = 5.0e-153
Identity = 295/467 (63.17%), Positives = 359/467 (76.87%), Query Frame = 1

Query: 4   MDPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAAR 63
           M+ GKLFIGGISWDT+EDRL++YFQ +G+VVE +IMRDR TGRARGFGFVVF+DP +A R
Sbjct: 3   MELGKLFIGGISWDTNEDRLREYFQAFGDVVEAVIMRDRATGRARGFGFVVFADPAIAER 62

Query: 64  VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 123
           VV+EKH+IDGRTVEAKKAVPRDDQNIL+++N  I GSPGP RTKKIFVGGLASTVTESDF
Sbjct: 63  VVMEKHMIDGRTVEAKKAVPRDDQNILNKSNVSIHGSPGPARTKKIFVGGLASTVTESDF 122

Query: 124 KKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 183
           K+YFDQFGTITD VVMYDHNTQRPRGFGFITY+SEE+V+KVL +TFHELNGKMVEVKRAV
Sbjct: 123 KRYFDQFGTITDVVVMYDHNTQRPRGFGFITYDSEEAVDKVLQRTFHELNGKMVEVKRAV 182

Query: 184 PKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRFSPVTVGRGGL 243
           PKE SP P+RNQLG Y +   RV S+L+GY QGYNT+SVGGYG R +GRFSPVT GR G 
Sbjct: 183 PKESSPGPSRNQLGGYNFGLSRVNSFLNGYMQGYNTSSVGGYGFRMEGRFSPVTAGRSGF 242

Query: 244 SPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVYGGGS 303
            P+SPGYG+GLN ++ L+ +YG   N  SNL YGR + +S+ GN NR+G  SP  YGGGS
Sbjct: 243 PPLSPGYGMGLNFESNLSPSYGGSSNLGSNLSYGRGLYTSFNGNSNRFG--SPFGYGGGS 302

Query: 304 GGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLW----GVGH 363
           GGN SIL+S+ +N+WGN   +  TN++      GSG  ++G SS   IG LW     +G 
Sbjct: 303 GGNSSILNSAGRNMWGNGSFNYATNSTTSSAIVGSGSGNSGVSSFGSIGALWDSSPSLGQ 362

Query: 364 GENAGSSPFNGA----GNVEFGNGGSVVGYGRSIRSNVSSASLYSAPN-IYDEVHGNNEE 423
           G  A S+ +NG     G+ +FG G   +GYGR+  SNV+  S + A N  YD  + N  E
Sbjct: 363 GGGAASA-YNGGNLRYGSGDFGVGSGGIGYGRN--SNVAQLSTHGASNGGYDGAYANIYE 422

Query: 424 GNSFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNASGYTVGY 462
             SFYG S+W+S P++LE SSS GFGLG+A+SDV++ NN++ Y  GY
Sbjct: 423 NGSFYGDSTWRSSPSDLERSSSFGFGLGDASSDVMT-NNSADYIGGY 463

BLAST of Cp4.1LG09g06540 vs. NCBI nr
Match: gi|590626883|ref|XP_007026295.1| (RNA-binding family protein isoform 5 [Theobroma cacao])

HSP 1 Score: 549.7 bits (1415), Expect = 5.0e-153
Identity = 295/467 (63.17%), Positives = 359/467 (76.87%), Query Frame = 1

Query: 4   MDPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAAR 63
           M+ GKLFIGGISWDT+EDRL++YFQ +G+VVE +IMRDR TGRARGFGFVVF+DP +A R
Sbjct: 3   MELGKLFIGGISWDTNEDRLREYFQAFGDVVEAVIMRDRATGRARGFGFVVFADPAIAER 62

Query: 64  VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 123
           VV+EKH+IDGRTVEAKKAVPRDDQNIL+++N  I GSPGP RTKKIFVGGLASTVTESDF
Sbjct: 63  VVMEKHMIDGRTVEAKKAVPRDDQNILNKSNVSIHGSPGPARTKKIFVGGLASTVTESDF 122

Query: 124 KKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 183
           K+YFDQFGTITD VVMYDHNTQRPRGFGFITY+SEE+V+KVL +TFHELNGKMVEVKRAV
Sbjct: 123 KRYFDQFGTITDVVVMYDHNTQRPRGFGFITYDSEEAVDKVLQRTFHELNGKMVEVKRAV 182

Query: 184 PKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRFSPVTVGRGGL 243
           PKE SP P+RNQLG Y +   RV S+L+GY QGYNT+SVGGYG R +GRFSPVT GR G 
Sbjct: 183 PKESSPGPSRNQLGGYNFGLSRVNSFLNGYMQGYNTSSVGGYGFRMEGRFSPVTAGRSGF 242

Query: 244 SPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVYGGGS 303
            P+SPGYG+GLN ++ L+ +YG   N  SNL YGR + +S+ GN NR+G  SP  YGGGS
Sbjct: 243 PPLSPGYGMGLNFESNLSPSYGGSSNLGSNLSYGRGLYTSFNGNSNRFG--SPFGYGGGS 302

Query: 304 GGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLW----GVGH 363
           GGN SIL+S+ +N+WGN   +  TN++      GSG  ++G SS   IG LW     +G 
Sbjct: 303 GGNSSILNSAGRNMWGNGSFNYATNSTTSSAIVGSGSGNSGVSSFGSIGALWDSSPSLGQ 362

Query: 364 GENAGSSPFNGA----GNVEFGNGGSVVGYGRSIRSNVSSASLYSAPN-IYDEVHGNNEE 423
           G  A S+ +NG     G+ +FG G   +GYGR+  SNV+  S + A N  YD  + N  E
Sbjct: 363 GGGAASA-YNGGNLRYGSGDFGVGSGGIGYGRN--SNVAQLSTHGASNGGYDGAYANIYE 422

Query: 424 GNSFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNASGYTVGY 462
             SFYG S+W+S P++LE SSS GFGLG+A+SDV++ NN++ Y  GY
Sbjct: 423 NGSFYGDSTWRSSPSDLERSSSFGFGLGDASSDVMT-NNSADYIGGY 463

BLAST of Cp4.1LG09g06540 vs. NCBI nr
Match: gi|590626871|ref|XP_007026291.1| (RNA-binding family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 549.7 bits (1415), Expect = 5.0e-153
Identity = 295/467 (63.17%), Positives = 359/467 (76.87%), Query Frame = 1

Query: 4   MDPGKLFIGGISWDTDEDRLKQYFQTYGEVVEVMIMRDRNTGRARGFGFVVFSDPVVAAR 63
           M+ GKLFIGGISWDT+EDRL++YFQ +G+VVE +IMRDR TGRARGFGFVVF+DP +A R
Sbjct: 3   MELGKLFIGGISWDTNEDRLREYFQAFGDVVEAVIMRDRATGRARGFGFVVFADPAIAER 62

Query: 64  VVLEKHVIDGRTVEAKKAVPRDDQNILSRNNTGILGSPGPTRTKKIFVGGLASTVTESDF 123
           VV+EKH+IDGRTVEAKKAVPRDDQNIL+++N  I GSPGP RTKKIFVGGLASTVTESDF
Sbjct: 63  VVMEKHMIDGRTVEAKKAVPRDDQNILNKSNVSIHGSPGPARTKKIFVGGLASTVTESDF 122

Query: 124 KKYFDQFGTITDAVVMYDHNTQRPRGFGFITYESEESVEKVLYKTFHELNGKMVEVKRAV 183
           K+YFDQFGTITD VVMYDHNTQRPRGFGFITY+SEE+V+KVL +TFHELNGKMVEVKRAV
Sbjct: 123 KRYFDQFGTITDVVVMYDHNTQRPRGFGFITYDSEEAVDKVLQRTFHELNGKMVEVKRAV 182

Query: 184 PKELSPVPNRNQLGAYPYSFGRVGSYLSGYNQGYNTTSVGGYGLRSDGRFSPVTVGRGGL 243
           PKE SP P+RNQLG Y +   RV S+L+GY QGYNT+SVGGYG R +GRFSPVT GR G 
Sbjct: 183 PKESSPGPSRNQLGGYNFGLSRVNSFLNGYMQGYNTSSVGGYGFRMEGRFSPVTAGRSGF 242

Query: 244 SPISPGYGIGLNLDAGLNLNYGTGPNGSSNLIYGRVMSSSYGGNLNRYGSPSPMVYGGGS 303
            P+SPGYG+GLN ++ L+ +YG   N  SNL YGR + +S+ GN NR+G  SP  YGGGS
Sbjct: 243 PPLSPGYGMGLNFESNLSPSYGGSSNLGSNLSYGRGLYTSFNGNSNRFG--SPFGYGGGS 302

Query: 304 GGNGSILSSSVQNLWGNVGNSAGTNASHLRTFPGSGGVHTGTSSLNGIGGLW----GVGH 363
           GGN SIL+S+ +N+WGN   +  TN++      GSG  ++G SS   IG LW     +G 
Sbjct: 303 GGNSSILNSAGRNMWGNGSFNYATNSTTSSAIVGSGSGNSGVSSFGSIGALWDSSPSLGQ 362

Query: 364 GENAGSSPFNGA----GNVEFGNGGSVVGYGRSIRSNVSSASLYSAPN-IYDEVHGNNEE 423
           G  A S+ +NG     G+ +FG G   +GYGR+  SNV+  S + A N  YD  + N  E
Sbjct: 363 GGGAASA-YNGGNLRYGSGDFGVGSGGIGYGRN--SNVAQLSTHGASNGGYDGAYANIYE 422

Query: 424 GNSFYGHSSWQSLPTELEDSSSIGFGLGNAASDVISRNNASGYTVGY 462
             SFYG S+W+S P++LE SSS GFGLG+A+SDV++ NN++ Y  GY
Sbjct: 423 NGSFYGDSTWRSSPSDLERSSSFGFGLGDASSDVMT-NNSADYIGGY 463

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
RNP1_ARATH   2.0e-61  38.91     Heterogeneous nuclear ribonucleoprotein 1 OS=Arabidopsis thaliana GN=RNP1 PE=1 S...
MSI2H_HUMAN  2.7e-45  38.08     RNA-binding protein Musashi homolog 2 OS=Homo sapiens GN=MSI2 PE=1 SV=1
MSIR6_DROME  1.7e-44  50.00     RNA-binding protein Musashi homolog Rbp6 OS=Drosophila melanogaster GN=Rbp6 PE=2...
RBP1_ARATH   6.6e-44  44.30     RNA-binding protein 1 OS=Arabidopsis thaliana GN=RBP1 PE=2 SV=1
MSI1H_HUMAN  1.5e-43  43.64     RNA-binding protein Musashi homolog 1 OS=Homo sapiens GN=MSI1 PE=1 SV=1

Match Name        E-value   Identity  Description
A0A0A0LTE8_CUCSA  5.3e-226  87.63     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G181500 PE=4 SV=1
A0A061GGB0_THECC  3.5e-153  63.17     RNA-binding family protein isoform 3 OS=Theobroma cacao GN=TCM_030384 PE=4 SV=1
A0A061GIB9_THECC  3.5e-153  63.17     RNA-binding family protein isoform 5 OS=Theobroma cacao GN=TCM_030384 PE=4 SV=1
A0A061GHM1_THECC  3.5e-153  63.17     RNA-binding family protein isoform 1 OS=Theobroma cacao GN=TCM_030384 PE=4 SV=1
W9QWV7_9ROSA      1.2e-150  62.50     RNA-binding protein Musashi-2-like protein OS=Morus notabilis GN=L484_004264 PE=...

Match Name   E-value   Identity  Description
AT3G07810.2  5.4e-121  54.32     RNA-binding (RRM/RBD/RNP motifs) family protein
AT5G47620.4  3.1e-100  47.07     RNA-binding (RRM/RBD/RNP motifs) family protein
AT4G26650.1  1.3e-93   47.51     RNA-binding (RRM/RBD/RNP motifs) family protein
AT5G55550.3  6.2e-85   50.36     RNA-binding (RRM/RBD/RNP motifs) family protein
AT4G14300.1  1.1e-62   38.91     RNA-binding (RRM/RBD/RNP motifs) family protein

Match Name                        E-value   Identity  Description
gi|659127261|ref|XP_008463611.1|  5.6e-229  88.27     PREDICTED: heterogeneous nuclear ribonucleoprotein 1-like [Cucumis melo]
gi|449443534|ref|XP_004139532.1|  7.6e-226  87.63     PREDICTED: heterogeneous nuclear ribonucleoprotein 1 [Cucumis sativus]
gi|590626877|ref|XP_007026293.1|  5.0e-153  63.17     RNA-binding family protein isoform 3 [Theobroma cacao]
gi|590626883|ref|XP_007026295.1|  5.0e-153  63.17     RNA-binding family protein isoform 5 [Theobroma cacao]
gi|590626871|ref|XP_007026291.1|  5.0e-153  63.17     RNA-binding family protein isoform 1 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0000166  nucleotide binding
GO:0003676  nucleic acid binding
Vocabulary: INTERPRO
Term       Definition
IPR012677  Nucleotide-bd_a/b_plait_sf
IPR000504  RRM_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0000166 nucleotide binding
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG09g06540.1  Cp4.1LG09g06540.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                             Source   Source Term        Source Description                        Alignment
IPR000504  RNA recognition motif domain                PFAM     PF00076            RRM_1                                     coord: 9..74, score: 9.4E-16; coord: 109..167, score: 1.7
IPR000504  RNA recognition motif domain                SMART    SM00360            rrm1_1                                    coord: 108..180, score: 4.1E-24; coord: 8..79, score: 2.8
IPR000504  RNA recognition motif domain                PROFILE  PS50102            RRM                                       coord: 107..184, score: 17.677; coord: 7..83, score: 17
IPR012677  Nucleotide-binding alpha-beta plait domain  GENE3D   G3DSA:3.30.70.330                                            coord: 7..88, score: 3.0E-24; coord: 102..186, score: 7.7
IPR012677  Nucleotide-binding alpha-beta plait domain  unknown  SSF54928           RNA-binding domain, RBD                   coord: 103..205, score: 4.03E-26; coord: 5..102, score: 3.09
None       No IPR available                            PANTHER  PTHR24012          FAMILY NOT NAMED                          coord: 106..336, score: 1.2E-163; coord: 5..89, score: 1.2E
None       No IPR available                            PANTHER  PTHR24012:SF433    RNA RECOGNITION MOTIF-CONTAINING PROTEIN  coord: 106..336, score: 1.2E-163; coord: 5..89, score: 1.2E
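
The `coord: start..end` entries in the InterPro alignment column give 1-based, inclusive residue ranges on the protein, so the length of each domain hit is `end - start + 1`. The following is a minimal sketch of that arithmetic, assuming the alignment text is available as a plain string; the helper name `domain_spans` is hypothetical and only illustrates the calculation.

```python
import re

# Matches entries such as "coord: 9..74" in the alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(text):
    """Return (start, end, length) for every coord entry found in text."""
    spans = []
    for start, end in COORD.findall(text):
        start, end = int(start), int(end)
        # Coordinates are inclusive on both ends.
        spans.append((start, end, end - start + 1))
    return spans


# The two PFAM RRM_1 hits listed for IPR000504.
for start, end, length in domain_spans("coord: 9..74 coord: 109..167"):
    print(start, end, length)
```

Applied to the PFAM RRM_1 row, this gives two RNA recognition motif hits of 66 and 59 residues.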

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG09g06540  Cp4.1LG20g06600  Cucurbita pepo (Zucchini)  cpecpeB046
The following block(s) are covering this gene:
Gene             Organism                   Block
Cp4.1LG09g06540  Cucurbita pepo (Zucchini)  cpecpeB048
Cp4.1LG09g06540  Cucurbita maxima (Rimu)    cmacpeB212
Cp4.1LG09g06540  Cucurbita moschata (Rifu)  cmocpeB182
Cp4.1LG09g06540  Melon (DHL92) v3.6.1       cpemedB033