Cp4.1LG01g01520 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g01520
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: RNA-binding family protein
Location: Cp4.1LG01 : 2957113 .. 2958420 (-)
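
As a quick sanity check on the Location field, the span of the feature can be derived from its coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated on this page; the function name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1

# Coordinates copied from the Location field: Cp4.1LG01 : 2957113 .. 2958420 (-)
gene_span = feature_length(2957113, 2958420)
print(gene_span)  # 1308
```

The strand sign `(-)` does not affect the span; it only indicates that the sequences listed for this feature are reported relative to the reverse strand of the chromosome.
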
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACGCCACCAAGAAGCGGAGGATGGAAGAAAATGGCGTCGACTCCTCCGAAAATTCTTTCTTAAAAATAACCTCCGATGATGCTCGTAAGATAATTGATCGTTTCACTCCGGACCAGCTTATCGATATTCTCCAAGATGCTGTTTCGCGTCACACGGATGTCCTTGATGCGGTTCGATCTATTGCCGACCGAGACGTCTCCCAGCGGAAGCTGTTTATTCGGGGTCTTAGTTGCGATACAAGTACAGAAGGTTTGCGCTCTCTCTTTTCCGCGTATGGGGAGCTTGAAGAAGCGGTTGTTATTATTGACAAGGCGACCGGGAAATCGAAAGGGTATGGCTTTGTGACTTTCAAGCATGTTGATGGGGCTATACTGGCTTTGAAGGAACCGAGCAAGACGATTGATGGCCGTGTTACGGTAACGCAATTGGCTGCAGTGGGAATTTCCGGGCAGAATTCGAACGCTGCGGACATGTCGTTGAGGAAAATTTATGTGGCGAATGTTCCGATGGATATGCCGGCCGATAAACTGTTGGCACACTTTTCGCTGTATGGAGAAATTGAGGAGGGACCGCTAGGATTTGATAAGCAGACGGGTAAGTGCAGAGGTTATGCTTTATTTGTGTACAAGAAGCCAGAAGGGGCACAGGCAGCTCTGGTGGATCCAATCAAGACGATTGATGGAAGGCAGTTGAGTTGTAAATTCGCTAACGATGGGAAGAAAGGGAAACCTGGCGGTGGGCAGGATGGAATTCAAACGCCGGGGGCCGGTCAAGGGAATTTGCATGGAGATGGCATGCCTATGGCTCCTCCTTCGGCAATGCCTGGTTCAGGCGGACAATATGGTGGGCCGGGAGGCATGGGGTCGTATGGAGGCTTTTCTAGTGGCGTACAAGGGGGACAGCCACTGGGTCATCATCCGCTGAACTCCTCAATGGGGCCGGGTCTATCTTCTGTCGGTGGTCAGGCTCCATCTTCGTTGGGAAGCTCTGGCGGATATGGCGGCGGTCCATATGGTGGTGGGTACGGCGGTCCCGCGTCCTCAATTTATGGTGGTATGGGTAGTGTAGGTGGTGGTTTGGGTGGTTCTGCTGGTGGATTGGGCGGAACCGGGGGGCCATCATCTCTGTATAGGTTGCCACAGAGTTCGGTGGGAATGCCTTCGGGTGGTTATCAGGATAGTGGGCATTATAGCATGTCATCAGCATCTGGGCACCCAAATCCGCTCCATCAACAGGCGGTAACATCACCGGCGCCACGTGTTCCGCCTGGAGGAATGTATCCCAGTGTGCCACCTTATTACTGA

mRNA sequence

ATGGACGCCACCAAGAAGCGGAGGATGGAAGAAAATGGCGTCGACTCCTCCGAAAATTCTTTCTTAAAAATAACCTCCGATGATGCTCGTAAGATAATTGATCGTTTCACTCCGGACCAGCTTATCGATATTCTCCAAGATGCTGTTTCGCGTCACACGGATGTCCTTGATGCGGTTCGATCTATTGCCGACCGAGACGTCTCCCAGCGGAAGCTGTTTATTCGGGGTCTTAGTTGCGATACAAGTACAGAAGGTTTGCGCTCTCTCTTTTCCGCGTATGGGGAGCTTGAAGAAGCGGTTGTTATTATTGACAAGGCGACCGGGAAATCGAAAGGGTATGGCTTTGTGACTTTCAAGCATGTTGATGGGGCTATACTGGCTTTGAAGGAACCGAGCAAGACGATTGATGGCCGTGTTACGGTAACGCAATTGGCTGCAGTGGGAATTTCCGGGCAGAATTCGAACGCTGCGGACATGTCGTTGAGGAAAATTTATGTGGCGAATGTTCCGATGGATATGCCGGCCGATAAACTGTTGGCACACTTTTCGCTGTATGGAGAAATTGAGGAGGGACCGCTAGGATTTGATAAGCAGACGGGTAAGTGCAGAGGTTATGCTTTATTTGTGTACAAGAAGCCAGAAGGGGCACAGGCAGCTCTGGTGGATCCAATCAAGACGATTGATGGAAGGCAGTTGAGTTGTAAATTCGCTAACGATGGGAAGAAAGGGAAACCTGGCGGTGGGCAGGATGGAATTCAAACGCCGGGGGCCGGTCAAGGGAATTTGCATGGAGATGGCATGCCTATGGCTCCTCCTTCGGCAATGCCTGGTTCAGGCGGACAATATGGTGGTGGTTTGGGTGGTTCTGCTGGTGGATTGGGCGGAACCGGGGGGCCATCATCTCTGTATAGGTTGCCACAGAGTTCGGTGGGAATGCCTTCGGGTGGTTATCAGGATAGTGGGCATTATAGCATGTCATCAGCATCTGGGCACCCAAATCCGCTCCATCAACAGGCGGTAACATCACCGGCGCCACGTGTTCCGCCTGGAGGAATGTATCCCAGTGTGCCACCTTATTACTGA

Coding sequence (CDS)

ATGGACGCCACCAAGAAGCGGAGGATGGAAGAAAATGGCGTCGACTCCTCCGAAAATTCTTTCTTAAAAATAACCTCCGATGATGCTCGTAAGATAATTGATCGTTTCACTCCGGACCAGCTTATCGATATTCTCCAAGATGCTGTTTCGCGTCACACGGATGTCCTTGATGCGGTTCGATCTATTGCCGACCGAGACGTCTCCCAGCGGAAGCTGTTTATTCGGGGTCTTAGTTGCGATACAAGTACAGAAGGTTTGCGCTCTCTCTTTTCCGCGTATGGGGAGCTTGAAGAAGCGGTTGTTATTATTGACAAGGCGACCGGGAAATCGAAAGGGTATGGCTTTGTGACTTTCAAGCATGTTGATGGGGCTATACTGGCTTTGAAGGAACCGAGCAAGACGATTGATGGCCGTGTTACGGTAACGCAATTGGCTGCAGTGGGAATTTCCGGGCAGAATTCGAACGCTGCGGACATGTCGTTGAGGAAAATTTATGTGGCGAATGTTCCGATGGATATGCCGGCCGATAAACTGTTGGCACACTTTTCGCTGTATGGAGAAATTGAGGAGGGACCGCTAGGATTTGATAAGCAGACGGGTAAGTGCAGAGGTTATGCTTTATTTGTGTACAAGAAGCCAGAAGGGGCACAGGCAGCTCTGGTGGATCCAATCAAGACGATTGATGGAAGGCAGTTGAGTTGTAAATTCGCTAACGATGGGAAGAAAGGGAAACCTGGCGGTGGGCAGGATGGAATTCAAACGCCGGGGGCCGGTCAAGGGAATTTGCATGGAGATGGCATGCCTATGGCTCCTCCTTCGGCAATGCCTGGTTCAGGCGGACAATATGGTGGTGGTTTGGGTGGTTCTGCTGGTGGATTGGGCGGAACCGGGGGGCCATCATCTCTGTATAGGTTGCCACAGAGTTCGGTGGGAATGCCTTCGGGTGGTTATCAGGATAGTGGGCATTATAGCATGTCATCAGCATCTGGGCACCCAAATCCGCTCCATCAACAGGCGGTAACATCACCGGCGCCACGTGTTCCGCCTGGAGGAATGTATCCCAGTGTGCCACCTTATTACTGA

Protein sequence

MDATKKRRMEENGVDSSENSFLKITSDDARKIIDRFTPDQLIDILQDAVSRHTDVLDAVRSIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTFKHVDGAILALKEPSKTIDGRVTVTQLAAVGISGQNSNAADMSLRKIYVANVPMDMPADKLLAHFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFANDGKKGKPGGGQDGIQTPGAGQGNLHGDGMPMAPPSAMPGSGGQYGGGLGGSAGGLGGTGGPSSLYRLPQSSVGMPSGGYQDSGHYSMSSASGHPNPLHQQAVTSPAPRVPPGGMYPSVPPYY
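
The protein sequence above corresponds to the CDS read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code; the helper name `translate` is illustrative, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGGACGCCACCAAGAAGCGGAGGATGGAA"))  # MDATKKRRME
print(translate("CCTTATTACTGA"))                    # PYY
```

Both fragments agree with the start (`MDATKKRRME`) and end (`...PPYY`) of the protein sequence shown on this page.
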
BLAST of Cp4.1LG01g01520 vs. Swiss-Prot
Match: UBA2C_ARATH (UBP1-associated protein 2C OS=Arabidopsis thaliana GN=UBA2C PE=2 SV=1)

HSP 1 Score: 407.9 bits (1047), Expect = 1.1e-112
Identity = 234/408 (57.35%), Positives = 277/408 (67.89%), Query Frame = 1

Query: 1   MDATKKRRMEENGVDSSENSF-----LKITSDDARKIIDRFTPDQLIDILQDAVSRHTDV 60
           MD  KKR+++ENG   + N        +++  DARKII+RFT DQL+D+LQ+A+ RH DV
Sbjct: 1   MDMMKKRKLDENGNGLNTNGGGTIGPTRLSPQDARKIIERFTTDQLLDLLQEAIVRHPDV 60

Query: 61  LDAVRSIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGF 120
           L++VR  AD D+SQRKLFIRGL+ DT+TEGLRSLFS+YG+LEEA+VI+DK TGKSKGYGF
Sbjct: 61  LESVRLTADSDISQRKLFIRGLAADTTTEGLRSLFSSYGDLEEAIVILDKVTGKSKGYGF 120

Query: 121 VTFKHVDGAILALKEPSKTIDGRVTVTQLAAVGISGQNSNAADMSLRKIYVANVPMDMPA 180
           VTF HVDGA+LALKEPSK IDGRVTVTQLAA G  G  S  AD+S+RKIYVANVP DMPA
Sbjct: 121 VTFMHVDGALLALKEPSKKIDGRVTVTQLAASGNQGTGSQIADISMRKIYVANVPFDMPA 180

Query: 181 DKLLAHFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCK 240
           D+LL HF  YG++EEGPLGFDK TGK RG+ALFVYK  EGAQAAL DP+K IDG+ L+CK
Sbjct: 181 DRLLNHFMAYGDVEEGPLGFDKVTGKSRGFALFVYKTAEGAQAALADPVKVIDGKHLNCK 240

Query: 241 FANDGKKG-KPGGGQDGIQTPGAGQGNLHGDGMPMAPPSA-------------------- 300
            A DGKKG KPG  Q   Q  G+G G++HG+GM M  P+                     
Sbjct: 241 LAVDGKKGGKPGMPQ--AQDGGSGHGHVHGEGMGMVRPAGPYGAAGGISAYGGYSGGPPA 300

Query: 301 -------------MPGSGGQYGGG--------LGGSAGGLGGTGGPSSLYRLPQSSVGMP 360
                          G GG YGG          GG  GG GG G  S  YR+P SS  MP
Sbjct: 301 HHMNSTHSSMGVGSAGYGGHYGGYGGPGGTGVYGGLGGGYGGPGTGSGQYRMPPSS--MP 360
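
The percentages in an HSP summary line are the match counts divided by the alignment length, which includes gap columns. A small sketch that recomputes them from the counts is shown here; the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

def hsp_percentages(stats_line: str) -> dict:
    """Recompute percentages from 'Label = matches/length' pairs in an HSP summary line."""
    result = {}
    for label, hits, length in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(hits) / int(length), 2)
    return result

# Counts taken from the UBA2C_ARATH hit above.
line = "Identity = 234/408 (57.35%), Positives = 277/408 (67.89%), Query Frame = 1"
print(hsp_percentages(line))  # {'Identity': 57.35, 'Positives': 67.89}
```

The `Query Frame = 1` field is skipped because it has no `matches/length` pair.
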

BLAST of Cp4.1LG01g01520 vs. Swiss-Prot
Match: UBA2B_ARATH (UBP1-associated protein 2B OS=Arabidopsis thaliana GN=UBA2B PE=2 SV=1)

HSP 1 Score: 198.4 bits (503), Expect = 1.4e-49
Identity = 101/219 (46.12%), Positives = 140/219 (63.93%), Query Frame = 1

Query: 32  IIDRFTPDQLIDILQDAVSRHTDVLDAVRSIADRDVSQRKLFIRGLSCDTSTEGLRSLFS 91
           +++ F+ DQL+ +L++A  RH DV + +R +AD D+  RK+F+ GL  DT  + L   F 
Sbjct: 90  LLEPFSKDQLLILLKEAAERHRDVANRIRIVADEDLVHRKIFVHGLGWDTKADSLIDAFK 149

Query: 92  AYGELEEAVVIIDKATGKSKGYGFVTFKHVDGAILALKEPSKTIDGRVTVTQLAAVG--- 151
            YGE+E+   ++DK +G+SKGYGF+ FK   GA  ALK+P K I  R+T  QLA++G   
Sbjct: 150 QYGEIEDCKCVVDKVSGQSKGYGFILFKSRSGARNALKQPQKKIGTRMTACQLASIGPVQ 209

Query: 152 -----ISGQNSNAADMSLRKIYVANVPMDMPADKLLAHFSLYGEIEEGPLGFDKQTGKCR 211
                   Q+ N  ++  RKIYV+NV  D+   KLL  FS +GEIEEGPLG DK TG+ +
Sbjct: 210 GNPVVAPAQHFNPENVQ-RKIYVSNVSADIDPQKLLEFFSRFGEIEEGPLGLDKATGRPK 269

Query: 212 GYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFANDGKK 243
           G+ALFVY+  E A+ AL +P KT +G  L C  ANDG K
Sbjct: 270 GFALFVYRSLESAKKALEEPHKTFEGHVLHCHKANDGPK 307

BLAST of Cp4.1LG01g01520 vs. Swiss-Prot
Match: UBA2A_ARATH (UBP1-associated protein 2A OS=Arabidopsis thaliana GN=UBA2A PE=1 SV=1)

HSP 1 Score: 184.1 bits (466), Expect = 2.7e-45
Identity = 122/313 (38.98%), Positives = 167/313 (53.35%), Query Frame = 1

Query: 2   DATKKRRMEENGVDSSENSFLKITSDDARKI---IDRFTPDQLIDILQDAVSRHTDVLDA 61
           D T   R+E      S N       DD   I   ++ F+ +Q++ +L++A  +H DV + 
Sbjct: 73  DQTDGNRIEAAATSGSGNQ----EDDDDEPIQDLLEPFSKEQVLSLLKEAAEKHVDVANR 132

Query: 62  VRSIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTF 121
           +R +AD D   RK+F+ GL  DT TE L   F  YGE+E+   + DK +GKSKGYGF+ +
Sbjct: 133 IREVADEDPVHRKIFVHGLGWDTKTETLIEAFKQYGEIEDCKAVFDKISGKSKGYGFILY 192

Query: 122 KHVDGAILALKEPSKTIDGRVTVTQL--------------AAVGISGQNSNAADMSLRKI 181
           K   GA  ALK+P K I  R+T  QL              AAV    Q+SN ++ + +KI
Sbjct: 193 KSRSGARNALKQPQKKIGSRMTACQLASKGPVFGGAPIAAAAVSAPAQHSN-SEHTQKKI 252

Query: 182 YVANVPMDMPADKLLAHFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPI 241
           YV+NV  ++   KLL  FS +GEIEEGPLG DK TG+ +G+ LFVYK  E A+ AL +P 
Sbjct: 253 YVSNVGAELDPQKLLMFFSKFGEIEEGPLGLDKYTGRPKGFCLFVYKSSESAKRALEEPH 312

Query: 242 KTIDGRQLSCKFANDGKKGKPGGGQDGIQTPGAGQGNLH--GDGMPMAPPSAMPGSGGQY 296
           KT +G  L C+ A DG   KPG  Q     P A     +   D     PP       G +
Sbjct: 313 KTFEGHILHCQKAIDGP--KPGKQQQHHHNPHAYNNPRYQRNDNNGYGPP-------GGH 371

BLAST of Cp4.1LG01g01520 vs. Swiss-Prot
Match: UBA1A_ARATH (UBP1-associated proteins 1A OS=Arabidopsis thaliana GN=UBA1A PE=1 SV=2)

HSP 1 Score: 110.9 bits (276), Expect = 2.9e-23
Identity = 59/132 (44.70%), Positives = 84/132 (63.64%), Query Frame = 1

Query: 17  SENSFLKITSDDARKIIDRFTPDQLIDILQDAVSRHTDVLDAVRSIADRDVSQRKLFIRG 76
           S+N F     ++ R+++  ++ DQL+D++  A    + +  AV   ADRDV+ RK+F+ G
Sbjct: 54  SDNEF---DPEELRELLQPYSKDQLVDLVCSASRIGSSIYSAVVEAADRDVTHRKIFVYG 113

Query: 77  LSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTFKHVDGAILALKEPSKTID 136
           L  +T+ E L  +F  YGE+EE  V+IDKATGK+KG+GFV FK   GA  ALKEP K I 
Sbjct: 114 LPWETTRETLVGVFEGYGEIEECTVVIDKATGKAKGFGFVMFKTRKGAKEALKEPKKRIL 173

Query: 137 GRVTVTQLAAVG 149
            R    QLA++G
Sbjct: 174 NRTATCQLASMG 182

BLAST of Cp4.1LG01g01520 vs. Swiss-Prot
Match: ROA0_HUMAN (Heterogeneous nuclear ribonucleoprotein A0 OS=Homo sapiens GN=HNRNPA0 PE=1 SV=1)

HSP 1 Score: 106.3 bits (264), Expect = 7.1e-22
Identity = 89/267 (33.33%), Positives = 127/267 (47.57%), Query Frame = 1

Query: 71  KLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTFKHVDGAILALKE 130
           KLFI GL+  TS  GLR  F A+G L + VV+++  T +S+ +GFVT+ +V+ A  A+  
Sbjct: 8   KLFIGGLNVQTSESGLRGHFEAFGTLTDCVVVVNPQTKRSRCFGFVTYSNVEEADAAMAA 67

Query: 131 PSKTIDGRVTVTQLAAVGISGQNSNAADMSLRKIYVANVPMDMPADKLLAHFSLYGEIEE 190
               +DG  TV    AV         A   ++K++V  +  D+    L+ HFS +G +E+
Sbjct: 68  SPHAVDGN-TVELKRAVSREDSARPGAHAKVKKLFVGGLKGDVAEGDLIEHFSQFGTVEK 127

Query: 191 GPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFANDGKKGKPGGGQD 250
             +  DKQ+GK RG+    ++  + A  A V     I G ++  K A   +    GGG  
Sbjct: 128 AEIIADKQSGKKRGFGFVYFQNHDAADKAAVVKFHPIQGHRVEVKKAVPKEDIYSGGGGG 187

Query: 251 GIQTPGAGQGNL-HGDGMPMAPPSAMPGSG----GQYGGGLGGSAGGLGGTGGPSSLYRL 310
           G ++   G+G    G G      S   G G    G YGGG GG     GG GG SS    
Sbjct: 188 GSRSSRGGRGGRGRGGGRDQNGLSKGGGGGYNSYGGYGGGGGGGYNAYGGGGGGSS---Y 247

Query: 311 PQSSVGMPSGGYQDSGHYSMSSASGHP 333
             S  G   GG+   G YS   +S  P
Sbjct: 248 GGSDYGNGFGGF---GSYSQHQSSYGP 267

BLAST of Cp4.1LG01g01520 vs. TrEMBL
Match: A0A0A0KYY6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G083660 PE=4 SV=1)

HSP 1 Score: 528.1 bits (1359), Expect = 8.4e-147
Identity = 282/334 (84.43%), Positives = 295/334 (88.32%), Query Frame = 1

Query: 1   MDATKKRRMEENGVDSSENSFLKITSDDARKIIDRFTPDQLIDILQDAVSRHTDVLDAVR 60
           MD TKKRRM+ENGVDSSE+SF +IT +DARKIIDRFTPDQLIDILQDAVSRH DVLDAVR
Sbjct: 1   MDVTKKRRMDENGVDSSESSFSRITPEDARKIIDRFTPDQLIDILQDAVSRHLDVLDAVR 60

Query: 61  SIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTFKH 120
           SIADRDVSQRKLFIRGLSCDTSTEGLRSLFS+YGELEEAVVIIDKATGKSKGYGFVTFKH
Sbjct: 61  SIADRDVSQRKLFIRGLSCDTSTEGLRSLFSSYGELEEAVVIIDKATGKSKGYGFVTFKH 120

Query: 121 VDGAILALKEPSKTIDGRVTVTQLAAVGISGQNSNAADMSLRKIYVANVPMDMPADKLLA 180
           VDGA+LALKEPSKTIDGRVTVTQLAAVGISGQNSNAAD+SLRKIYVANVPMDMPADKLLA
Sbjct: 121 VDGALLALKEPSKTIDGRVTVTQLAAVGISGQNSNAADLSLRKIYVANVPMDMPADKLLA 180

Query: 181 HFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFANDG 240
           HFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFANDG
Sbjct: 181 HFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFANDG 240

Query: 241 KKGKPGGGQDGIQTPGAGQGNLHGDGMPMAPPSAMPGSGGQYG--GGLGGSAGGLGGTGG 300
           KKGKPGGG DG QT GAGQGN+HGDGMPMAPPSAMPGSGGQYG  GG+G   G   G  G
Sbjct: 241 KKGKPGGGPDGNQTQGAGQGNVHGDGMPMAPPSAMPGSGGQYGGPGGMGSYGGFSSGLQG 300

Query: 301 PSSLYRLP-QSSVGMPSGGYQDSGHYSMSSASGH 332
              L   P  SS+G            S+ S+ G+
Sbjct: 301 AQPLAHHPLNSSMGPGLSSVGGQAPSSLGSSGGY 334

BLAST of Cp4.1LG01g01520 vs. TrEMBL
Match: A0A151SWS3_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_014679 PE=4 SV=1)

HSP 1 Score: 436.4 bits (1121), Expect = 3.3e-119
Identity = 254/384 (66.15%), Positives = 295/384 (76.82%), Query Frame = 1

Query: 1   MDATKKRRMEENGVDSSENSFLKITSDDARKIIDRFTPDQLIDILQDAVSRHTDVLDAVR 60
           MD TKKR+++ENG + S +  LK++  +ARK+IDRF+PDQL+DILQDAVSRH DVL AVR
Sbjct: 1   MDLTKKRKIDENGFNDSSDP-LKLSPSEARKLIDRFSPDQLLDILQDAVSRHPDVLAAVR 60

Query: 61  SIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTFKH 120
           + AD DVSQRKLFIRGL  DT+T+GLRSLFS +G+LEEAVVI+DKATGKSKGYGFVTF+H
Sbjct: 61  AAADPDVSQRKLFIRGLGWDTTTDGLRSLFSTFGDLEEAVVILDKATGKSKGYGFVTFRH 120

Query: 121 VDGAILALKEPSKTIDGRVTVTQLAAVGISGQNSNAADMSLRKIYVANVPMDMPADKLLA 180
           VDGA+LAL+EPSK IDGRVTVTQLAA G S  N+NAAD++LRKIYVANVP D+PADKLLA
Sbjct: 121 VDGALLALREPSKRIDGRVTVTQLAAAGNSASNANAADVALRKIYVANVPPDLPADKLLA 180

Query: 181 HFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFA-ND 240
           HFS+YGEIEEGPLGFDKQTGK +G+ALFVYK PEGAQAAL++P+KT++GRQLSCK A  D
Sbjct: 181 HFSVYGEIEEGPLGFDKQTGKSKGFALFVYKTPEGAQAALLEPVKTVEGRQLSCKLAITD 240

Query: 241 GKKGKPGGGQDGIQTPGA---GQGNLHGDGMPMA---PPSAMPGSG--------GQYGG- 300
           GK+GK  G  DG Q  G    G G+  G GM M    PP+A  G G        G YGG 
Sbjct: 241 GKQGKRAG-PDGPQAHGNVQHGHGDGVGAGMGMGMGMPPNAGSGPGQYGPPVGVGSYGGF 300

Query: 301 --GLGGSAGGL--GGTG-GPSSLYRLPQSSVGMPSGG--YQDSG-HYSMSSASGHPNPLH 360
             G GG AGG   GG G G  SLYR P S  GMP GG  Y DSG HYS+S++ G+ N  H
Sbjct: 301 GVGAGGGAGGGIGGGAGAGGGSLYRFPGSG-GMPGGGGGYPDSGGHYSLSASGGYQNQHH 360

BLAST of Cp4.1LG01g01520 vs. TrEMBL
Match: B9GGV7_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s38640g PE=4 SV=2)

HSP 1 Score: 422.2 bits (1084), Expect = 6.5e-115
Identity = 241/378 (63.76%), Positives = 286/378 (75.66%), Query Frame = 1

Query: 1   MDATKKRRMEENGVDSSE---NSFLKITSDDARKIIDRFTPDQLIDILQDAVSRHTDVLD 60
           MD TKKR++EENG+ SS    +S  K+T  DARK+++RFTPDQL+DILQ+AV RH D+L+
Sbjct: 3   MDPTKKRKLEENGIVSSTTDLDSPYKLTPQDARKMMERFTPDQLLDILQNAVVRHPDILE 62

Query: 61  AVRSIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVT 120
           AVRSIAD D +QRKLFIRGL  +T+TE LR+LFS YGELEEAVVI+DK TGKSKGYGFV 
Sbjct: 63  AVRSIADPDATQRKLFIRGLGWETTTENLRNLFSTYGELEEAVVILDKNTGKSKGYGFVI 122

Query: 121 FKHVDGAILALKEPSKTIDGRVTVTQLAAVGISGQNSN--------AADMSLRKIYVANV 180
           +KHVDGA+LALKEPSK IDGRVTVTQLA  G SG N+N          D+++RKIYVANV
Sbjct: 123 YKHVDGALLALKEPSKKIDGRVTVTQLAIAGNSGANNNNNSSANPGVVDVAMRKIYVANV 182

Query: 181 PMDMPADKLLAHFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDG 240
           P +MP+DKLL HF+ YGEIEEGPLGFDKQTGK +G+ALFVYK  EGAQAAL++P+K I+G
Sbjct: 183 PYEMPSDKLLNHFAQYGEIEEGPLGFDKQTGKSKGFALFVYKTAEGAQAALLEPVKMIEG 242

Query: 241 RQLSCKFANDGKKGK-PGG----GQDGIQTPGAGQGNLHG-DGMPMAPPSAMPGSGGQYG 300
           RQL+CK A DGK+G+ PGG    GQDG+Q  G G   L G  G    PP    G  G  G
Sbjct: 243 RQLNCKLAIDGKRGRQPGGGQGPGQDGLQ--GQGVVGLSGTGGGSYGPPYGGYGGPGSTG 302

Query: 301 -GGLGGSAGGLGGTGGPSSLYRLPQSSVGMPSGGYQDSGHYSMSSASGHPNPLHQQAVTS 360
            GGLGG   G+G + G SS +RLP SSVGMP+GGY D G YS+SS++      HQ A  S
Sbjct: 303 YGGLGGGGAGVGASVGASSSFRLPPSSVGMPTGGYPDPGQYSLSSSNASFPSQHQGA--S 362

BLAST of Cp4.1LG01g01520 vs. TrEMBL
Match: M5WGI4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005857mg PE=4 SV=1)

HSP 1 Score: 414.1 bits (1063), Expect = 1.8e-112
Identity = 228/325 (70.15%), Positives = 260/325 (80.00%), Query Frame = 1

Query: 1   MDATKKRRMEENGV--DSSENSFLKITSDDARKIIDRFTPDQLIDILQDAVSRHTDVLDA 60
           MD +KKR+++ENGV  D+  +S  K++ +DARK+I+RF PDQLIDILQDAV+RH DVLDA
Sbjct: 1   MDPSKKRKLDENGVVLDTDPSSIPKLSPEDARKLIERFNPDQLIDILQDAVTRHVDVLDA 60

Query: 61  VRSIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTF 120
           VRSIAD D SQRKLFIRGL  DT+TEGLR+LFSAYGELEEA+VI+DK TGKS+GYGFVTF
Sbjct: 61  VRSIADLDASQRKLFIRGLGWDTTTEGLRALFSAYGELEEAIVILDKVTGKSRGYGFVTF 120

Query: 121 KHVDGAILALKEPSKTIDGRVTVTQLAAVGISGQN---SNAADMSLRKIYVANVPMDMPA 180
           +HVDGA+LALKEPSK IDGR+TVTQLAA G S  N   +NAAD+SLRKIYVANVP DMPA
Sbjct: 121 RHVDGALLALKEPSKKIDGRMTVTQLAAAGNSSSNVSSNNAADVSLRKIYVANVPYDMPA 180

Query: 181 DKLLAHFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCK 240
           DKLLAHFS YGEIEEGPLGFDKQTGKC+GYALFVYK PEGAQAALVDP+K I+GRQL+CK
Sbjct: 181 DKLLAHFSFYGEIEEGPLGFDKQTGKCKGYALFVYKTPEGAQAALVDPVKNIEGRQLTCK 240

Query: 241 FANDGKKGKPGGGQDGIQTPGAGQGNLHGDGMPMAPPSAMPGSGGQYG--GGLGGSAGGL 300
            A DGKKGKP G   G Q PG+G  N HGDGM +A PS++    GQYG  GG+G   G  
Sbjct: 241 LAIDGKKGKPDGPGQG-QGPGSGP-NAHGDGMGLAQPSSI---AGQYGGPGGIGSYGGFS 300

Query: 301 GGTGGPSSLYRLPQSSVGMPSGGYQ 319
           GG  GP  L   P    G+ S G Q
Sbjct: 301 GGHQGPPPLGHHPLGGPGLSSVGNQ 320

BLAST of Cp4.1LG01g01520 vs. TrEMBL
Match: D7L3Y0_ARALL (Putative uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_478927 PE=4 SV=1)

HSP 1 Score: 413.3 bits (1061), Expect = 3.0e-112
Identity = 235/406 (57.88%), Positives = 274/406 (67.49%), Query Frame = 1

Query: 1   MDATKKRRMEENGVDSSENSF-----LKITSDDARKIIDRFTPDQLIDILQDAVSRHTDV 60
           MD  KKR+++ENG     N        ++T  DARKII+RFT DQL+D+LQ+A+ RH DV
Sbjct: 1   MDMMKKRKLDENGGGLITNGGGIIGPTRLTPQDARKIIERFTTDQLLDLLQEAIVRHPDV 60

Query: 61  LDAVRSIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGF 120
           LD+VRS AD D+SQRKLFIRGL+ DT+TEGL SLFS YG+LEEA+VI+DK TGKSKGYGF
Sbjct: 61  LDSVRSTADSDISQRKLFIRGLAADTTTEGLLSLFSNYGDLEEAIVILDKVTGKSKGYGF 120

Query: 121 VTFKHVDGAILALKEPSKTIDGRVTVTQLAAVGISGQNSNAADMSLRKIYVANVPMDMPA 180
           VTF HVDGA+LALKEPSK IDGRVTVTQLAA G  G  S  AD+S+RKIYVANVP DMPA
Sbjct: 121 VTFMHVDGALLALKEPSKKIDGRVTVTQLAASGNQGTGSQIADISMRKIYVANVPFDMPA 180

Query: 181 DKLLAHFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCK 240
           D+LL HF  YG++EEGPLGFDK TGK RG+ALFVYK  EGAQ AL DP+K IDG+ L+CK
Sbjct: 181 DRLLNHFMAYGDVEEGPLGFDKVTGKSRGFALFVYKTAEGAQTALADPVKVIDGKHLNCK 240

Query: 241 FANDGKKGKPGGGQDGIQTPGAGQGNLHGDGMPM-------------------------- 300
            A DGKKG    G    Q  G+G G++HGD M M                          
Sbjct: 241 LAVDGKKGGGKPGMPQAQDGGSGHGHVHGDVMGMVRPAGPYGAAGGMSAYGGYSGGPPPH 300

Query: 301 ---APPSAMP----GSGGQYGG--------GLGGSAGGLGGTGGPSSLYRLPQSSVGMPS 360
              + PS+M     G GG YGG        G GG   G GG GG S  YR+P SS  MP 
Sbjct: 301 HMNSTPSSMGVGTGGYGGHYGGYGGPGGTGGYGGLGSGYGGPGGGSGPYRMPPSS--MPG 360

BLAST of Cp4.1LG01g01520 vs. TAIR10
Match: AT3G15010.1 (AT3G15010.1 RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 407.9 bits (1047), Expect = 6.4e-114
Identity = 234/408 (57.35%), Positives = 277/408 (67.89%), Query Frame = 1

Query: 1   MDATKKRRMEENGVDSSENSF-----LKITSDDARKIIDRFTPDQLIDILQDAVSRHTDV 60
           MD  KKR+++ENG   + N        +++  DARKII+RFT DQL+D+LQ+A+ RH DV
Sbjct: 1   MDMMKKRKLDENGNGLNTNGGGTIGPTRLSPQDARKIIERFTTDQLLDLLQEAIVRHPDV 60

Query: 61  LDAVRSIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGF 120
           L++VR  AD D+SQRKLFIRGL+ DT+TEGLRSLFS+YG+LEEA+VI+DK TGKSKGYGF
Sbjct: 61  LESVRLTADSDISQRKLFIRGLAADTTTEGLRSLFSSYGDLEEAIVILDKVTGKSKGYGF 120

Query: 121 VTFKHVDGAILALKEPSKTIDGRVTVTQLAAVGISGQNSNAADMSLRKIYVANVPMDMPA 180
           VTF HVDGA+LALKEPSK IDGRVTVTQLAA G  G  S  AD+S+RKIYVANVP DMPA
Sbjct: 121 VTFMHVDGALLALKEPSKKIDGRVTVTQLAASGNQGTGSQIADISMRKIYVANVPFDMPA 180

Query: 181 DKLLAHFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCK 240
           D+LL HF  YG++EEGPLGFDK TGK RG+ALFVYK  EGAQAAL DP+K IDG+ L+CK
Sbjct: 181 DRLLNHFMAYGDVEEGPLGFDKVTGKSRGFALFVYKTAEGAQAALADPVKVIDGKHLNCK 240

Query: 241 FANDGKKG-KPGGGQDGIQTPGAGQGNLHGDGMPMAPPSA-------------------- 300
            A DGKKG KPG  Q   Q  G+G G++HG+GM M  P+                     
Sbjct: 241 LAVDGKKGGKPGMPQ--AQDGGSGHGHVHGEGMGMVRPAGPYGAAGGISAYGGYSGGPPA 300

Query: 301 -------------MPGSGGQYGGG--------LGGSAGGLGGTGGPSSLYRLPQSSVGMP 360
                          G GG YGG          GG  GG GG G  S  YR+P SS  MP
Sbjct: 301 HHMNSTHSSMGVGSAGYGGHYGGYGGPGGTGVYGGLGGGYGGPGTGSGQYRMPPSS--MP 360

BLAST of Cp4.1LG01g01520 vs. TAIR10
Match: AT2G41060.1 (AT2G41060.1 RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 198.4 bits (503), Expect = 7.7e-51
Identity = 101/219 (46.12%), Positives = 140/219 (63.93%), Query Frame = 1

Query: 32  IIDRFTPDQLIDILQDAVSRHTDVLDAVRSIADRDVSQRKLFIRGLSCDTSTEGLRSLFS 91
           +++ F+ DQL+ +L++A  RH DV + +R +AD D+  RK+F+ GL  DT  + L   F 
Sbjct: 90  LLEPFSKDQLLILLKEAAERHRDVANRIRIVADEDLVHRKIFVHGLGWDTKADSLIDAFK 149

Query: 92  AYGELEEAVVIIDKATGKSKGYGFVTFKHVDGAILALKEPSKTIDGRVTVTQLAAVG--- 151
            YGE+E+   ++DK +G+SKGYGF+ FK   GA  ALK+P K I  R+T  QLA++G   
Sbjct: 150 QYGEIEDCKCVVDKVSGQSKGYGFILFKSRSGARNALKQPQKKIGTRMTACQLASIGPVQ 209

Query: 152 -----ISGQNSNAADMSLRKIYVANVPMDMPADKLLAHFSLYGEIEEGPLGFDKQTGKCR 211
                   Q+ N  ++  RKIYV+NV  D+   KLL  FS +GEIEEGPLG DK TG+ +
Sbjct: 210 GNPVVAPAQHFNPENVQ-RKIYVSNVSADIDPQKLLEFFSRFGEIEEGPLGLDKATGRPK 269

Query: 212 GYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFANDGKK 243
           G+ALFVY+  E A+ AL +P KT +G  L C  ANDG K
Sbjct: 270 GFALFVYRSLESAKKALEEPHKTFEGHVLHCHKANDGPK 307

BLAST of Cp4.1LG01g01520 vs. TAIR10
Match: AT3G56860.3 (AT3G56860.3 UBP1-associated protein 2A)

HSP 1 Score: 184.1 bits (466), Expect = 1.5e-46
Identity = 122/313 (38.98%), Positives = 167/313 (53.35%), Query Frame = 1

Query: 2   DATKKRRMEENGVDSSENSFLKITSDDARKI---IDRFTPDQLIDILQDAVSRHTDVLDA 61
           D T   R+E      S N       DD   I   ++ F+ +Q++ +L++A  +H DV + 
Sbjct: 73  DQTDGNRIEAAATSGSGNQ----EDDDDEPIQDLLEPFSKEQVLSLLKEAAEKHVDVANR 132

Query: 62  VRSIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTF 121
           +R +AD D   RK+F+ GL  DT TE L   F  YGE+E+   + DK +GKSKGYGF+ +
Sbjct: 133 IREVADEDPVHRKIFVHGLGWDTKTETLIEAFKQYGEIEDCKAVFDKISGKSKGYGFILY 192

Query: 122 KHVDGAILALKEPSKTIDGRVTVTQL--------------AAVGISGQNSNAADMSLRKI 181
           K   GA  ALK+P K I  R+T  QL              AAV    Q+SN ++ + +KI
Sbjct: 193 KSRSGARNALKQPQKKIGSRMTACQLASKGPVFGGAPIAAAAVSAPAQHSN-SEHTQKKI 252

Query: 182 YVANVPMDMPADKLLAHFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPI 241
           YV+NV  ++   KLL  FS +GEIEEGPLG DK TG+ +G+ LFVYK  E A+ AL +P 
Sbjct: 253 YVSNVGAELDPQKLLMFFSKFGEIEEGPLGLDKYTGRPKGFCLFVYKSSESAKRALEEPH 312

Query: 242 KTIDGRQLSCKFANDGKKGKPGGGQDGIQTPGAGQGNLH--GDGMPMAPPSAMPGSGGQY 296
           KT +G  L C+ A DG   KPG  Q     P A     +   D     PP       G +
Sbjct: 313 KTFEGHILHCQKAIDGP--KPGKQQQHHHNPHAYNNPRYQRNDNNGYGPP-------GGH 371

BLAST of Cp4.1LG01g01520 vs. TAIR10
Match: AT2G22090.2 (AT2G22090.2 RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 110.9 bits (276), Expect = 1.6e-24
Identity = 59/132 (44.70%), Positives = 84/132 (63.64%), Query Frame = 1

Query: 17  SENSFLKITSDDARKIIDRFTPDQLIDILQDAVSRHTDVLDAVRSIADRDVSQRKLFIRG 76
           S+N F     ++ R+++  ++ DQL+D++  A    + +  AV   ADRDV+ RK+F+ G
Sbjct: 54  SDNEF---DPEELRELLQPYSKDQLVDLVCSASRIGSSIYSAVVEAADRDVTHRKIFVYG 113

Query: 77  LSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTFKHVDGAILALKEPSKTID 136
           L  +T+ E L  +F  YGE+EE  V+IDKATGK+KG+GFV FK   GA  ALKEP K I 
Sbjct: 114 LPWETTRETLVGVFEGYGEIEECTVVIDKATGKAKGFGFVMFKTRKGAKEALKEPKKRIL 173

Query: 137 GRVTVTQLAAVG 149
            R    QLA++G
Sbjct: 174 NRTATCQLASMG 182

BLAST of Cp4.1LG01g01520 vs. TAIR10
Match: AT2G19380.1 (AT2G19380.1 RNA recognition motif (RRM)-containing protein)

HSP 1 Score: 106.3 bits (264), Expect = 4.0e-23
Identity = 55/159 (34.59%), Positives = 97/159 (61.01%), Query Frame = 1

Query: 2   DATKKRRMEENGVDSSENSFLKITSDDARKIIDRFTPDQLIDILQDAVSRHTDVLDAVRS 61
           D  KKR+ ++    S  +S  +   +D ++++  ++ ++L++++     + + ++ A+  
Sbjct: 342 DKEKKRKKDKKQTKS--DSDFEHDKEDIKQLLVAYSKEELVNLIYKTAEKGSRLISAILE 401

Query: 62  IADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTFKHV 121
            ADRD++QR +F+RG   DT+ E L++ F +YGE+EE  V++DK TG+ KGYGFV FK  
Sbjct: 402 SADRDIAQRNIFVRGFGWDTTQENLKTAFESYGEIEECSVVMDKDTGRGKGYGFVMFKTR 461

Query: 122 DGAILALKEPSKTIDGRVTVTQLAA--VGISGQNSNAAD 159
            GA  ALK P K +  R+ V  LA+   G +G+  + A+
Sbjct: 462 KGAREALKRPEKRMYNRIVVCNLASEKPGKAGKEQDMAE 498

BLAST of Cp4.1LG01g01520 vs. NCBI nr
Match: gi|449438887|ref|XP_004137219.1| (PREDICTED: UBP1-associated protein 2C [Cucumis sativus])

HSP 1 Score: 528.1 bits (1359), Expect = 1.2e-146
Identity = 282/334 (84.43%), Positives = 295/334 (88.32%), Query Frame = 1

Query: 1   MDATKKRRMEENGVDSSENSFLKITSDDARKIIDRFTPDQLIDILQDAVSRHTDVLDAVR 60
           MD TKKRRM+ENGVDSSE+SF +IT +DARKIIDRFTPDQLIDILQDAVSRH DVLDAVR
Sbjct: 1   MDVTKKRRMDENGVDSSESSFSRITPEDARKIIDRFTPDQLIDILQDAVSRHLDVLDAVR 60

Query: 61  SIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTFKH 120
           SIADRDVSQRKLFIRGLSCDTSTEGLRSLFS+YGELEEAVVIIDKATGKSKGYGFVTFKH
Sbjct: 61  SIADRDVSQRKLFIRGLSCDTSTEGLRSLFSSYGELEEAVVIIDKATGKSKGYGFVTFKH 120

Query: 121 VDGAILALKEPSKTIDGRVTVTQLAAVGISGQNSNAADMSLRKIYVANVPMDMPADKLLA 180
           VDGA+LALKEPSKTIDGRVTVTQLAAVGISGQNSNAAD+SLRKIYVANVPMDMPADKLLA
Sbjct: 121 VDGALLALKEPSKTIDGRVTVTQLAAVGISGQNSNAADLSLRKIYVANVPMDMPADKLLA 180

Query: 181 HFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFANDG 240
           HFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFANDG
Sbjct: 181 HFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFANDG 240

Query: 241 KKGKPGGGQDGIQTPGAGQGNLHGDGMPMAPPSAMPGSGGQYG--GGLGGSAGGLGGTGG 300
           KKGKPGGG DG QT GAGQGN+HGDGMPMAPPSAMPGSGGQYG  GG+G   G   G  G
Sbjct: 241 KKGKPGGGPDGNQTQGAGQGNVHGDGMPMAPPSAMPGSGGQYGGPGGMGSYGGFSSGLQG 300

Query: 301 PSSLYRLP-QSSVGMPSGGYQDSGHYSMSSASGH 332
              L   P  SS+G            S+ S+ G+
Sbjct: 301 AQPLAHHPLNSSMGPGLSSVGGQAPSSLGSSGGY 334

BLAST of Cp4.1LG01g01520 vs. NCBI nr
Match: gi|659101647|ref|XP_008451717.1| (PREDICTED: UBP1-associated protein 2C [Cucumis melo])

HSP 1 Score: 527.7 bits (1358), Expect = 1.6e-146
Identity = 282/334 (84.43%), Positives = 295/334 (88.32%), Query Frame = 1

Query: 1   MDATKKRRMEENGVDSSENSFLKITSDDARKIIDRFTPDQLIDILQDAVSRHTDVLDAVR 60
           MD TKKRRM+ENGVDSSENSF +IT +DARKIIDRFTPDQLIDILQDAVSRH DVL+AVR
Sbjct: 1   MDVTKKRRMDENGVDSSENSFSRITPEDARKIIDRFTPDQLIDILQDAVSRHLDVLEAVR 60

Query: 61  SIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTFKH 120
           SIADRDVSQRKLFIRGLSCDTSTEGLRSLFS+YGELEEAVVIIDKATGKSKGYGFVTFKH
Sbjct: 61  SIADRDVSQRKLFIRGLSCDTSTEGLRSLFSSYGELEEAVVIIDKATGKSKGYGFVTFKH 120

Query: 121 VDGAILALKEPSKTIDGRVTVTQLAAVGISGQNSNAADMSLRKIYVANVPMDMPADKLLA 180
           VDGA+LALKEPSKTIDGRVTVTQLAAVGISGQNSNAAD+SLRKIYVANVPMDMPADKLLA
Sbjct: 121 VDGALLALKEPSKTIDGRVTVTQLAAVGISGQNSNAADLSLRKIYVANVPMDMPADKLLA 180

Query: 181 HFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFANDG 240
           HFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFANDG
Sbjct: 181 HFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFANDG 240

Query: 241 KKGKPGGGQDGIQTPGAGQGNLHGDGMPMAPPSAMPGSGGQYG--GGLGGSAGGLGGTGG 300
           KKGKPGGG DG QT GAGQGN+HGDGMPMAPPSAMPGSG QYG  GG+G  AG   G  G
Sbjct: 241 KKGKPGGGPDGNQTQGAGQGNVHGDGMPMAPPSAMPGSGAQYGGPGGMGSYAGFSSGLQG 300

Query: 301 PSSLYRLP-QSSVGMPSGGYQDSGHYSMSSASGH 332
              L   P  SS+G            S+ S+ G+
Sbjct: 301 AQPLAHHPLNSSMGPGLSSVGGQAPSSLGSSGGY 334

BLAST of Cp4.1LG01g01520 vs. NCBI nr
Match: gi|1009161987|ref|XP_015899190.1| (PREDICTED: UBP1-associated protein 2C [Ziziphus jujuba])

HSP 1 Score: 451.1 bits (1159), Expect = 1.9e-123
Identity = 244/336 (72.62%), Positives = 273/336 (81.25%), Query Frame = 1

Query: 1   MDATKKRRMEENGVDSSENSFLKITSDDARKIIDRFTPDQLIDILQDAVSRHTDVLDAVR 60
           MD TKKR+MEENG   ++ S LK+T +DARKII+RFTPDQLI+ILQ+AV+ H DVL+AVR
Sbjct: 1   MDLTKKRKMEENGAGDADPSVLKLTPEDARKIIERFTPDQLIEILQEAVAHHADVLEAVR 60

Query: 61  SIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTFKH 120
           S+AD D SQRKLFIRGL  DT+TEGLR+LFSAYGELEEAVVI+DK TGKSKGYGFVTF+H
Sbjct: 61  SVADPDTSQRKLFIRGLGWDTTTEGLRALFSAYGELEEAVVILDKTTGKSKGYGFVTFRH 120

Query: 121 VDGAILALKEPSKTIDGRVTVTQLAAVGISGQNSN---AADMSLRKIYVANVPMDMPADK 180
           VDGA+LALKEPSK IDGR+TVTQLAA G SG N+N   A D+SLRKIYVANVP DMPADK
Sbjct: 121 VDGALLALKEPSKKIDGRMTVTQLAAAGNSGANTNSNAAVDVSLRKIYVANVPYDMPADK 180

Query: 181 LLAHFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFA 240
           LLAHFSLYGEIEEGPLGFDKQTGKCRGYALFVYK PEGAQAALVDP+KTI+GRQL+CK A
Sbjct: 181 LLAHFSLYGEIEEGPLGFDKQTGKCRGYALFVYKTPEGAQAALVDPVKTIEGRQLTCKLA 240

Query: 241 NDGKKGKPGGGQDGIQTPGAGQGNLHGDGMPMAPPSAMPGS-GGQYGG--GLGGSAGGLG 300
            DGKKGK G G DGIQ P  G GN HGDGM +APPS+MPGS GGQYGG  G+G   G  G
Sbjct: 241 IDGKKGKQGSGADGIQAPVGGPGNAHGDGMGLAPPSSMPGSIGGQYGGPAGIGSYGGFSG 300

Query: 301 GTGGPSSLYRLPQSSVGMPSGGYQDSGHYSMSSASG 331
           G  GP   +    SS+G P  G    G+ + SS  G
Sbjct: 301 GLQGPPMGHHPLNSSIGGP--GLSSVGNQAPSSLGG 334

BLAST of Cp4.1LG01g01520 vs. NCBI nr
Match: gi|1012348056|gb|KYP59247.1| (hypothetical protein KK1_014679 [Cajanus cajan])

HSP 1 Score: 436.4 bits (1121), Expect = 4.8e-119
Identity = 254/384 (66.15%), Positives = 295/384 (76.82%), Query Frame = 1

Query: 1   MDATKKRRMEENGVDSSENSFLKITSDDARKIIDRFTPDQLIDILQDAVSRHTDVLDAVR 60
           MD TKKR+++ENG + S +  LK++  +ARK+IDRF+PDQL+DILQDAVSRH DVL AVR
Sbjct: 1   MDLTKKRKIDENGFNDSSDP-LKLSPSEARKLIDRFSPDQLLDILQDAVSRHPDVLAAVR 60

Query: 61  SIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTFKH 120
           + AD DVSQRKLFIRGL  DT+T+GLRSLFS +G+LEEAVVI+DKATGKSKGYGFVTF+H
Sbjct: 61  AAADPDVSQRKLFIRGLGWDTTTDGLRSLFSTFGDLEEAVVILDKATGKSKGYGFVTFRH 120

Query: 121 VDGAILALKEPSKTIDGRVTVTQLAAVGISGQNSNAADMSLRKIYVANVPMDMPADKLLA 180
           VDGA+LAL+EPSK IDGRVTVTQLAA G S  N+NAAD++LRKIYVANVP D+PADKLLA
Sbjct: 121 VDGALLALREPSKRIDGRVTVTQLAAAGNSASNANAADVALRKIYVANVPPDLPADKLLA 180

Query: 181 HFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFA-ND 240
           HFS+YGEIEEGPLGFDKQTGK +G+ALFVYK PEGAQAAL++P+KT++GRQLSCK A  D
Sbjct: 181 HFSVYGEIEEGPLGFDKQTGKSKGFALFVYKTPEGAQAALLEPVKTVEGRQLSCKLAITD 240

Query: 241 GKKGKPGGGQDGIQTPGA---GQGNLHGDGMPMA---PPSAMPGSG--------GQYGG- 300
           GK+GK  G  DG Q  G    G G+  G GM M    PP+A  G G        G YGG 
Sbjct: 241 GKQGKRAG-PDGPQAHGNVQHGHGDGVGAGMGMGMGMPPNAGSGPGQYGPPVGVGSYGGF 300

Query: 301 --GLGGSAGGL--GGTG-GPSSLYRLPQSSVGMPSGG--YQDSG-HYSMSSASGHPNPLH 360
             G GG AGG   GG G G  SLYR P S  GMP GG  Y DSG HYS+S++ G+ N  H
Sbjct: 301 GVGAGGGAGGGIGGGAGAGGGSLYRFPGSG-GMPGGGGGYPDSGGHYSLSASGGYQNQHH 360

BLAST of Cp4.1LG01g01520 vs. NCBI nr
Match: gi|694413896|ref|XP_009335191.1| (PREDICTED: UBP1-associated protein 2C-like [Pyrus x bretschneideri])

HSP 1 Score: 425.2 bits (1092), Expect = 1.1e-115
Identity = 252/442 (57.01%), Positives = 290/442 (65.61%), Query Frame = 1

Query: 1   MDATKKRRMEENGV--DSSENSFLKITSDDARKIIDRFTPDQLIDILQDAVSRHTDVLDA 60
           MDATKKR+++ENGV  D+  +S  K++ +DAR++I+RFTPDQL+DILQDA+SRH DVLDA
Sbjct: 1   MDATKKRKLDENGVVLDTDPSSAPKLSPEDARRLIERFTPDQLLDILQDALSRHVDVLDA 60

Query: 61  VRSIADRDVSQRKLFIRGLSCDTSTEGLRSLFSAYGELEEAVVIIDKATGKSKGYGFVTF 120
           VRSIAD D SQRKLFIRGL  DT+TEGLRSLFSAYGE+E+A+VI+DK TGKSKGYGFVTF
Sbjct: 61  VRSIADPDASQRKLFIRGLGWDTTTEGLRSLFSAYGEIEDAIVILDKTTGKSKGYGFVTF 120

Query: 121 KHVDGAILALKEPSKTIDGRVTVTQLAAVGISGQNS---NAADMSLRKIYVANVPMDMPA 180
           +HVDGA++ALKEPSK IDGR+TVTQLA+ G S  N+   N AD+SLRKIYVANVP DMP+
Sbjct: 121 RHVDGALMALKEPSKKIDGRMTVTQLASAGNSNSNTASNNVADVSLRKIYVANVPYDMPS 180

Query: 181 DKLLAHFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCK 240
           DKLLAHF+LYGEIEEGPLGFDKQTGKC+GYALFVYK PEGAQAALVDP+K I+GRQL+CK
Sbjct: 181 DKLLAHFALYGEIEEGPLGFDKQTGKCKGYALFVYKTPEGAQAALVDPVKNIEGRQLTCK 240

Query: 241 FANDGKKGKPGGGQDGIQTPGAGQGNLHGDGMPMAPPSAMPGSG------GQYGGGLGGS 300
            A DGKKGK  G   G Q PG+G  N HGDGM MAPPS++PG        G YGG   G 
Sbjct: 241 LAIDGKKGKSDGPGQG-QGPGSGS-NTHGDGMGMAPPSSIPGQYGIPGGIGSYGGYTSGL 300

Query: 301 AG-----------------------GLGGTGGPSSLYRLPQSSVGMPSGGYQDSGHYSMS 360
            G                       GLGG GG   L        G P G Y   G Y + 
Sbjct: 301 QGQPPLGHHPLGGPGLSGIGNQVNSGLGGGGGYGGL--------GGPYGNYGGPGGYGLG 360

The following BLAST results are available for this feature:
Database: Swiss-Prot
Match Name    E-value    Identity (%)  Description
UBA2C_ARATH   1.1e-112   57.35         UBP1-associated protein 2C OS=Arabidopsis thaliana GN=UBA2C PE=2 SV=1
UBA2B_ARATH   1.4e-49    46.12         UBP1-associated protein 2B OS=Arabidopsis thaliana GN=UBA2B PE=2 SV=1
UBA2A_ARATH   2.7e-45    38.98         UBP1-associated protein 2A OS=Arabidopsis thaliana GN=UBA2A PE=1 SV=1
UBA1A_ARATH   2.9e-23    44.70         UBP1-associated proteins 1A OS=Arabidopsis thaliana GN=UBA1A PE=1 SV=2
ROA0_HUMAN    7.1e-22    33.33         Heterogeneous nuclear ribonucleoprotein A0 OS=Homo sapiens GN=HNRNPA0 PE=1 SV=1
Match Name        E-value   Identity (%)  Description
A0A0A0KYY6_CUCSA  8.4e-147  84.43         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G083660 PE=4 SV=1
A0A151SWS3_CAJCA  3.3e-119  66.15         Uncharacterized protein OS=Cajanus cajan GN=KK1_014679 PE=4 SV=1
B9GGV7_POPTR      6.5e-115  63.76         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s38640g PE=4 SV=2
M5WGI4_PRUPE      1.8e-112  70.15         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005857mg PE=4 SV=1
D7L3Y0_ARALL      3.0e-112  57.88         Putative uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRA...
Match Name   E-value   Identity (%)  Description
AT3G15010.1  6.4e-114  57.35         RNA-binding (RRM/RBD/RNP motifs) family protein
AT2G41060.1  7.7e-51   46.12         RNA-binding (RRM/RBD/RNP motifs) family protein
AT3G56860.3  1.5e-46   38.98         UBP1-associated protein 2A
AT2G22090.2  1.6e-24   44.70         RNA-binding (RRM/RBD/RNP motifs) family protein
AT2G19380.1  4.0e-23   34.59         RNA recognition motif (RRM)-containing protein
Match Name                         E-value   Identity (%)  Description
gi|449438887|ref|XP_004137219.1|   1.2e-146  84.43         PREDICTED: UBP1-associated protein 2C [Cucumis sativus]
gi|659101647|ref|XP_008451717.1|   1.6e-146  84.43         PREDICTED: UBP1-associated protein 2C [Cucumis melo]
gi|1009161987|ref|XP_015899190.1|  1.9e-123  72.62         PREDICTED: UBP1-associated protein 2C [Ziziphus jujuba]
gi|1012348056|gb|KYP59247.1|       4.8e-119  66.15         hypothetical protein KK1_014679 [Cajanus cajan]
gi|694413896|ref|XP_009335191.1|   1.1e-115  57.01         PREDICTED: UBP1-associated protein 2C-like [Pyrus x bretschneideri]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0000166  nucleotide binding
GO:0003676  nucleic acid binding
Vocabulary: INTERPRO
Term        Definition
IPR012677   Nucleotide-bd_a/b_plait_sf
IPR000504   RRM_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0008219      cell death
biological_process  GO:0006952      defense response
biological_process  GO:0009693      ethylene biosynthetic process
biological_process  GO:0010150      leaf senescence
cellular_component  GO:0044424      intracellular part
cellular_component  GO:0030529      intracellular ribonucleoprotein complex
cellular_component  GO:0005730      nucleolus
cellular_component  GO:0019013      viral nucleocapsid
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0000166      nucleotide binding
molecular_function  GO:0003729      mRNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g01520.1  Cp4.1LG01g01520.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                             Source   Source Term         Source Description          Alignment
IPR000504  RNA recognition motif domain                PFAM     PF00076             RRM_1
    coord: 164..232, score: 8.3E-9
    coord: 72..139, score: 1.7
IPR000504  RNA recognition motif domain                SMART    SM00360             rrm1_1
    coord: 71..143, score: 2.8E-18
    coord: 163..235, score: 3.3
IPR000504  RNA recognition motif domain                PROFILE  PS50102             RRM
    coord: 70..138, score: 15.153
    coord: 162..239, score: 13
IPR012677  Nucleotide-binding alpha-beta plait domain  GENE3D   G3DSA:3.30.70.330
    coord: 150..248, score: 2.0E-16
    coord: 65..143, score: 2.8
IPR012677  Nucleotide-binding alpha-beta plait domain  unknown  SSF54928            RNA-binding domain, RBD
    coord: 63..145, score: 2.15E-21
    coord: 149..265, score: 9.67
None       No IPR available                            PANTHER  PTHR24012           FAMILY NOT NAMED
    coord: 11..28, score: 7.5E-127
    coord: 67..244, score: 7.5E-127
    coord: 280..359, score: 7.5E
None       No IPR available                            PANTHER  PTHR24012:SF490     UBP1-ASSOCIATED PROTEIN 2C
    coord: 11..28, score: 7.5E-127
    coord: 67..244, score: 7.5E-127
    coord: 280..359, score: 7.5E