Cp4.1LG07g07550 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG07g07550
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Poly(RC)-binding-like protein
Location: Cp4.1LG07 : 6607090 .. 6613319 (+)
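
The location field gives the scaffold, the start and end coordinates, and the strand. As a minimal sketch, the span of the gene can be derived from it as follows; the helper name `parse_location` is hypothetical, and the calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into its four parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG07 : 6607090 .. 6613319 (+)")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
gene_length = end - start + 1
print(seqid, strand, gene_length)
```

Under that assumption the gene spans 6230 bp on the plus strand of Cp4.1LG07.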
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TCCTTTCTCTCTTCCTTCGTCCTCTTAAAACCTTCTGAAAATCTCAACCCTAACTTCTTCCTCCTTCTTCATCTTCCTCCATGGCCACCAACGCAACTACAGAGAATGGCTCCACTGATGCCGACGCCCTCAATTCCAGCCCCCAACTTGCCTGTGAAGCTGCTGCAGAACCGGACTCTACGGAAGCCGGTGGCGCCGAATCCGACATTGATCCCTCTAACCAAGACTATTTATCGGCTCCCGCTTCGGATTATGTACCTCACGAAAGTCCCAATCATGCTGGCGCTTCCGATAAAAAGTGGCCTGGATGGCCCGGAGACTGCGTCTTCCGGATTATTGTACCGGTCGTTAAAGTCGGCAGCATTATTGGGCGCAGGGGCGATCTTATCAAGAAGATCTGTGAGGAAACTAGGGCTAGAATTCGTGTTCTTGATGGTGCTGTTGGCACTCCTGATCGCGTCGTGAGTGTTTTTTTTTAATTAATTAATTTATTTAGCAATGGAAATGGCTTTGTTACTCATGGTTACGTAGGGATGACGAGTTGAATCTCTATCTTTATATGGTTTAAACTGGATTAAATGATTCATAGTCTAAAAATATCTCCATTTTCGTGCCCTCTGGTTTCCAGAAAACCAAGGCCGTTCTTCCATCTTTGGCTGGTTCCATCAATTTTATACTAGTTAGTCATAAAAACTATTTCCCACTTCTTTCACAGCTGCCGTCCGATAATAGGTCAAGTAGTTGTATAAATGGAGCTTCTTGTGGTCGAAAGAAACATATATTATAGAATTGAGGTCTGTTCCATGGTGGCGCAATGCTCTTATCACTTAGAAGTAGAGTTAGACTTGGGAGGCAAAAGGGTTTTTCTTTTTTTATGAACATTTTGATTTTTTTTTCCTATAATTATTTGTTGGGCTTAGTTACTATGGAATTTTTTTCCCCACGTAGTTTAGGTTTTATTTGAGCTCTTCTTGTGGGTTCTTTTTCTTCGCATGCCCTTGTATTTTCTTTAAAATTTTCAAGGAAACCCCGGTTTTCAATTGAAAAAAAATAAAGACTTGAGGCAAAATATTACAGCGATCTGTCTATTAACTTCCATAATCTTGTTGTTTTTGGATAGGTTGCTTTTACATTATTTTGTTTACTGGTTTAGTTAAGATTGTGCACCTTATACTTAGAGGAGTGCCCGAGTGATTGTGACTAGGTTCAATTAGAAACTGTAATATATATAGATCAGAGGTATAACTTTTCCTTTTTGAAGGCTGTTGCAATAACAATATTGGATGCTTCACTTTGATTTGAAAGAAAAATTTGGATGCATTCTTGTACAATTTCTTTTGAGTAGGAGATCTTGTTTTATCTTGTTCGTTCAGCTTTTAATGTTTAGTTTTTACTTCAGGTGCTAATTTCAGGAAAGGAAGAGCCTGAGGCACCCCTTTCCCCTGCAATGGATGCAGTTTTAAGAATCTTTAAACGTGTGTCTGGATTTTCTGAGAATGAAGATGAGGCTAAAGCTTCATTCTGTTCAGTCCGGTTACTGGTGGCATCAACTCAAGCAATCAATCTAATTGGAAAGCAGGGTTCATTAATCAAATCTATACAGGAGAGTACAGGGGCTTCTGTGAGAGTTTTATCAGGAGGTATGCTTTTGTTCTCTGTATTTCACTTCACGTGTCAGGCTCTTTTACTGACCGGTTTTGGACTTGGCTACATGAAATCTACCTGATAGCTAAATTTTAGAGACATCAGAAAGGCTATTGTTGAGGGAAAGACTATTAAGCACACACTTAAAAACCTCACAGCAAGGGAAGGATCATTTAAAAGAATAATAATGTCCCAAAGAAACCAGGAGATCAATAGAAAACACCCATCATTGTGAGAAATGCAAATGGGTCTAGTAAGAATTGATCCACCTTACTAAGCTTCGGTTTGGCAGTGTTTATGATAAGGTACATAGTTGTTATTCTTTAACCCTTTTGATCTCACTCCAATTTTATATT
GATCAACTATTCGACCAGAGGGGTGATTCCGTTAGGAAAATTCGATCAAACGTTTGTAAGAAGTGGAAATTTGGGATGCCTCATTGCAAGATTGCAACCTCGCAAGTTGTTCTATATTAGAGTTTAGAATTGGATAGTTCGTAGAGACAGAATACTGAAAGACCAGTGTTAGGCCACCCTTCCATATAAACAGTTAGACTCCAATTGTGGACAAGTTAGGAGTTCAAGTTTAAAGTACTCATCTCATTGACTATGAGTCCTTGAAGAAAATACTAATATCCAAGAGGATCATTAGTGTCGATGGGGATTACAAGTTTACCTTTTAAGGTTCCTTTTGGTTAAAAGAAACCCCTTCTCATGGTTGTCTGATTAAGCTTCCTTACTGGTAAGAGCTTACCATGTTTCTCCCCAAAGGAGAGGGGCCCTTGAGGTCAAAACTCCCTTTATTCATTAGCCCATTGAAAAGCCTTCCAGTAAAGGCATCTATCGTGTTATTTTGACATGTTTGATTTGAAGGATTAGAGTTGGTAATTAAAGCGAAGAAGAATTTCCATGATCTATCGTAGGTGTCTTTTGTTCAAATCAAACAACAAAACAAAATCAATTGGGTAGAAACTACCGGGTGAAAGGGAGTATTCGTACAAGTCTATTGTCAATTAGCGAAAGACCAATGGCAGTTGAATCTATTGAAAACAAAGTGTGGCCCTGATACCAACTGTCAAGCTTATTGGTCGTTAGGGGAATGGATTCACAATTTGTTCTCAATAAAATTGTGAACCCATTCCCCTAACGGCCAATAAACTTGAGAAAAATTTCAGTCCCATTCATCCCTAGGAATGTTTCTCAGTAGTCAGTTCTATTCCGAAATGCTAGTGATATTCCATGAAGACTACAGTGTAGTACTTAGGATGCCATGTACCGTTTCTATGATGGGCAACCATAAGTCATTCCCCAGTTAAGAGAATCATAATGCTCGAATCTATTTTAACGTTGTTAACCAAGTTTTCGTCATTGGAAATAGCAGACAGGTAGTTTATGTGGGGTTAATTGTTGGAGCAGGGAAGTGATGGAAAGTGGAAATAGAACTACATTGCTCAGCTTAGTATCATGGTTGCTCACATGCCTTTCCCTAGCTGCAAGCTACTTGTCTCATTACTCATTGCGCTTGGTAACGGTCATCCCCATAACATTGATCTTTGGAATGATTTAACTTGCACATACACTTCGTCACTTACAACCATTGTCTTCAACAATACGTAGATGTTGATGTTCAAATATTTGACCATTTGTGTCGATGTTTCCTTAACTAGTTGAGCTATGCTTAGATCGATGGTAGACGGTAGTCTTGTCACCACATTTGAACTTGTTGGTATAGGTGGCATGTCAACCTATTGTCAGCTAACCTTAGAGAAGCTTACTTAATCTCTGATTCACTAACATCTTTCTGTTGCTGATACATGGTAAAGATGACCTAAAACTAAAAGTATATGACAATAGCTAGATAAATTAAGTGTTATTTCTTTCCTCAATTATGCTGTTATAAATCAAAGTAACCAAGTATGATGACTATTAACTTCAAGCCTTGAGATTCAGTCGACAGCTGCAAGCAAATAATAATGTTCTGAGATAATAATACATTTCCTACTATAGTCATTCATCTATGAGTACTCTCTGAATGTTCCCTTACTCGTCTACATTCGATGCTTGCAGATGAGATGCCATTCTATGCTGGTCCGGATGAGAGAATGGTGGAATTGCAGGGAGAAACCTTAAAGGTTCTCAAGGCTCTAGAGGCAGTGGTGGGCCACCTAAGGAAGTTTTTAGTTGATCATAGTGTTCTTCCTCTATTTGAAAAGAGTGTAAGTTTTTTCCTCTTAGCTTATGTTCAACCTCCTCTTTTACCATATTTTAGGACTGCCAATGAATTGTTGCAACTTTTTTTTAATAGTTCAATACACCTGCCTCACAAGATCGCCAAACGGATGCCTGGGCTGACAAGT
CCTCACTTCTTTCTGCATCTCAAAGTGTAATCTCCAACGAGTATCCTCCATCTTCAAAAAGAGAATCTCTGTTTTTTGACCGAGAAACTCATCTTGATTCTCATATTTCATCCTCCGGTATTTCTCTCTATGGACCAGATCGAGTGCTTCCTGCAATTCGATCATCAGGCGTTGGTCGCTCCGGTGTGCCCATTGTTACTCAGGTTGGTATATATGTAGTCATTGGTTGGAAATGGTCTATGTTGACTCAGGCCTGTGGGAGGCTGCAAACTATCCCATCATTGCATGTCCATTATTCTTTTTCCAGTCTTGAGGTCCAGGCAAGCTGGAAGCTTGTCAGCTTTACATGTCCATCGTGTCTTGGAAACACTATGTTCAAAATGTTTGTCATATTGCATACAAGTATTATTTAGCATAATGCTGAATGGAGGCCGTGGGGTCACACAGTAGTTACCAAGTTCGTCTTTATCATCTTTGACCAAGTCACCTCTTTGTGGTTCCTGTCAGGTCACGCAGACAATGCAAATCCCGTTATCTTATGCTGAGGACATCATTGGAGTTGGAGGAGCAAATATCGCATTCATCCGTCGCAATAGTGGTGCAATCTTAACCATACAAGAGAGCAGGGGATTACCAGATGAAATCACTGTGGAAATAAAAGGCACTTCCTCACAAGTTCAGATGGCTCAACAATTAATTCAGGTAATATTTAATTTCCCCGTTTGACTATTAACTTTGTGGTTATTGCTTGTTATGGTCCATAGCGTTGCGCCCATGTCATGCAAAAAGTTACTTATATTTCATTGTGCAACTGTAACGGCCCAGATCCACTGCTAGCGGATATTGTCCTCTTTGGGCTTTCCCTTTCGGGCTTTCCCTCAAGGCTTTAAAACGCGTCTGCTAGGGAAAGGTTTCCACTCCCTTATAAGGGTGTTTTGTTCTCCTCCCCAACCAATGTGGGACATCACAATCCACCCCCCTTCGGCACCCAGCGTCCTCCCTGGCACTCGTTCCTTTCTCCAATTCGATGTGGGACCGCCACCAAATCTGGTTTCCACTCCCTTATAAGGGTGTTTTGTTCTCCTCCCCAACCAATGTGGGACATCACAATCCACCCCTCTTCAGGGCCCAGCATCCTCGCTGGCACTCGTTCCTTTCTCCAATTCGATGTGGGACCGCCACCAAATCCACCCCCCTTCGGGGCCCAACGTCCTTACTAGCACACCGCCTCGTGTCTACCCCCCTTCGGGGAACAGCGAGAAAGCTGGCACATCGTCCGGTGTCTGACTCTGATACCATTTGTAACAGCTCAGATCCACCGCTAGCAGATATTGTCCTCTTTGGGCTTTCCCTTTTGGGCTTCCCCTTAAGGCTTTAAAACGCGTCTGCTAGGGAAAGGTTTCCACTTCCTTATAAGGGTGTTTTGTTCTCCTCCCCAACCAATGTGGGACATCACAGCAACTCTTTCCTTTCCTTGGCCAAAGTTTTAGCCTCAATGCTTCATTACACTAGATAGTTTCGGTTATTATTATGGATTACTAGGATTCTCGAGGAAAATAGCAAGCAAATGGTAACTGCTGGACATTGATATTTATGATTTTGTTTTCCTTGTATTGTACTTATAGGAAGCAATAAGTGGCCCTAAGGAACCAGTAACAAGCAGCAGCTATGGCAGGCTAGACACCACAGGCTTGAGGTCATCATACTCCCAGTTGGGTGCTTCAGGTTCATCTTACACTCCATCCTCTTTATCATCACAATCATATGGTGGTTATGGGTCATCTGGGTTAGGAGGGTACAGTACCTTCAGGCTCTAAATTGACTGATCTGCAGATTCATATCCACTCGGATTTGATACGCATCGTATTAATTCTTGTCTGTTAAGAGGATGAATGAATCATGTTGGAAGGTAGTTCTTGTAAAGTCAGCCTAAAGTAGTTAAAAAAACATTTTACCCTCTTGCATATCAAAACTGATTTAGCTGAGTAGATTCGTCTTAC
GTTTTCTTTTCCCTTCTTTTTCTGTGTCGATGTATTCGTTTGTGAGTAGGTTCTTATTAGTCACTTCAGGTCATCCATCTAAAGTTTAAAGTTTAAAAGTATTTAGTTTTTAGTTCAACACGTTCTAACAATGTGGAGCATTCGAACTTTCGAGTTCTTGGTTGAAATCGTACATACTTTAAACTTTATACTTTATCTAAGATAATAAATTCAGTTTCATTATTATACAT

mRNA sequence

TCCTTTCTCTCTTCCTTCGTCCTCTTAAAACCTTCTGAAAATCTCAACCCTAACTTCTTCCTCCTTCTTCATCTTCCTCCATGGCCACCAACGCAACTACAGAGAATGGCTCCACTGATGCCGACGCCCTCAATTCCAGCCCCCAACTTGCCTGTGAAGCTGCTGCAGAACCGGACTCTACGGAAGCCGGTGGCGCCGAATCCGACATTGATCCCTCTAACCAAGACTATTTATCGGCTCCCGCTTCGGATTATGTACCTCACGAAAGTCCCAATCATGCTGGCGCTTCCGATAAAAAGTGGCCTGGATGGCCCGGAGACTGCGTCTTCCGGATTATTGTACCGGTCGTTAAAGTCGGCAGCATTATTGGGCGCAGGGGCGATCTTATCAAGAAGATCTGTGAGGAAACTAGGGCTAGAATTCGTGTTCTTGATGGTGCTGTTGGCACTCCTGATCGCGTCGTGCTAATTTCAGGAAAGGAAGAGCCTGAGGCACCCCTTTCCCCTGCAATGGATGCAGTTTTAAGAATCTTTAAACGTGTGTCTGGATTTTCTGAGAATGAAGATGAGGCTAAAGCTTCATTCTGTTCAGTCCGGTTACTGGTGGCATCAACTCAAGCAATCAATCTAATTGGAAAGCAGGGTTCATTAATCAAATCTATACAGGAGAGTACAGGGGCTTCTGTGAGAGTTTTATCAGGAGATGAGATGCCATTCTATGCTGGTCCGGATGAGAGAATGGTGGAATTGCAGGGAGAAACCTTAAAGGTTCTCAAGGCTCTAGAGGCAGTGGTGGGCCACCTAAGGAAGTTTTTAGTTGATCATAGTGTTCTTCCTCTATTTGAAAAGAGTTTCAATACACCTGCCTCACAAGATCGCCAAACGGATGCCTGGGCTGACAAGTCCTCACTTCTTTCTGCATCTCAAAGTGTAATCTCCAACGAGTATCCTCCATCTTCAAAAAGAGAATCTCTGTTTTTTGACCGAGAAACTCATCTTGATTCTCATATTTCATCCTCCGGTATTTCTCTCTATGGACCAGATCGAGTGCTTCCTGCAATTCGATCATCAGGCGTTGGTCGCTCCGGTGTGCCCATTGTTACTCAGGTCACGCAGACAATGCAAATCCCGTTATCTTATGCTGAGGACATCATTGGAGTTGGAGGAGCAAATATCGCATTCATCCGTCGCAATAGTGGTGCAATCTTAACCATACAAGAGAGCAGGGGATTACCAGATGAAATCACTGTGGAAATAAAAGGCACTTCCTCACAAGTTCAGATGGCTCAACAATTAATTCAGGAAGCAATAAGTGGCCCTAAGGAACCAGTAACAAGCAGCAGCTATGGCAGGCTAGACACCACAGGCTTGAGGTCATCATACTCCCAGTTGGGTGCTTCAGGTTCATCTTACACTCCATCCTCTTTATCATCACAATCATATGGTGGTTATGGGTCATCTGGGTTAGGAGGGTACAGTACCTTCAGGCTCTAAATTGACTGATCTGCAGATTCATATCCACTCGGATTTGATACGCATCGTATTAATTCTTGTCTGTTAAGAGGATGAATGAATCATGTTGGAAGGTAGTTCTTGTAAAGTCAGCCTAAAGTAGTTAAAAAAACATTTTACCCTCTTGCATATCAAAACTGATTTAGCTGAGTAGATTCGTCTTACGTTTTCTTTTCCCTTCTTTTTCTGTGTCGATGTATTCGTTTGTGAGTAGGTTCTTATTAGTCACTTCAGGTCATCCATCTAAAGTTTAAAGTTTAAAAGTATTTAGTTTTTAGTTCAACACGTTCTAACAATGTGGAGCATTCGAACTTTCGAGTTCTTGGTTGAAATCGTACATACTTTAAACTTTATACTTTATCTAAGATAATAAATTCAGTTTCATTATTATACAT

Coding sequence (CDS)

ATGGCCACCAACGCAACTACAGAGAATGGCTCCACTGATGCCGACGCCCTCAATTCCAGCCCCCAACTTGCCTGTGAAGCTGCTGCAGAACCGGACTCTACGGAAGCCGGTGGCGCCGAATCCGACATTGATCCCTCTAACCAAGACTATTTATCGGCTCCCGCTTCGGATTATGTACCTCACGAAAGTCCCAATCATGCTGGCGCTTCCGATAAAAAGTGGCCTGGATGGCCCGGAGACTGCGTCTTCCGGATTATTGTACCGGTCGTTAAAGTCGGCAGCATTATTGGGCGCAGGGGCGATCTTATCAAGAAGATCTGTGAGGAAACTAGGGCTAGAATTCGTGTTCTTGATGGTGCTGTTGGCACTCCTGATCGCGTCGTGCTAATTTCAGGAAAGGAAGAGCCTGAGGCACCCCTTTCCCCTGCAATGGATGCAGTTTTAAGAATCTTTAAACGTGTGTCTGGATTTTCTGAGAATGAAGATGAGGCTAAAGCTTCATTCTGTTCAGTCCGGTTACTGGTGGCATCAACTCAAGCAATCAATCTAATTGGAAAGCAGGGTTCATTAATCAAATCTATACAGGAGAGTACAGGGGCTTCTGTGAGAGTTTTATCAGGAGATGAGATGCCATTCTATGCTGGTCCGGATGAGAGAATGGTGGAATTGCAGGGAGAAACCTTAAAGGTTCTCAAGGCTCTAGAGGCAGTGGTGGGCCACCTAAGGAAGTTTTTAGTTGATCATAGTGTTCTTCCTCTATTTGAAAAGAGTTTCAATACACCTGCCTCACAAGATCGCCAAACGGATGCCTGGGCTGACAAGTCCTCACTTCTTTCTGCATCTCAAAGTGTAATCTCCAACGAGTATCCTCCATCTTCAAAAAGAGAATCTCTGTTTTTTGACCGAGAAACTCATCTTGATTCTCATATTTCATCCTCCGGTATTTCTCTCTATGGACCAGATCGAGTGCTTCCTGCAATTCGATCATCAGGCGTTGGTCGCTCCGGTGTGCCCATTGTTACTCAGGTCACGCAGACAATGCAAATCCCGTTATCTTATGCTGAGGACATCATTGGAGTTGGAGGAGCAAATATCGCATTCATCCGTCGCAATAGTGGTGCAATCTTAACCATACAAGAGAGCAGGGGATTACCAGATGAAATCACTGTGGAAATAAAAGGCACTTCCTCACAAGTTCAGATGGCTCAACAATTAATTCAGGAAGCAATAAGTGGCCCTAAGGAACCAGTAACAAGCAGCAGCTATGGCAGGCTAGACACCACAGGCTTGAGGTCATCATACTCCCAGTTGGGTGCTTCAGGTTCATCTTACACTCCATCCTCTTTATCATCACAATCATATGGTGGTTATGGGTCATCTGGGTTAGGAGGGTACAGTACCTTCAGGCTCTAA
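
Because the coding sequence is a contiguous part of the mRNA, the 5' UTR can be recovered by locating the start of the CDS inside the mRNA. The sketch below does this on short excerpts copied from the two records above (the full sequences would work the same way); the variable names are illustrative only.

```python
# First bases of the mRNA sequence listed above (excerpt only, for brevity).
mrna_excerpt = (
    "TCCTTTCTCTCTTCCTTCGTCCTCTTAAAACCTTCTGAAAATCTCAACCCTAACTTCTTCCTCCTTCTTCATCTTCCTCC"
    "ATGGCCACCAACGCAACTACAGAGAATGGC"
)
# First bases of the coding sequence listed above.
cds_head = "ATGGCCACCAACGCAACTACAGAGAATGGC"

# 0-based index of the start codon within the mRNA excerpt.
cds_offset = mrna_excerpt.find(cds_head)
if cds_offset < 0:
    raise ValueError("CDS start not found in mRNA excerpt")

# Everything upstream of the start codon is 5' UTR.
five_prime_utr = mrna_excerpt[:cds_offset]
print(len(five_prime_utr), five_prime_utr[-8:])
```

The same `str.find` approach applied to the end of the CDS would give the start of the 3' UTR.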

Protein sequence

MATNATTENGSTDADALNSSPQLACEAAAEPDSTEAGGAESDIDPSNQDYLSAPASDYVPHESPNHAGASDKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDGAVGTPDRVVLISGKEEPEAPLSPAMDAVLRIFKRVSGFSENEDEAKASFCSVRLLVASTQAINLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKALEAVVGHLRKFLVDHSVLPLFEKSFNTPASQDRQTDAWADKSSLLSASQSVISNEYPPSSKRESLFFDRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLSYAEDIIGVGGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAISGPKEPVTSSSYGRLDTTGLRSSYSQLGASGSSYTPSSLSSQSYGGYGSSGLGGYSTFRL
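
The protein sequence is the conceptual translation of the CDS. As a hedged sketch, assuming the standard nuclear genetic code applies to this gene, the correspondence can be checked on the first and last codons of the CDS; `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 15 bases of the CDS listed above.
cds_start = "ATGGCCACCAACGCAACTACAGAGAATGGC"
cds_end = "ACCTTCAGGCTCTAA"
print(translate(cds_start))
print(translate(cds_end))
```

The two excerpts translate to MATNATTENG and TFRL, which match the beginning and the end of the protein sequence above.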
BLAST of Cp4.1LG07g07550 vs. Swiss-Prot
Match: PEP_ARATH (RNA-binding KH domain-containing protein PEPPER OS=Arabidopsis thaliana GN=PEP PE=1 SV=1)

HSP 1 Score: 448.7 bits (1153), Expect = 7.6e-125
Identity = 254/442 (57.47%), Positives = 324/442 (73.30%), Query Frame = 1

Query: 64  PNHAGASDKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDGAVGT 123
           P+   +++++WPGWPGDCVFR+IVPV KVG+IIGR+GD IKK+CEETRARI+VLDG V T
Sbjct: 57  PDTNDSAEERWPGWPGDCVFRMIVPVTKVGAIIGRKGDFIKKMCEETRARIKVLDGPVNT 116

Query: 124 PDRVVLISGKEEPEAPLSPAMDAVLRIFKRVSGFSENEDE----AKASFCSVRLLVASTQ 183
           PDR+VLISGKEEPEA +SPAMDAVLR+F+RVSG  +N+D+    A + F SVRLLVASTQ
Sbjct: 117 PDRIVLISGKEEPEAYMSPAMDAVLRVFRRVSGLPDNDDDDVQNAGSVFSSVRLLVASTQ 176

Query: 184 AINLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKALEAVVG 243
           AINLIGKQGSLIKSI E++GASVR+LS +E PFYA  DER+V+LQGE LK+LKALEA+VG
Sbjct: 177 AINLIGKQGSLIKSIVENSGASVRILSEEETPFYAAQDERIVDLQGEALKILKALEAIVG 236

Query: 244 HLRKFLVDHSVLPLFEKSFNTPASQDRQTDAWAD-KSSLLSASQSVISNEYPPSSKRESL 303
           HLR+FLVDH+V+PLFEK +    SQ RQ +  A+ KSSL + S +++  ++   ++RE L
Sbjct: 237 HLRRFLVDHTVVPLFEKQYLARVSQTRQEEPLAESKSSLHTISSNLMEPDFSLLARREPL 296

Query: 304 FFDRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLSYAEDII 363
           F +R++ +DS +  SG+S+Y  D VL A  S G+ R     VTQV+QTMQIP SYAEDII
Sbjct: 297 FLERDSRVDSRVQPSGVSIYSQDPVLSARHSPGLARVSSAFVTQVSQTMQIPFSYAEDII 356

Query: 364 GVGGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAISGPKEPVT 423
           GV GANIA+IRR SGA +TI+ES   PD+ITVEIKGT+SQVQ A+QLIQE I   KEPV+
Sbjct: 357 GVEGANIAYIRRRSGATITIKESPH-PDQITVEIKGTTSQVQTAEQLIQEFIINHKEPVS 416

Query: 424 -SSSYGRLDTTGL---------------------------RSSYSQLGASGSSYTPSSLS 471
            S  Y R+D+  +                            ++YSQLG   S+YTP +L+
Sbjct: 417 VSGGYARIDSGYVPAYPPQLSNRQEPLPSTYMGTEPVQYRPTAYSQLGGP-STYTP-TLT 476
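
The percentages in each HSP header are the counts of identical and positive-scoring (conservatively substituted) aligned positions divided by the alignment length, gaps included. A minimal sketch reproducing the figures reported for the PEP_ARATH hit above; the function name `percent` is illustrative.

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / alignment_length, 2)

# Counts taken from the HSP header of the PEP_ARATH hit.
identity = percent(254, 442)
positives = percent(324, 442)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The alignment length (442 here) can exceed the length of the aligned query region because gap columns are counted.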

BLAST of Cp4.1LG07g07550 vs. Swiss-Prot
Match: FLK_ARATH (Flowering locus K homology domain OS=Arabidopsis thaliana GN=FLK PE=1 SV=1)

HSP 1 Score: 276.6 bits (706), Expect = 5.2e-73
Identity = 174/419 (41.53%), Positives = 239/419 (57.04%), Query Frame = 1

Query: 68  GASDKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDGAVGTPDRV 127
           G  +K+WPGWPG+ VFR++VP  KVGSIIGR+GD+IKKI EETRARI++LDG  GT +R 
Sbjct: 174 GGEEKRWPGWPGETVFRMLVPAQKVGSIIGRKGDVIKKIVEETRARIKILDGPPGTTERA 233

Query: 128 VLISGKEEPEAPLSPAMDAVLRIFKR-VSGFSENEDEA-KASFCSVRLLVASTQAINLIG 187
           V++SGKEEPE+ L P+MD +LR+  R V G      +A   S  S RLLV ++QA +LIG
Sbjct: 234 VMVSGKEEPESSLPPSMDGLLRVHMRIVDGLDGEASQAPPPSKVSTRLLVPASQAGSLIG 293

Query: 188 KQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKALEAVVGHLRKFL 247
           KQG  +K+IQE++   VRVL  +++P +A  D+R+VE+ GE   V +ALE +  HLRKFL
Sbjct: 294 KQGGTVKAIQEASACIVRVLGSEDLPVFALQDDRVVEVVGEPTSVHRALELIASHLRKFL 353

Query: 248 VDHSVLPLFEKSFNTPASQ-DRQTDAWADKSSLLSASQSV------------------IS 307
           VD S++P FE     P  Q D               + SV                    
Sbjct: 354 VDRSIIPFFENQMQKPTRQMDHMPPPHQSWGPPQGHAPSVGGGGYGHNPPPYMQPPPRHD 413

Query: 308 NEYPPSSKRESLFFDRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQT 367
           + YPP   R+    +++ H        GIS YG +  +    +  V  +   +  QVTQ 
Sbjct: 414 SYYPPPEMRQPP-MEKQPH-------QGISAYGREPPM----NVHVSSAPPMVAQQVTQQ 473

Query: 368 MQIPLSYAEDIIGVGGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLI 427
           MQIPLSYA+ +IG  G+NI++ RR SGA +TIQE+RG+P E+TVE+ GT SQVQ A QLI
Sbjct: 474 MQIPLSYADAVIGTSGSNISYTRRLSGATVTIQETRGVPGEMTVEVSGTGSQVQTAVQLI 533

Query: 428 QEAISGPKEPVTSSSYGRLDTTGLRSSYSQLGASGSSYTPSSLSSQSYGGYGSSGLGGY 466
           Q  ++    P  +           +  Y+     GS Y  ++  +   GGY +    GY
Sbjct: 534 QNFMAEAGAPAPAQPQ---TVAPEQQGYNPYATHGSVY--AAAPTNPPGGYATDYSSGY 575

BLAST of Cp4.1LG07g07550 vs. Swiss-Prot
Match: HEN4_ARATH (KH domain-containing protein HEN4 OS=Arabidopsis thaliana GN=HEN4 PE=1 SV=1)

HSP 1 Score: 91.3 bits (225), Expect = 3.1e-17
Identity = 67/243 (27.57%), Positives = 122/243 (50.21%), Query Frame = 1

Query: 80  DCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDGAVGTPDRVVLISGKEEPEAP 139
           D VF+I+      G +IG  G +++ +  ET A I V +      +R++ ++  E PE  
Sbjct: 451 DVVFKILCSTENAGGVIGTGGKVVRMLHSETGAFINVGNTLDDCEERLIAVTASENPECQ 510

Query: 140 LSPAMDAVLRIFKRVSGFSENE--DEAKASFCSVRLLVASTQAINLIGKQGSLIKSIQES 199
            SPA  A++ IF R+   + N+  D    S  + RL+V ++Q   ++GK G ++  ++++
Sbjct: 511 SSPAQKAIMLIFSRLFELATNKILDNGPRSSITARLVVPTSQIGCVLGKGGVIVSEMRKT 570

Query: 200 TGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKALEAVVGHLRKFLVDHSVLPLFEKS 259
           TGA++++L  ++ P     ++++V++ GE   V +A+  +   LR  +  +S+     KS
Sbjct: 571 TGAAIQILKVEQNPKCISENDQVVQITGEFPNVREAIFHITSRLRDSVFSNSMKNSLAKS 630

Query: 260 ---FNTPASQDRQTDAWADKSSLLSASQSVISNEYPPSSKRESLFFDRETHLDSHISSSG 318
                T    DRQ+D     + L   S   +SN   P++   SL    E   DS +S S 
Sbjct: 631 SSALTTERFYDRQSD-----NPLSIGSHQSVSN---PATNSSSLHRRSE---DSFLSGSH 682

BLAST of Cp4.1LG07g07550 vs. Swiss-Prot
Match: Y4837_ARATH (KH domain-containing protein At4g18375 OS=Arabidopsis thaliana GN=At4g18375 PE=2 SV=1)

HSP 1 Score: 90.5 bits (223), Expect = 5.3e-17
Identity = 86/334 (25.75%), Positives = 154/334 (46.11%), Query Frame = 1

Query: 80  DCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDGAV--GTPDRVVLISGKEEPE 139
           + VF+++ P+  +  +IG+ G  IK+I E + + I V D     G  + V++++  E P+
Sbjct: 311 ELVFKVLCPLCNIMRVIGKGGSTIKRIREASGSCIEVNDSRTKCGDDECVIIVTATESPD 370

Query: 140 APLSPAMDAVLRIFKRVSGFSENEDEAKASFCSVRLLVASTQAINLIGKQGSLIKSIQES 199
              S A++AVL + + +     N+++A+     ++LLV+S     +IGK GS+I  I++ 
Sbjct: 371 DMKSMAVEAVLLLQEYI-----NDEDAEN--VKMQLLVSSKVIGCVIGKSGSVINEIRKR 430

Query: 200 TGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKALEAVVGHLRKFLVDHSVLPLFEKS 259
           T A++ +  G +        + +VE+ GE   V  AL  +V  LR+ ++           
Sbjct: 431 TNANICISKGKK--------DDLVEVSGEVSSVRDALIQIVLRLREDVLG---------- 490

Query: 260 FNTPASQDRQTDAWADKSSLLSASQSVISNEYPPSSKRESLFFDRETHLDSHISSSGISL 319
            +  +   R+  A  D  S LS S +                +   + + S  S+SG   
Sbjct: 491 -DKDSVATRKPPARTDNCSFLSGSSNA--------------GYTLPSFMSSMASTSGFHG 550

Query: 320 YGP----DRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLSYAEDIIGVGGANIAFIRRNSG 379
           YG     D VL +      GR    + +     + IP      ++G GG N+  IRR SG
Sbjct: 551 YGSFPAGDNVLGSTGPYSYGR----LPSSSALEILIPAHAMSKVMGKGGGNLENIRRISG 600

Query: 380 AILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQ 408
           A++ I  S+    +    + GT  Q++ A+ L+Q
Sbjct: 611 AMIEISASKTSHGDHIALLSGTLEQMRCAENLVQ 600

BLAST of Cp4.1LG07g07550 vs. Swiss-Prot
Match: PCBP3_HUMAN (Poly(rC)-binding protein 3 OS=Homo sapiens GN=PCBP3 PE=2 SV=2)

HSP 1 Score: 89.7 bits (221), Expect = 9.0e-17
Identity = 93/338 (27.51%), Positives = 155/338 (45.86%), Query Frame = 1

Query: 84  RIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDGAVGTPDRVVLISGKEEPEAPLSPA 143
           R+++   +VGSIIG++G+ +KK+ EE+ ARI + +G    P+R+V I+G   P   +  A
Sbjct: 49  RLLMHGKEVGSIIGKKGETVKKMREESGARINISEG--NCPERIVTITG---PTDAIFKA 108

Query: 144 MDAVLRIFKR-VSGFSENEDEAKASFCSVRLLVASTQAINLIGKQGSLIKSIQESTGASV 203
              +   F+  +     N         ++RL+V ++Q  +LIGK GS IK I+ESTGA V
Sbjct: 109 FAMIAYKFEEDIINSMSNSPATSKPPVTLRLVVPASQCGSLIGKGGSKIKEIRESTGAQV 168

Query: 204 RVLSGDEMPFYAGPDERMVELQGETLKVLKALEAVVGHLRKFLVDHSVLPLFEKSFNTP- 263
           +V +GD +P      ER V + G    +++ ++ +   + +     + +P   K  +TP 
Sbjct: 169 QV-AGDMLP---NSTERAVTISGTPDAIIQCVKQICVVMLESPPKGATIPYRPKPASTPV 228

Query: 264 --------ASQDRQTDAWADKSSLLSASQSVISNEYPPSSKRESLFFDRETHLDSHISSS 323
                     Q +      D+ + L    ++    +PP  +    F   +  L  H S  
Sbjct: 229 IFAGGQAYTIQGQYAIPHPDQLTKLH-QLAMQQTPFPPLGQTNPAFPGEK--LPLHSSEE 288

Query: 324 GISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLSYAEDIIGVGGANIAFIRRNSG 383
             +L G        +SSG+  S        T  + IP      IIG  G  I  IR+ SG
Sbjct: 289 AQNLMG--------QSSGLDAS----PPASTHELTIPNDLIGCIIGRQGTKINEIRQMSG 348

Query: 384 AILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAIS 412
           A + I  +     E  + I GT + + +AQ LI   ++
Sbjct: 349 AQIKIANATEGSSERQITITGTPANISLAQYLINARLT 362

BLAST of Cp4.1LG07g07550 vs. TrEMBL
Match: A0A0A0LMH4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G061550 PE=4 SV=1)

HSP 1 Score: 779.6 bits (2012), Expect = 2.1e-222
Identity = 412/470 (87.66%), Positives = 429/470 (91.28%), Query Frame = 1

Query: 1   MATNATTENGSTDADALNSSPQLACEAAAEPDSTEAGGAESDIDPSNQDYLSAPASDYVP 60
           MATN TTENGSTDA     S  L   AAAEPDSTEAG  +SD DPSNQDY S P SD   
Sbjct: 1   MATNTTTENGSTDAIPNPLSSTLPSLAAAEPDSTEAGDDDSDSDPSNQDYSSVPPSDSAA 60

Query: 61  HESPNHAGASDKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDGA 120
           HE  NH G SDKKWPGWPGDCVFR+IVPVVKVGSIIGR+GDLIKK+CEETRARIRVLDGA
Sbjct: 61  HEPSNHTGPSDKKWPGWPGDCVFRLIVPVVKVGSIIGRKGDLIKKMCEETRARIRVLDGA 120

Query: 121 VGTPDRVVLISGKEEPEAPLSPAMDAVLRIFKRVSGFSENEDEAKASFCSVRLLVASTQA 180
           VGTPDRVVLISGKEE E+PLSPAMDAV+R+FKRVSG SENEDEAKASFCS+RLLVASTQA
Sbjct: 121 VGTPDRVVLISGKEELESPLSPAMDAVIRVFKRVSGLSENEDEAKASFCSIRLLVASTQA 180

Query: 181 INLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKALEAVVGH 240
           INLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAG DERMVELQGE+LKVLKALE VVGH
Sbjct: 181 INLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGADERMVELQGESLKVLKALEGVVGH 240

Query: 241 LRKFLVDHSVLPLFEKSFNTPASQDRQTDAWADKSSLLSASQSVISNEYPPSSKRESLFF 300
           LRKFLVDHSVLPLFEKSFNTPASQDRQT+ WADKSSLL+ASQS+IS EY PS+KRESLF 
Sbjct: 241 LRKFLVDHSVLPLFEKSFNTPASQDRQTETWADKSSLLTASQSIISAEYAPSTKRESLFL 300

Query: 301 DRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLSYAEDIIGV 360
           DRE H DSHISSSGISLYG DRVLP IRSSGVGRSG PIVTQVTQTMQIPLSYAEDIIGV
Sbjct: 301 DREAHFDSHISSSGISLYGQDRVLPTIRSSGVGRSGGPIVTQVTQTMQIPLSYAEDIIGV 360

Query: 361 GGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAISGPKEPVTSS 420
           GG NIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEA++ PKEPVTSS
Sbjct: 361 GGTNIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAVNAPKEPVTSS 420

Query: 421 SYGRLDTTGLRSSYSQLGASGSSYTPSSLSSQSYGGYGSSGLGGYSTFRL 471
           SYGRLDTTGLRSSYSQL ASGSS+T SSLSSQSYGGYGSSGLGGY+TFRL
Sbjct: 421 SYGRLDTTGLRSSYSQLAASGSSFTSSSLSSQSYGGYGSSGLGGYTTFRL 470

BLAST of Cp4.1LG07g07550 vs. TrEMBL
Match: W9SIR6_9ROSA (Poly(RC)-binding protein 3 OS=Morus notabilis GN=L484_005636 PE=4 SV=1)

HSP 1 Score: 583.6 bits (1503), Expect = 2.2e-163
Identity = 330/480 (68.75%), Positives = 382/480 (79.58%), Query Frame = 1

Query: 1   MATNATTENGSTDADALNSSPQLA-CEAAAEPDSTEAGGAESDIDPSNQDYLSAPASDYV 60
           MATN  T NG T A   +S P  +   AAA   + +A   E      N++  S PA    
Sbjct: 1   MATNEPTANGVTKAPKSDSQPIPSDATAAASTANEKAPVTEPGHATGNRNMESEPAPPGE 60

Query: 61  PHESP-NHAGAS-----DKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRAR 120
             E+P ++AG +     DKKW GWPGDCVFR+IVPV+KVGSIIGR+G+LIKK+CEETRAR
Sbjct: 61  SGEAPASNAGTTATTDADKKWLGWPGDCVFRLIVPVLKVGSIIGRKGELIKKMCEETRAR 120

Query: 121 IRVLDGAVGTPDRVVLISGKEEPEAPLSPAMDAVLRIFKRVSGFSENEDEAKASFCSVRL 180
           IRVLDGAVGTPDR+VLISGKEEPEA LSPAMDAV+R+FKRVSG SENE  A  +FCS+RL
Sbjct: 121 IRVLDGAVGTPDRIVLISGKEEPEAALSPAMDAVIRVFKRVSGLSENE-AAGVAFCSIRL 180

Query: 181 LVASTQAINLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKA 240
           LVASTQAINLIGKQGSLIKSIQESTGASVRVL+GDE+PFYA  DER+VELQGE LKVLKA
Sbjct: 181 LVASTQAINLIGKQGSLIKSIQESTGASVRVLTGDEVPFYAAADERIVELQGEGLKVLKA 240

Query: 241 LEAVVGHLRKFLVDHSVLPLFEKSFNTPASQDRQTDAWADKSSLLSASQSVISNEYPPSS 300
           LEAV+GHLRKFLVDHSV+PLFEKS+N+  SQ+RQ D WADKS L +++Q+ +   YP ++
Sbjct: 241 LEAVIGHLRKFLVDHSVIPLFEKSYNSTISQERQVDTWADKSMLQASTQTGVGTNYPITA 300

Query: 301 KRESLFFDRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLSY 360
           KRE  F +RET L+  + SSGIS+YG D  LP IR +G GR+G PIVTQ+ QTMQIPLSY
Sbjct: 301 KREPYFLERETQLEPQLPSSGISMYGQDTSLPGIRPTGFGRAGAPIVTQIAQTMQIPLSY 360

Query: 361 AEDIIGVGGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAISGP 420
           AEDIIG+ G+NIA+IRR SGAILT+QESRGLPDEITVEIKGTSSQVQ+AQQLIQE I+  
Sbjct: 361 AEDIIGIEGSNIAYIRRTSGAILTVQESRGLPDEITVEIKGTSSQVQLAQQLIQEVINNH 420

Query: 421 KEPVTSSSYGRLDTTGLRSSYSQLGASGSSYTPS-SLSSQSY--GGYGSSGLGGYSTFRL 471
           KEPV SS YGR+DT  LRS+YSQL    SSY P+ SL  +SY  GGYGSSGLGGYSTFRL
Sbjct: 421 KEPVPSS-YGRIDTA-LRSNYSQLS---SSYPPTTSLPPRSYSGGGYGSSGLGGYSTFRL 474

BLAST of Cp4.1LG07g07550 vs. TrEMBL
Match: F6GXK2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_11s0052g00030 PE=4 SV=1)

HSP 1 Score: 580.5 bits (1495), Expect = 1.9e-162
Identity = 325/478 (67.99%), Positives = 380/478 (79.50%), Query Frame = 1

Query: 1   MATNATTENGSTDADALNSSPQLACEAAAEP-DSTEAGGAESDIDPSNQDYLSAPASDYV 60
           MAT  TTE    +  A +       E +  P  ++ A  AES+  PS     +  +    
Sbjct: 1   MATTGTTEPAPVNGAAQSPGSDPKTELSETPLSASNAATAESEQAPSE----NLESESTA 60

Query: 61  PHESPNHAGASDKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDG 120
           P E+   A AS+KKWPGWPGDCVFR+IVPV+KVGSIIGR+G+LIKK+CEETRARIRVLDG
Sbjct: 61  PPETEAPAPASEKKWPGWPGDCVFRLIVPVLKVGSIIGRKGELIKKMCEETRARIRVLDG 120

Query: 121 AVGTPDRVVLISGKEEPEAPLSPAMDAVLRIFKRVSGFSENEDEAKA------SFCSVRL 180
           AVGT DR+VLISG+EEPEAPLSPAMDAV+R+FKRV+G SE+E + KA      +FCS+RL
Sbjct: 121 AVGTSDRIVLISGREEPEAPLSPAMDAVIRVFKRVTGLSESEGDGKAYGAAGVAFCSIRL 180

Query: 181 LVASTQAINLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKA 240
           LVASTQAINLIGKQGSLIKSIQESTGASVRVLSGDE+PFYA  DER+VELQGE LKV KA
Sbjct: 181 LVASTQAINLIGKQGSLIKSIQESTGASVRVLSGDEVPFYAAADERIVELQGEALKVQKA 240

Query: 241 LEAVVGHLRKFLVDHSVLPLFEKSFNTPASQDRQTDAWADKSSLLSASQSVISNEYPPSS 300
           LEAVVGHLRKFLVDHSVLPLFE+++N   SQDRQ+D WADKS L   SQ+ + ++Y   +
Sbjct: 241 LEAVVGHLRKFLVDHSVLPLFERTYNATISQDRQSDTWADKSLLHGTSQTGMGSDYSLPA 300

Query: 301 KRESLFFDRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLSY 360
           KRESL+ DRET ++     SG+ +YG +  L  IRSSG+GR+G PIVTQ+ QTMQIPLSY
Sbjct: 301 KRESLYLDRETQMEH----SGLPMYGQEHGLSGIRSSGLGRAGAPIVTQIAQTMQIPLSY 360

Query: 361 AEDIIGVGGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAISGP 420
           AEDIIG+GGANIA+IRR SGAILT+QESRGLPDEITVEIKGTSSQVQ AQQLIQE IS  
Sbjct: 361 AEDIIGIGGANIAYIRRTSGAILTVQESRGLPDEITVEIKGTSSQVQTAQQLIQEFISNH 420

Query: 421 KEPVTSSSYGRLDTTGLRSSYSQLGASGSSYTPSSLSSQSY-GGYGSSGLGGYSTFRL 471
           KEPV  SSYG++D +GLRSSYSQLG   +SY+ SSLSSQ Y GGYGSSG+GGYS++RL
Sbjct: 421 KEPV-PSSYGKMD-SGLRSSYSQLG--NTSYSSSSLSSQPYGGGYGSSGVGGYSSYRL 466

BLAST of Cp4.1LG07g07550 vs. TrEMBL
Match: A0A067F3I5_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012307mg PE=4 SV=1)

HSP 1 Score: 559.7 bits (1441), Expect = 3.4e-156
Identity = 311/478 (65.06%), Positives = 370/478 (77.41%), Query Frame = 1

Query: 1   MATNATTENGSTDADALNSSPQLACEAAAEPDSTEAGGAESDIDPS----NQDYLSAPAS 60
           MAT  TT+NGS +    +  P+ A        +TE+   ES+  P+    N ++ +   S
Sbjct: 1   MATAETTDNGSINLPEPDPQPETA--------TTESPTTESEAPPTIGSENTEHTTESDS 60

Query: 61  DYVPHESPNHAGA-SDKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIR 120
                  P+ A    +KKWPGWPG CVFR+IVPV+KVGSIIGR+G+LIKK CE+TRARI+
Sbjct: 61  ALPQSTGPDAAAVPEEKKWPGWPGHCVFRMIVPVLKVGSIIGRKGELIKKTCEDTRARIK 120

Query: 121 VLDGAVGTPDRVVLISGKEEPEAPLSPAMDAVLRIFKRVSGFSENEDEAKAS---FCSVR 180
           VLDG V +PDR+VLISGKEEPEAP+SPAMDA +R+FKRVSG  EN+ +AKAS   FCSVR
Sbjct: 121 VLDGPVSSPDRIVLISGKEEPEAPVSPAMDAAVRVFKRVSGLPENDVDAKASGAAFCSVR 180

Query: 181 LLVASTQAINLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLK 240
           LLV STQAINLIGKQGSLIKSIQE++GASVRVLS DE PFY   DER+VE+QGE  KVLK
Sbjct: 181 LLVPSTQAINLIGKQGSLIKSIQENSGASVRVLSADEAPFYVTEDERIVEMQGEAAKVLK 240

Query: 241 ALEAVVGHLRKFLVDHSVLPLFEKSFNTPASQDRQTDAWADKSSLLSASQSVISNEYPPS 300
           ALEAVVGHLRKFLVD  VLPLFEK++N   SQ+RQ + WADKSSL +A+QS IS EY PS
Sbjct: 241 ALEAVVGHLRKFLVDQGVLPLFEKTYNASISQERQVETWADKSSLHAATQSAISTEYTPS 300

Query: 301 SKRESLFFDRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLS 360
           ++RESLF +RE  L+S    SGIS+YG D  L  IRSS +GR+  PIVTQ+TQTMQIP+S
Sbjct: 301 TRRESLFLEREPQLESRYRLSGISIYGQDPALSTIRSSALGRASGPIVTQITQTMQIPIS 360

Query: 361 YAEDIIGVGGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAISG 420
           YAEDIIGVGG +I  IRR SGAI+T+QESRGLPDEITVEIKGTSSQVQ+AQQLIQE ++ 
Sbjct: 361 YAEDIIGVGGTSIENIRRTSGAIITVQESRGLPDEITVEIKGTSSQVQLAQQLIQEYMNN 420

Query: 421 PKEPVTSSSYGRLDTTGLRSSYSQLGASGSSYTPSSLSSQSYGGYGSSGLGGYSTFRL 471
            KE +T SSYG++D TG R SY QLG   SSY  SSLS+Q YGGYGSSG+GGY+++RL
Sbjct: 421 HKESIT-SSYGQID-TGYRPSYPQLG--NSSYPSSSLSTQPYGGYGSSGVGGYTSYRL 466

BLAST of Cp4.1LG07g07550 vs. TrEMBL
Match: V4U0W5_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10008200mg PE=4 SV=1)

HSP 1 Score: 559.7 bits (1441), Expect = 3.4e-156
Identity = 311/478 (65.06%), Positives = 370/478 (77.41%), Query Frame = 1

Query: 1   MATNATTENGSTDADALNSSPQLACEAAAEPDSTEAGGAESDIDPS----NQDYLSAPAS 60
           MAT  TT+NGS +    +  P+ A        +TE+   ES+  P+    N ++ +   S
Sbjct: 1   MATAETTDNGSINLPEPDPQPETA--------TTESPTTESEAPPTIGSENTEHTTESDS 60

Query: 61  DYVPHESPNHAGA-SDKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIR 120
                  P+ A    +KKWPGWPG CVFR+IVPV+KVGSIIGR+G+LIKK CE+TRARI+
Sbjct: 61  ALPQSTGPDAAAVPEEKKWPGWPGHCVFRMIVPVLKVGSIIGRKGELIKKTCEDTRARIK 120

Query: 121 VLDGAVGTPDRVVLISGKEEPEAPLSPAMDAVLRIFKRVSGFSENEDEAKAS---FCSVR 180
           VLDG V +PDR+VLISGKEEPEAP+SPAMDA +R+FKRVSG  EN+ +AKAS   FCSVR
Sbjct: 121 VLDGPVSSPDRIVLISGKEEPEAPVSPAMDAAVRVFKRVSGLPENDVDAKASGAAFCSVR 180

Query: 181 LLVASTQAINLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLK 240
           LLV STQAINLIGKQGSLIKSIQE++GASVRVLS DE PFY   DER+VE+QGE  KVLK
Sbjct: 181 LLVPSTQAINLIGKQGSLIKSIQENSGASVRVLSSDEAPFYVTEDERIVEMQGEAAKVLK 240

Query: 241 ALEAVVGHLRKFLVDHSVLPLFEKSFNTPASQDRQTDAWADKSSLLSASQSVISNEYPPS 300
           ALEAVVGHLRKFLVD  VLPLFEK++N   SQ+RQ + WADKSSL +A+QS IS EY PS
Sbjct: 241 ALEAVVGHLRKFLVDQGVLPLFEKTYNASISQERQVETWADKSSLHAATQSAISTEYTPS 300

Query: 301 SKRESLFFDRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLS 360
           ++RESLF +RE  L+S    SGIS+YG D  L  IRSS +GR+  PIVTQ+TQTMQIP+S
Sbjct: 301 TRRESLFLEREPQLESRYRLSGISIYGQDPALSTIRSSALGRASGPIVTQITQTMQIPIS 360

Query: 361 YAEDIIGVGGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAISG 420
           YAEDIIGVGG +I  IRR SGAI+T+QESRGLPDEITVEIKGTSSQVQ+AQQLIQE ++ 
Sbjct: 361 YAEDIIGVGGTSIENIRRTSGAIITVQESRGLPDEITVEIKGTSSQVQLAQQLIQEYMNN 420

Query: 421 PKEPVTSSSYGRLDTTGLRSSYSQLGASGSSYTPSSLSSQSYGGYGSSGLGGYSTFRL 471
            KE +T SSYG++D TG R SY QLG   SSY  SSLS+Q YGGYGSSG+GGY+++RL
Sbjct: 421 HKESIT-SSYGQID-TGYRPSYPQLG--NSSYPSSSLSTQPYGGYGSSGVGGYTSYRL 466

BLAST of Cp4.1LG07g07550 vs. TAIR10
Match: AT4G26000.1 (AT4G26000.1 RNA-binding KH domain-containing protein)

HSP 1 Score: 448.7 bits (1153), Expect = 4.3e-126
Identity = 254/442 (57.47%), Positives = 324/442 (73.30%), Query Frame = 1

Query: 64  PNHAGASDKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDGAVGT 123
           P+   +++++WPGWPGDCVFR+IVPV KVG+IIGR+GD IKK+CEETRARI+VLDG V T
Sbjct: 57  PDTNDSAEERWPGWPGDCVFRMIVPVTKVGAIIGRKGDFIKKMCEETRARIKVLDGPVNT 116

Query: 124 PDRVVLISGKEEPEAPLSPAMDAVLRIFKRVSGFSENEDE----AKASFCSVRLLVASTQ 183
           PDR+VLISGKEEPEA +SPAMDAVLR+F+RVSG  +N+D+    A + F SVRLLVASTQ
Sbjct: 117 PDRIVLISGKEEPEAYMSPAMDAVLRVFRRVSGLPDNDDDDVQNAGSVFSSVRLLVASTQ 176

Query: 184 AINLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKALEAVVG 243
           AINLIGKQGSLIKSI E++GASVR+LS +E PFYA  DER+V+LQGE LK+LKALEA+VG
Sbjct: 177 AINLIGKQGSLIKSIVENSGASVRILSEEETPFYAAQDERIVDLQGEALKILKALEAIVG 236

Query: 244 HLRKFLVDHSVLPLFEKSFNTPASQDRQTDAWAD-KSSLLSASQSVISNEYPPSSKRESL 303
           HLR+FLVDH+V+PLFEK +    SQ RQ +  A+ KSSL + S +++  ++   ++RE L
Sbjct: 237 HLRRFLVDHTVVPLFEKQYLARVSQTRQEEPLAESKSSLHTISSNLMEPDFSLLARREPL 296

Query: 304 FFDRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLSYAEDII 363
           F +R++ +DS +  SG+S+Y  D VL A  S G+ R     VTQV+QTMQIP SYAEDII
Sbjct: 297 FLERDSRVDSRVQPSGVSIYSQDPVLSARHSPGLARVSSAFVTQVSQTMQIPFSYAEDII 356

Query: 364 GVGGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAISGPKEPVT 423
           GV GANIA+IRR SGA +TI+ES   PD+ITVEIKGT+SQVQ A+QLIQE I   KEPV+
Sbjct: 357 GVEGANIAYIRRRSGATITIKESPH-PDQITVEIKGTTSQVQTAEQLIQEFIINHKEPVS 416

Query: 424 -SSSYGRLDTTGL---------------------------RSSYSQLGASGSSYTPSSLS 471
            S  Y R+D+  +                            ++YSQLG   S+YTP +L+
Sbjct: 417 VSGGYARIDSGYVPAYPPQLSNRQEPLPSTYMGTEPVQYRPTAYSQLGGP-STYTP-TLT 476

BLAST of Cp4.1LG07g07550 vs. TAIR10
Match: AT3G04610.1 (AT3G04610.1 RNA-binding KH domain-containing protein)

HSP 1 Score: 276.6 bits (706), Expect = 2.9e-74
Identity = 174/419 (41.53%), Positives = 239/419 (57.04%), Query Frame = 1

Query: 68  GASDKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDGAVGTPDRV 127
           G  +K+WPGWPG+ VFR++VP  KVGSIIGR+GD+IKKI EETRARI++LDG  GT +R 
Sbjct: 174 GGEEKRWPGWPGETVFRMLVPAQKVGSIIGRKGDVIKKIVEETRARIKILDGPPGTTERA 233

Query: 128 VLISGKEEPEAPLSPAMDAVLRIFKR-VSGFSENEDEA-KASFCSVRLLVASTQAINLIG 187
           V++SGKEEPE+ L P+MD +LR+  R V G      +A   S  S RLLV ++QA +LIG
Sbjct: 234 VMVSGKEEPESSLPPSMDGLLRVHMRIVDGLDGEASQAPPPSKVSTRLLVPASQAGSLIG 293

Query: 188 KQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKALEAVVGHLRKFL 247
           KQG  +K+IQE++   VRVL  +++P +A  D+R+VE+ GE   V +ALE +  HLRKFL
Sbjct: 294 KQGGTVKAIQEASACIVRVLGSEDLPVFALQDDRVVEVVGEPTSVHRALELIASHLRKFL 353

Query: 248 VDHSVLPLFEKSFNTPASQ-DRQTDAWADKSSLLSASQSV------------------IS 307
           VD S++P FE     P  Q D               + SV                    
Sbjct: 354 VDRSIIPFFENQMQKPTRQMDHMPPPHQSWGPPQGHAPSVGGGGYGHNPPPYMQPPPRHD 413

Query: 308 NEYPPSSKRESLFFDRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQT 367
           + YPP   R+    +++ H        GIS YG +  +    +  V  +   +  QVTQ 
Sbjct: 414 SYYPPPEMRQPP-MEKQPH-------QGISAYGREPPM----NVHVSSAPPMVAQQVTQQ 473

Query: 368 MQIPLSYAEDIIGVGGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLI 427
           MQIPLSYA+ +IG  G+NI++ RR SGA +TIQE+RG+P E+TVE+ GT SQVQ A QLI
Sbjct: 474 MQIPLSYADAVIGTSGSNISYTRRLSGATVTIQETRGVPGEMTVEVSGTGSQVQTAVQLI 533

Query: 428 QEAISGPKEPVTSSSYGRLDTTGLRSSYSQLGASGSSYTPSSLSSQSYGGYGSSGLGGY 466
           Q  ++    P  +           +  Y+     GS Y  ++  +   GGY +    GY
Sbjct: 534 QNFMAEAGAPAPAQPQ---TVAPEQQGYNPYATHGSVY--AAAPTNPPGGYATDYSSGY 575
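
The percentages on each HSP summary line are the identity and positive counts divided by the alignment length. A minimal Python sketch of that arithmetic follows, assuming the summary line keeps the layout shown in this report. The helper name `parse_hsp_summary` and the regular expression are illustrative assumptions, not part of any BLAST toolkit.

```python
import re

# Hypothetical helper for illustration only: parses one HSP summary line
# in the layout used on this page and recomputes the percentages.
# The label before the second count is matched loosely ("Pos" + letters).
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    """Return counts plus recomputed and reported percentages for one HSP."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        # Percentage = count / alignment length * 100, rounded to 2 places.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positive_pct": round(100 * int(pos) / int(pos_length), 2),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }

# Summary line of the AT3G04610.1 hit reported above.
summary = parse_hsp_summary(
    "Identity = 174/419 (41.53%), Positives = 239/419 (57.04%), Query Frame = 1"
)
print(summary["identity_pct"], summary["positive_pct"])
```

Comparing the recomputed values with the reported ones is a quick consistency check when copying figures from the alignment blocks into a summary table.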

BLAST of Cp4.1LG07g07550 vs. TAIR10
Match: AT5G46190.1 (AT5G46190.1 RNA-binding KH domain-containing protein)

HSP 1 Score: 107.5 bits (267), Expect = 2.3e-23
Identity = 102/339 (30.09%), Positives = 164/339 (48.38%), Query Frame = 1

Query: 84  RIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDGAVGTPDR-VVLISGKEEPEAPLSP 143
           ++I    K+G +IG+ G  IK I + + + I V D      D  V+ ++  E P+   S 
Sbjct: 320 KVICSSSKIGRVIGKGGLTIKGIRQASGSHIEVNDSRTNHDDDCVITVTATESPDDLKSM 379

Query: 144 AMDAVLRIFKRVSGFSENEDEAKASFCSVRLLVASTQAINLIGKQGSLIKSIQESTGASV 203
           A++AVL + ++++   E+ED+ K     ++LLV+S     +IGK GS+I  I++ T A +
Sbjct: 380 AVEAVLLLQEKIN--DEDEDKVK-----MQLLVSSKVIGCIIGKSGSIISEIRKRTKADI 439

Query: 204 RVLSGDEMPFYAGPDERMVELQGETLKVLKALEAVVGHLRKFLVDHSVLPLFEKSFNTPA 263
            +  G+  P  A P++ +VE+ GE   V  AL  +V  LR    D  +      S N P 
Sbjct: 440 HISKGNNTPKCADPNDELVEISGEVSNVRDALIQIVLRLR----DDVLRDRETGSRNQPP 499

Query: 264 SQDRQTDAWADKSSL--LSASQSVISNEYPPSSKRESLFFDRETHLDSHI----SSSGIS 323
           ++    + ++  SS   L+  QS +S+      +  S+ F+R     S +    SS GI 
Sbjct: 500 ARSENNNFFSSSSSNTGLALPQSFMSS----VPQVASVDFNRRPETGSSMSMLPSSGGIY 559

Query: 324 LYGPDRVLPAIRSSGVGRS-----GVPIVTQVTQTMQIPLSYAEDIIGVGGANIAFIRRN 383
            YG   V      S    S     G+P  T  T  ++IP +    ++G GG N+  IRR 
Sbjct: 560 GYGSFPVGNTSYGSNSSYSSNLYGGLPQST--TMEVRIPANAVGKVMGRGGGNLDNIRRI 619

Query: 384 SGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAI 411
           SGA++ I +S+         I GTS Q + A+ L Q  I
Sbjct: 620 SGAMIEISDSKNSHGGRVALISGTSEQKRTAENLFQAFI 641

BLAST of Cp4.1LG07g07550 vs. TAIR10
Match: AT5G53060.1 (AT5G53060.1 RNA-binding KH domain-containing protein)

HSP 1 Score: 107.5 bits (267), Expect = 2.3e-23
Identity = 95/343 (27.70%), Positives = 165/343 (48.10%), Query Frame = 1

Query: 80  DCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDGAVGTPDRVVLISGKEEPEAP 139
           + VF+I+ P  K+  ++G    +I  +  E    +RV D   G+ ++++ IS +E P+ P
Sbjct: 324 ELVFQILCPADKIVRVVGESQGIIDLLQNEIGVDVRVSDPVAGSDEQIITISSEEAPDDP 383

Query: 140 LSPAMDAVLRIFKRVSGFSENEDEAKASFCSVRLLVASTQAINLIGKQGSLIKSIQESTG 199
             PA +A+L I  ++     ++D    +  + RLLV S  +I L GK GS +  I   TG
Sbjct: 384 FFPAQEALLHIQTQIIDLIPDKD----NLITTRLLVPSRDSICLEGKAGS-VSEISRLTG 443

Query: 200 ASVRVLSGDEMPFYAGPDERMVELQGETLKVLKALEAVVGHLRKFLVDHSVLPLFEKSFN 259
            SV++L+ +E+P  A  ++ ++++ GE   +  A EA+V  L   L  H    L +K   
Sbjct: 444 TSVQILAREEIPRCASINDVVIQITGE---IRAAREALV-ELTLLLRSHMFKELSQK--E 503

Query: 260 TPASQDRQT---DAWADKSSLLSASQSVISNEYPPSS----KRESLFFDRETHLDSHISS 319
           TP +    T   +  A    + S++ ++ S E P SS    ++ S    +       ++ 
Sbjct: 504 TPPASTSTTGPLEGVAGVMEVASSNNTIQSREGPTSSNLNLQQVSTILPQFKEGFGSVAK 563

Query: 320 SGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLSYAEDIIGVGGANIAFIRRNS 379
           +G S +  +  +P   S    R  VP+VT+ T  + +P +    ++      +A I   S
Sbjct: 564 AGESEHREE--VPVTTS----RMAVPLVTRSTLEVVLPEAVVPKLVTKSRNKLAQISEWS 623

Query: 380 GAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAISGPKE 416
           GA +TI E R    +  + I GT  Q + AQ L+Q  I   +E
Sbjct: 624 GASVTIVEDRPEETQNIIRISGTPEQAERAQSLLQGFILSIQE 649

BLAST of Cp4.1LG07g07550 vs. TAIR10
Match: AT5G15270.2 (AT5G15270.2 RNA-binding KH domain-containing protein)

HSP 1 Score: 103.2 bits (256), Expect = 4.4e-22
Identity = 105/391 (26.85%), Positives = 167/391 (42.71%), Query Frame = 1

Query: 52  SAPASDYVPHESPNHAGASDKKWPGWP--------GDCVFRIIVPVVKVGSIIGRRGDLI 111
           S P SDY      +  G S +++ G           D VFR + PV K+GS+IGR GD++
Sbjct: 19  SRPQSDY------DDNGGSKRRYRGDDRDSLVIDRDDTVFRYLCPVKKIGSVIGRGGDIV 78

Query: 112 KKICEETRARIRVLDGAVGTPDRVVLISGKEEP-------EAPLSPAMDAVLRIFKRV-- 171
           K++  +TR++IR+ +   G  +RV+ I    +        E  LSPA DA+ RI  RV  
Sbjct: 79  KQLRNDTRSKIRIGEAIPGCDERVITIYSPSDETNAFGDGEKVLSPAQDALFRIHDRVVA 138

Query: 172 -SGFSENEDEAKASFCSVRLLVASTQAINLIGKQGSLIKSIQESTGASVRVLSGDEMPFY 231
               SE+  E +    + +LLV S Q   ++G+ G ++++I+  TGA +R++    MP  
Sbjct: 139 DDARSEDSPEGEKQ-VTAKLLVPSDQIGCILGRGGQIVQNIRSETGAQIRIVKDRNMPLC 198

Query: 232 AGPDERMVELQGETLKVLKALEAVVGHLRKFLVDHSVLPLFEKSFNTPASQDRQTDAWAD 291
           A   + ++++ GE L V KAL  +   L +               N   SQ         
Sbjct: 199 ALNSDELIQISGEVLIVKKALLQIASRLHE---------------NPSRSQ--------- 258

Query: 292 KSSLLSASQSVISNEYPPSSKRESLFFDRETHLDSHISSSGIS-------LYGPDRVLPA 351
             +LLS+     S  YP  S        R   L   + S G         LY P R    
Sbjct: 259 --NLLSS-----SGGYPAGSLMSHAGGPRLVGLAPLMGSYGRDAGDWSRPLYQPPR---- 318

Query: 352 IRSSGVGRSGVPIVTQVTQTMQIPLSYAEDIIGVGGANIAFIRRNSGAILTIQESRGLPD 411
                      P  T+    +  P+     +IG GGA I  +R+ + A + +  SR   +
Sbjct: 319 ---------NDPPATEFFIRLVSPVENIASVIGKGGALINQLRQETRATIKVDSSRTEGN 350

Query: 412 EITVEIKGTSSQVQMAQQLIQEAISGPKEPV 418
           +  + I         A+++ ++A S   E V
Sbjct: 379 DCLITIS--------AREVFEDAYSPTIEAV 350

BLAST of Cp4.1LG07g07550 vs. NCBI nr
Match: gi|778667604|ref|XP_004149510.2| (PREDICTED: RNA-binding KH domain-containing protein PEPPER [Cucumis sativus])

HSP 1 Score: 779.6 bits (2012), Expect = 3.0e-222
Identity = 412/470 (87.66%), Positives = 429/470 (91.28%), Query Frame = 1

Query: 1   MATNATTENGSTDADALNSSPQLACEAAAEPDSTEAGGAESDIDPSNQDYLSAPASDYVP 60
           MATN TTENGSTDA     S  L   AAAEPDSTEAG  +SD DPSNQDY S P SD   
Sbjct: 1   MATNTTTENGSTDAIPNPLSSTLPSLAAAEPDSTEAGDDDSDSDPSNQDYSSVPPSDSAA 60

Query: 61  HESPNHAGASDKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDGA 120
           HE  NH G SDKKWPGWPGDCVFR+IVPVVKVGSIIGR+GDLIKK+CEETRARIRVLDGA
Sbjct: 61  HEPSNHTGPSDKKWPGWPGDCVFRLIVPVVKVGSIIGRKGDLIKKMCEETRARIRVLDGA 120

Query: 121 VGTPDRVVLISGKEEPEAPLSPAMDAVLRIFKRVSGFSENEDEAKASFCSVRLLVASTQA 180
           VGTPDRVVLISGKEE E+PLSPAMDAV+R+FKRVSG SENEDEAKASFCS+RLLVASTQA
Sbjct: 121 VGTPDRVVLISGKEELESPLSPAMDAVIRVFKRVSGLSENEDEAKASFCSIRLLVASTQA 180

Query: 181 INLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKALEAVVGH 240
           INLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAG DERMVELQGE+LKVLKALE VVGH
Sbjct: 181 INLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGADERMVELQGESLKVLKALEGVVGH 240

Query: 241 LRKFLVDHSVLPLFEKSFNTPASQDRQTDAWADKSSLLSASQSVISNEYPPSSKRESLFF 300
           LRKFLVDHSVLPLFEKSFNTPASQDRQT+ WADKSSLL+ASQS+IS EY PS+KRESLF 
Sbjct: 241 LRKFLVDHSVLPLFEKSFNTPASQDRQTETWADKSSLLTASQSIISAEYAPSTKRESLFL 300

Query: 301 DRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLSYAEDIIGV 360
           DRE H DSHISSSGISLYG DRVLP IRSSGVGRSG PIVTQVTQTMQIPLSYAEDIIGV
Sbjct: 301 DREAHFDSHISSSGISLYGQDRVLPTIRSSGVGRSGGPIVTQVTQTMQIPLSYAEDIIGV 360

Query: 361 GGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAISGPKEPVTSS 420
           GG NIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEA++ PKEPVTSS
Sbjct: 361 GGTNIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAVNAPKEPVTSS 420

Query: 421 SYGRLDTTGLRSSYSQLGASGSSYTPSSLSSQSYGGYGSSGLGGYSTFRL 471
           SYGRLDTTGLRSSYSQL ASGSS+T SSLSSQSYGGYGSSGLGGY+TFRL
Sbjct: 421 SYGRLDTTGLRSSYSQLAASGSSFTSSSLSSQSYGGYGSSGLGGYTTFRL 470

BLAST of Cp4.1LG07g07550 vs. NCBI nr
Match: gi|659069666|ref|XP_008451191.1| (PREDICTED: poly(rC)-binding protein 3 [Cucumis melo])

HSP 1 Score: 768.1 bits (1982), Expect = 9.1e-219
Identity = 409/470 (87.02%), Positives = 428/470 (91.06%), Query Frame = 1

Query: 1   MATNATTENGSTDADALNSSPQLACEAAAEPDSTEAGGAESDIDPSNQDYLSAPASDYVP 60
           MATN TT+NGSTDA A   S  L   AAAE DSTE G  +SDIDPSNQDY S P SD   
Sbjct: 1   MATNTTTDNGSTDAIANPQSSTLPSLAAAEADSTEVGDDDSDIDPSNQDYSSLPPSDSAA 60

Query: 61  HESPNHAGASDKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDGA 120
           HE  +H GASDKKWPGWPGDCVFR+IVPVVKVGSIIGR+GDLIKK+CEETRARIRVLDGA
Sbjct: 61  HEPSSHTGASDKKWPGWPGDCVFRLIVPVVKVGSIIGRKGDLIKKMCEETRARIRVLDGA 120

Query: 121 VGTPDRVVLISGKEEPEAPLSPAMDAVLRIFKRVSGFSENEDEAKASFCSVRLLVASTQA 180
           VGTPDRVVLISGKEE ++PLSPAMDAV+R+FKRVSG SENEDEAKASF S+RLLVASTQA
Sbjct: 121 VGTPDRVVLISGKEELDSPLSPAMDAVIRVFKRVSGLSENEDEAKASFSSIRLLVASTQA 180

Query: 181 INLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKALEAVVGH 240
           INLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAG DERMVELQGE+LKVLKALE VVGH
Sbjct: 181 INLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGADERMVELQGESLKVLKALEGVVGH 240

Query: 241 LRKFLVDHSVLPLFEKSFNTPASQDRQTDAWADKSSLLSASQSVISNEYPPSSKRESLFF 300
           LRKFLVDHSVLPLFEKSFNTPASQDRQT+ WADKSSLL+ASQS+IS EYPPS+KRESLF 
Sbjct: 241 LRKFLVDHSVLPLFEKSFNTPASQDRQTETWADKSSLLTASQSIISAEYPPSTKRESLFL 300

Query: 301 DRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLSYAEDIIGV 360
           DRETHLDSHISSSGISLYG DRVLP IRSSGVGRSG     QVTQTMQIPLSYAEDIIGV
Sbjct: 301 DRETHLDSHISSSGISLYGQDRVLPTIRSSGVGRSGGS--CQVTQTMQIPLSYAEDIIGV 360

Query: 361 GGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAISGPKEPVTSS 420
           GG NIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEA++ PKEPVTSS
Sbjct: 361 GGTNIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAVNAPKEPVTSS 420

Query: 421 SYGRLDTTGLRSSYSQLGASGSSYTPSSLSSQSYGGYGSSGLGGYSTFRL 471
           SYGRLDTTGLRSSYSQL ASGSSYT SSLSSQSYGGYGSSGLGGY+TFRL
Sbjct: 421 SYGRLDTTGLRSSYSQLAASGSSYTSSSLSSQSYGGYGSSGLGGYTTFRL 468

BLAST of Cp4.1LG07g07550 vs. NCBI nr
Match: gi|703156789|ref|XP_010111553.1| (Poly(rC)-binding protein 3 [Morus notabilis])

HSP 1 Score: 583.6 bits (1503), Expect = 3.2e-163
Identity = 330/480 (68.75%), Positives = 382/480 (79.58%), Query Frame = 1

Query: 1   MATNATTENGSTDADALNSSPQLA-CEAAAEPDSTEAGGAESDIDPSNQDYLSAPASDYV 60
           MATN  T NG T A   +S P  +   AAA   + +A   E      N++  S PA    
Sbjct: 1   MATNEPTANGVTKAPKSDSQPIPSDATAAASTANEKAPVTEPGHATGNRNMESEPAPPGE 60

Query: 61  PHESP-NHAGAS-----DKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRAR 120
             E+P ++AG +     DKKW GWPGDCVFR+IVPV+KVGSIIGR+G+LIKK+CEETRAR
Sbjct: 61  SGEAPASNAGTTATTDADKKWLGWPGDCVFRLIVPVLKVGSIIGRKGELIKKMCEETRAR 120

Query: 121 IRVLDGAVGTPDRVVLISGKEEPEAPLSPAMDAVLRIFKRVSGFSENEDEAKASFCSVRL 180
           IRVLDGAVGTPDR+VLISGKEEPEA LSPAMDAV+R+FKRVSG SENE  A  +FCS+RL
Sbjct: 121 IRVLDGAVGTPDRIVLISGKEEPEAALSPAMDAVIRVFKRVSGLSENE-AAGVAFCSIRL 180

Query: 181 LVASTQAINLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKA 240
           LVASTQAINLIGKQGSLIKSIQESTGASVRVL+GDE+PFYA  DER+VELQGE LKVLKA
Sbjct: 181 LVASTQAINLIGKQGSLIKSIQESTGASVRVLTGDEVPFYAAADERIVELQGEGLKVLKA 240

Query: 241 LEAVVGHLRKFLVDHSVLPLFEKSFNTPASQDRQTDAWADKSSLLSASQSVISNEYPPSS 300
           LEAV+GHLRKFLVDHSV+PLFEKS+N+  SQ+RQ D WADKS L +++Q+ +   YP ++
Sbjct: 241 LEAVIGHLRKFLVDHSVIPLFEKSYNSTISQERQVDTWADKSMLQASTQTGVGTNYPITA 300

Query: 301 KRESLFFDRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLSY 360
           KRE  F +RET L+  + SSGIS+YG D  LP IR +G GR+G PIVTQ+ QTMQIPLSY
Sbjct: 301 KREPYFLERETQLEPQLPSSGISMYGQDTSLPGIRPTGFGRAGAPIVTQIAQTMQIPLSY 360

Query: 361 AEDIIGVGGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAISGP 420
           AEDIIG+ G+NIA+IRR SGAILT+QESRGLPDEITVEIKGTSSQVQ+AQQLIQE I+  
Sbjct: 361 AEDIIGIEGSNIAYIRRTSGAILTVQESRGLPDEITVEIKGTSSQVQLAQQLIQEVINNH 420

Query: 421 KEPVTSSSYGRLDTTGLRSSYSQLGASGSSYTPS-SLSSQSY--GGYGSSGLGGYSTFRL 471
           KEPV SS YGR+DT  LRS+YSQL    SSY P+ SL  +SY  GGYGSSGLGGYSTFRL
Sbjct: 421 KEPVPSS-YGRIDTA-LRSNYSQLS---SSYPPTTSLPPRSYSGGGYGSSGLGGYSTFRL 474

BLAST of Cp4.1LG07g07550 vs. NCBI nr
Match: gi|225445949|ref|XP_002264417.1| (PREDICTED: KH domain-containing protein At4g18375 [Vitis vinifera])

HSP 1 Score: 580.5 bits (1495), Expect = 2.7e-162
Identity = 325/478 (67.99%), Positives = 380/478 (79.50%), Query Frame = 1

Query: 1   MATNATTENGSTDADALNSSPQLACEAAAEP-DSTEAGGAESDIDPSNQDYLSAPASDYV 60
           MAT  TTE    +  A +       E +  P  ++ A  AES+  PS     +  +    
Sbjct: 1   MATTGTTEPAPVNGAAQSPGSDPKTELSETPLSASNAATAESEQAPSE----NLESESTA 60

Query: 61  PHESPNHAGASDKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARIRVLDG 120
           P E+   A AS+KKWPGWPGDCVFR+IVPV+KVGSIIGR+G+LIKK+CEETRARIRVLDG
Sbjct: 61  PPETEAPAPASEKKWPGWPGDCVFRLIVPVLKVGSIIGRKGELIKKMCEETRARIRVLDG 120

Query: 121 AVGTPDRVVLISGKEEPEAPLSPAMDAVLRIFKRVSGFSENEDEAKA------SFCSVRL 180
           AVGT DR+VLISG+EEPEAPLSPAMDAV+R+FKRV+G SE+E + KA      +FCS+RL
Sbjct: 121 AVGTSDRIVLISGREEPEAPLSPAMDAVIRVFKRVTGLSESEGDGKAYGAAGVAFCSIRL 180

Query: 181 LVASTQAINLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETLKVLKA 240
           LVASTQAINLIGKQGSLIKSIQESTGASVRVLSGDE+PFYA  DER+VELQGE LKV KA
Sbjct: 181 LVASTQAINLIGKQGSLIKSIQESTGASVRVLSGDEVPFYAAADERIVELQGEALKVQKA 240

Query: 241 LEAVVGHLRKFLVDHSVLPLFEKSFNTPASQDRQTDAWADKSSLLSASQSVISNEYPPSS 300
           LEAVVGHLRKFLVDHSVLPLFE+++N   SQDRQ+D WADKS L   SQ+ + ++Y   +
Sbjct: 241 LEAVVGHLRKFLVDHSVLPLFERTYNATISQDRQSDTWADKSLLHGTSQTGMGSDYSLPA 300

Query: 301 KRESLFFDRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQIPLSY 360
           KRESL+ DRET ++     SG+ +YG +  L  IRSSG+GR+G PIVTQ+ QTMQIPLSY
Sbjct: 301 KRESLYLDRETQMEH----SGLPMYGQEHGLSGIRSSGLGRAGAPIVTQIAQTMQIPLSY 360

Query: 361 AEDIIGVGGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQEAISGP 420
           AEDIIG+GGANIA+IRR SGAILT+QESRGLPDEITVEIKGTSSQVQ AQQLIQE IS  
Sbjct: 361 AEDIIGIGGANIAYIRRTSGAILTVQESRGLPDEITVEIKGTSSQVQTAQQLIQEFISNH 420

Query: 421 KEPVTSSSYGRLDTTGLRSSYSQLGASGSSYTPSSLSSQSY-GGYGSSGLGGYSTFRL 471
           KEPV  SSYG++D +GLRSSYSQLG   +SY+ SSLSSQ Y GGYGSSG+GGYS++RL
Sbjct: 421 KEPV-PSSYGKMD-SGLRSSYSQLG--NTSYSSSSLSSQPYGGGYGSSGVGGYSSYRL 466

BLAST of Cp4.1LG07g07550 vs. NCBI nr
Match: gi|470120836|ref|XP_004296493.1| (PREDICTED: poly(rC)-binding protein 3 [Fragaria vesca subsp. vesca])

HSP 1 Score: 562.0 bits (1447), Expect = 9.9e-157
Identity = 320/493 (64.91%), Positives = 373/493 (75.66%), Query Frame = 1

Query: 1   MATNATTENGSTDADALNSSPQLACEAAA---EPDSTEAGGAESDIDPSNQDYLSAPASD 60
           MAT   +ENGS      +  P+ A    A   +P STE          +NQD  SAP SD
Sbjct: 1   MATTQPSENGSAKVTEPDPQPESAAATPAPDSDPLSTEI--------QNNQDSESAPKSD 60

Query: 61  Y---VPHESPNHAGASDKKWPGWPGDCVFRIIVPVVKVGSIIGRRGDLIKKICEETRARI 120
                   S +   A+DK+WPGWPGDCVFR+IVPV+KVGSIIGR+G+LIKK+CEETRARI
Sbjct: 61  SEAPATTSSSDAVSAADKRWPGWPGDCVFRLIVPVLKVGSIIGRKGELIKKMCEETRARI 120

Query: 121 RVLDGAVGTPDRVVLISGKEEPEAPLSPAMDAVLRIFKRVSGFSENEDEAKAS------F 180
           RVLDGA GT DR+VLISG+EEPEAPLSPAMDAV+R+FKRVSG SEN  +A+ S      F
Sbjct: 121 RVLDGAAGTTDRIVLISGREEPEAPLSPAMDAVIRVFKRVSGLSENAGDAELSGAAGVAF 180

Query: 181 CSVRLLVASTQAINLIGKQGSLIKSIQESTGASVRVLSGDEMPFYAGPDERMVELQGETL 240
           CS+R+LVASTQAINLIGKQGSLIKSIQEST ASVRVLSG+E+PFYA  DER++E+QGETL
Sbjct: 181 CSIRMLVASTQAINLIGKQGSLIKSIQESTAASVRVLSGEEVPFYAAADERIIEMQGETL 240

Query: 241 KVLKALEAVVGHLRKFLVDHSVLPLFEKSFNTPASQDRQTDAWADKSSLLSASQSVISNE 300
           KVL+ALEAVV HLRKFLVDHSVLPLFEK++  P SQ+RQ D WADKS L +A+Q+V    
Sbjct: 241 KVLRALEAVVSHLRKFLVDHSVLPLFEKTYTAPISQERQPDPWADKSLLHTATQTVGGTS 300

Query: 301 YPPSSKRESLFFDRETHLDSHISSSGISLYGPDRVLPAIRSSGVGRSGVPIVTQVTQTMQ 360
           YP ++ RESLF  RET L+S + SSG+S+YG +  L ++RSSG+GR G PIVTQ+TQTMQ
Sbjct: 301 YPLTATRESLFLGRETQLESQLPSSGLSIYGQEAALSSLRSSGLGRPGAPIVTQITQTMQ 360

Query: 361 IPLSYAEDIIGVGGANIAFIRRNSGAILTIQESRGLPDEITVEIKGTSSQVQMAQQLIQE 420
           IPLSYAEDIIGV G +I FIRR+SGA+LT+QESRGLPDEITVEIKGTSSQVQ AQQLIQE
Sbjct: 361 IPLSYAEDIIGVEGRSIEFIRRSSGALLTVQESRGLPDEITVEIKGTSSQVQAAQQLIQE 420

Query: 421 AISGPKEPVTSSSYGRLDTTGLRSSYSQLGASGSSYTPSSLSSQSYG----------GYG 471
            I+   +    SSYGR+D TGLRSSYSQL    +SY  SS+S   Y            YG
Sbjct: 421 VIATSNKDQIPSSYGRMD-TGLRSSYSQL--DSTSYPSSSVSLLPYDVYGNSLPSQQPYG 480

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)  Description
PEP_ARATH     7.6e-125   57.47         RNA-binding KH domain-containing protein PEPPER OS=Arabidopsis thaliana GN=PEP P...
FLK_ARATH     5.2e-73    41.53         Flowering locus K homology domain OS=Arabidopsis thaliana GN=FLK PE=1 SV=1
HEN4_ARATH    3.1e-17    27.57         KH domain-containing protein HEN4 OS=Arabidopsis thaliana GN=HEN4 PE=1 SV=1
Y4837_ARATH   5.3e-17    25.75         KH domain-containing protein At4g18375 OS=Arabidopsis thaliana GN=At4g18375 PE=2...
PCBP3_HUMAN   9.0e-17    27.51         Poly(rC)-binding protein 3 OS=Homo sapiens GN=PCBP3 PE=2 SV=2

Match Name         E-value    Identity (%)  Description
A0A0A0LMH4_CUCSA   2.1e-222   87.66         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G061550 PE=4 SV=1
W9SIR6_9ROSA       2.2e-163   68.75         Poly(RC)-binding protein 3 OS=Morus notabilis GN=L484_005636 PE=4 SV=1
F6GXK2_VITVI       1.9e-162   67.99         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_11s0052g00030 PE=4 SV=...
A0A067F3I5_CITSI   3.4e-156   65.06         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012307mg PE=4 SV=1
V4U0W5_9ROSI       3.4e-156   65.06         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10008200mg PE=4 SV=1

Match Name    E-value    Identity (%)  Description
AT4G26000.1   4.3e-126   57.47         RNA-binding KH domain-containing protein
AT3G04610.1   2.9e-74    41.53         RNA-binding KH domain-containing protein
AT5G46190.1   2.3e-23    30.09         RNA-binding KH domain-containing protein
AT5G53060.1   2.3e-23    27.70         RNA-binding KH domain-containing protein
AT5G15270.2   4.4e-22    26.85         RNA-binding KH domain-containing protein

Match Name                         E-value    Identity (%)  Description
gi|778667604|ref|XP_004149510.2|   3.0e-222   87.66         PREDICTED: RNA-binding KH domain-containing protein PEPPER [Cucumis sativus]
gi|659069666|ref|XP_008451191.1|   9.1e-219   87.02         PREDICTED: poly(rC)-binding protein 3 [Cucumis melo]
gi|703156789|ref|XP_010111553.1|   3.2e-163   68.75         Poly(rC)-binding protein 3 [Morus notabilis]
gi|225445949|ref|XP_002264417.1|   2.7e-162   67.99         PREDICTED: KH domain-containing protein At4g18375 [Vitis vinifera]
gi|470120836|ref|XP_004296493.1|   9.9e-157   64.91         PREDICTED: poly(rC)-binding protein 3 [Fragaria vesca subsp. vesca]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term         Definition
GO:0003723   RNA binding
GO:0003676   nucleic acid binding
Vocabulary: INTERPRO
Term         Definition
IPR004088    KH_dom_type_1
IPR004087    KH_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010467 gene expression
biological_process GO:0048467 gynoecium development
biological_process GO:0048367 shoot system development
biological_process GO:0006396 RNA processing
biological_process GO:0048577 negative regulation of short-day photoperiodism, flowering
biological_process GO:0048579 negative regulation of long-day photoperiodism, flowering
biological_process GO:0009299 mRNA transcription
biological_process GO:0008150 biological_process
biological_process GO:0016070 RNA metabolic process
biological_process GO:2000028 regulation of photoperiodism, flowering
biological_process GO:0048585 negative regulation of response to stimulus
biological_process GO:2000242 negative regulation of reproductive process
biological_process GO:0048581 negative regulation of post-embryonic development
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG07g07550.1   Cp4.1LG07g07550.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description             Source    Source Term          Source Description                                      Alignment
IPR004087   K Homology domain           SMART     SM00322              kh_6
    coord: 341..411   score: 1.9E-11
    coord: 167..242   score: 6.6E-10
    coord: 79..152    score: 3.4
IPR004088   K Homology domain, type 1   GENE3D    G3DSA:3.30.1370.10
    coord: 341..412   score: 1.0E-14
    coord: 70..135    score: 1.9E-16
    coord: 168..243   score: 8.7
IPR004088   K Homology domain, type 1   PFAM      PF00013              KH_1
    coord: 171..237   score: 7.5E-10
    coord: 344..408   score: 3.3E-12
    coord: 83..135    score: 2.6
IPR004088   K Homology domain, type 1   PROFILE   PS50084              KH_TYPE_1
    coord: 168..237   score: 13.691
    coord: 342..406   score: 14.424
    coord: 80..147    score: 14
IPR004088   K Homology domain, type 1   unknown   SSF54791             Eukaryotic type KH-domain (KH-domain type I)
    coord: 165..248   score: 2.19E-12
    coord: 74..155    score: 6.43E-14
    coord: 337..412   score: 7.74
None        No IPR available            PANTHER   PTHR10288            KH DOMAIN CONTAINING RNA BINDING PROTEIN
    coord: 65..467    score: 8.4E-217
    coord: 29..48     score: 8.4E
None        No IPR available            PANTHER   PTHR10288:SF132      ZINC FINGER CCCH DOMAIN-CONTAINING PROTEIN 40-RELATED
    coord: 65..467    score: 8.4E-217
    coord: 29..48     score: 8.4E

The following gene(s) are paralogous to this gene:
Gene              Paralogue         Organism                    Block
Cp4.1LG07g07550   Cp4.1LG11g06100   Cucurbita pepo (Zucchini)   cpecpeB150