Cp4.1LG07g09850 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG07g09850
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Poly(RC)-binding protein, putative
Location: Cp4.1LG07 : 8745676 .. 8751552 (+)
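
The location field can be read programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split 'Cp4.1LG07 : 8745676 .. 8751552 (+)' into its four parts.

    Returns (sequence id, start, end, strand). Hypothetical helper.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG07 : 8745676 .. 8751552 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 5877 bases on the plus strand of Cp4.1LG07.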
The following sequences are available for this feature:

Gene sequence (with intron)

TATACCATTGTGGCCGCACGCCTCCTCTCTCTCTCTCTCTATCTCTCTCTCTCTCTTCCTCGCCAAATCTCTAAAACCCTAGAAGGCCACATTCGAGGTTCCAGATTTTCCAGTTCGAATTACTCCGTTCTCTCATAATTTTTGTACCTCGCTTCGATTCTTCAGTAGGTATGATAAAATAGGTTTATTTGCTTCGTGAATTTGATGTCTAGGGTTTGTTTGTTTGTTTGTTTGTTCGCGTGCTTTGTTATCTGTTCTGTTCGTCTATGAAGTTTTTATTTTTCCGTTTATATTTGTGATGGAAGTTGCGGAATGTTTTTAGTTTGTTTGGATTGGTTTCGTTTCAGATGGGTAGGATTCATTTTGTTGGGAATAAGCCAGGAGATTTCGTGTTTCTTGGATTTTTATCGTTGACTTGGAATTTTCTAATTTTTGTATGTTCTACTGGTGAGATTTCCCTTGTCTTGTCAATTGTCAAGTTCACAGTAGATTTTTTGTTTTCAATTCTGGATTATGATACTGTCGTTCTGTCCAGAAAGGAAAAGAATATATATATAAGCTGGAAATTTCCATATTCGAACACTTGAGGAGTGCATGTCTTATTGCTAATTTCACTTGTTAATATGAGTTGATGGCATGAGGATTTTCAGGGTAGTTTTATATTTGTTTACTGAAATTTTATAGGTTATGCACTTTGGTGCGAATGAGGGAAAGTTTTCTATTTTAATTGTTTTACTTATATATGGTACTTTGTTGCAGCTTAATCTGTAAGGAACACATGGCTGATGTGGAGCAGAGCTACACGCATGTCGACGATAATGAAGTTGATGAGACTGCACAAATATATGAGAACGCACAATTATATGAGAATGCTCAGGTACAGGACAATACACTAGAACAGGAGAGTACAGAGGAGCCAGAAAACTTGCATGGACTTGAGAATGAACATGTACCAGAAACCTTAGAGTTGGAGCCAAAAGAGGTGCATAATGAGGATACTCTGGCTGTAGTAGGTGAGAAGAAGTGGCCTGGATGGCCAGGAGAGAGTGTTTTTCGAATGTTGGTTCCTGCTCAAAAGGTTGGTAGTATAATTGGTCGCAAAGGTGAGTTCATCAAGAAAATAGTCGAGGAAACGAGAGCTCGGATTAAAATTCTTGACGGTCCTCCTGGAACAGCTGAAAGAGCTGTAAGTTTTCCTTCCTCCAGATAAGAACTTTTCTTTAATAATTTAGTAGTATTTTTGTGAAAATTATTTGGATTTAAGGAGGTGCAAATGATTTAATATCAAATTAATAGTATCTATCCATTTTTTTTTTTATATAATTTTTTTAATTCTAATGGATATTTTGATGATTCATTAACAATTATTGTGTTAGGTTTTCTTAATTTTGTATTCTCAATATGTTTAGCACCTTGCTTCAAGGTAGGCAAATATTGGGCATTATATTGCCTAATTAGCCCAATCTGGTCGAGCATCAAGGTTGGGTGATTATTTTTAGTTAGTTGTTTAATCTTGATTGTACTCCTGTTATATTCAAGCATGTGACATCAGTTATATGGCTGTGTAATGATGGAAATATGACACTAAAATGAAAATTCAGACCTTAAAGAGTTAATAAAATATTATCAATAATAACTTGATTAGAACTTGTACTAAAAGCCATTAAATCGTTGATTAAAAAAATTAAAGTTAATATTCGAAAATTCTTTCGAGTCATGGAACAGACATTTGATTATAAGAAACCATTTGACACACTTAAAATGGATCTGCTAAGACTTATAGGGAAGATTGAACTCCATCTTTCAAGATATCTGTGCTTCTACACCCCATCAAACTGATTCTCTTACAGCTCTATAACTGCCATAAGGAAGTGTCCAGAAACTAAGATTAACCCTTTAAACTCATAAAAGCAAAATCATAACTAAAACAGAAAGGAATGTAAGAGTATTCCAGGTAATCCCGAATATGGATACAGTTGCCTTGATTGTAGTAACAAAAA
ATGAATCTGTGCAGGTTGTGTGCAAAATTTAAGGCAACTACTCAATCTTCAATAGGATCCCATATTTGAAGAACCAAAGATCCAGATGCGGAGATCAGCATTCAACATTACAAAGTTCTGAAAAGTTGATCTGAATAAAGGCATCTGGAACTTTTGTACCATGAAGCACTGCAATATCTCTGGTAACTTCAGATCTGAAGGTTGATGTCAAGATTTCCAACGGGATAATAATATGAAAAATGGAGAAAACATAATGATAAATGTTAAGTCTTATAAAAAAAATAATAATGATAAATGTTAAGAGTTAAAATTTTTAGTAATTTAAAAGATGTTTATACGCAATTAAAGATCTGCTTAAAATATAAATTGAGGCACTAAAACTGGCCAATCAATATCGAATAGAAGTGATGTATTTTTGCTTATGCTTCTATTGTGTCTTTGCCAATAACTGGGTCTATAATCCAACAACATATAAGGTAATGTGAAAACCTGAACTCATTAATTTAAGGAGGAAAACATAACTATATTATCTAGTTTCTTGAAACATAGAAAGCTGGTAAATTTCAGCATTTGCCCTTAGATATATTTGCGGCCTCTAAGTTGTCTCGGGCATGATATATAGGAAGAAATCATCCACCATTTTGGCCGTTGCCTTTTCAGATATATAGTTCGGGAGAAACTATTTTTGAATTGTTTGGTCTGTCTTTTAGTTTTGGCAACTTTATTATTCTAATTGCAAGGTGCTCAAAATTTGATGAAGACGTTAAGCACGTTTGATGACTAAGGGTTTGGACACCTAGTGGGAGATCTAAATGGAAATTAATATGAATAGTTTTTTTTTTTTTTATGGTTCTCACTTGCAGATAAATTTAATTTTTCGTTGTAGACCTTACAAATTGGGAGGTGCTCTTTTTATTATTTCCATTTTTTTCTACTATTGGATGCACAACACGATCCATCAATGCATATATCATTTGATACAACTGGAAGTTTGTGCAAGTAAATTGATCTGCTAGTGTATTTAATAGTGAATTTGTATCATTATAAGTTTTGGATGGTGATCCTTGGTTATTTTCTTATAGATATTATGAACTAATCCTATCTTTTCATTGGTTGGGCTTTTTTTGTTCGTGTTTTCCCTGTCGTGAGCTCATTAGCTGTCTTTGGAATTCGACAAATTGTTTTTGCTTTTTATTTGTTATATGAGTGATTCTGTCTTTGTAGAATGGGGAATTTCAGGTTCCAGTACTTCATAATACTCATTTGATTTTTTTTGTTGTTGTGATAATTGGTCAATTTATGATGAAAATTTGTAGTTCTCACAAGATGGTATGCTTCATTTAATAGGTAATGGTGTCTGCGAAGGAGGAACCCGATTCTGATTTTCCTCCTGCGGTGGATGGACTTCTGAGGGTTCATAAACGTATTGTTGATGGTCTGGAGGGTGATAATGCTAATGCTCCCAACATGGGAGGCAAAGTCTCAACGAGACTGCTAGTCGCAGCTTCACAAGCAGGAAGTTTAATCGGGAAGCAGGGAGGGACCGTTAAATCCATTCAAGAAGAGTCAAATTGTATAGTTAGAGTTCTTGGATCAGGTGGACTTCTCTGGATGTCATTTGATTCTCTGCTTTATCGTTCTTATACTATGTTCTTCATTGGTTTCTGAGTTGATAATTATATACTCCATTTTGGTTTTCCATTTTCATTGCATTCTTATACTTGTCACCCGGGAACTTTCTGCAGAAGATTTGCCTGTTTTTGCTCTCCAAGACGATAGAGTTGTTGAAGTACTTGGCGATCCAGCTGGTGTGCACAAAGCCGTGGAACTCATTGCTTCTCATCTGAGAAAGTTTCTGGTTGACCGAAGCATTATCCCGGTTTTTGAAATGAATGTGAGTATGATTTTCTAATGATTTTTCGAACACTTCTACTTTTAATAGTATTGAAGTCTTAAACATTCCTATTCTTCCTTCCAACCCAAATGCCCAAAAATCAGAT
GCAGATGACAAATCAACAGATGGAGCACATGCCACCACCACACCAATCTTGGGGCCCACCTCAAGGAATGCCACCAAATGCTGGGGGTGGTCCTGGCTTCGGACCGAACCCTCCATACATGCCACCACCTCGTCAATTCGACAATTATTATCCACCTGCTGATATGCCATCGGCTATGGATAAGCAACCTCATCATGGAATTTCTGCTTATGGTAGGGAAGCTCCTATGGGTGTTCATCCGCAAACAAATGCACAGTCAGTGCCTTCAATTGTCACACAGGTTTGATAAGCCTTCGAACTCCTATATTTGATATGTTTTTGCAAACTCTTCAAAGTTTTTCAAGTATTTCCCTTTTTTTTTATTGCTAAGACAAGAAACAAGTTAACTGCATGCCATCATCTGAGTCATCATGTTACAAGTGTTCGCCCATTAAACACTTGCCATTTGGCACGATACCATGGTTCCTTAGATCTCCATTTTTCCTCCATAGTTGGGAGAACCCTTGATAGACTTGGTGAATATTTTATGATATTCAGAATGTTTTTTGTTGTACTTGCAGACAACCCAACAAATGCAGATTCCATTATCTTTTGCTGACGCTGTCATTGGAACTGCTGGCGCTAGTATAAGCTACATTCGTCGTGCTAGCGGGGCTACCGTTACTATCCAAGAAACGAGGGGAGTCCCTGGAGAAATGACGGTAGAAATGAGTGGAACTGCATCTCAAGTTCAAGCAGCCCAGCAGCTTATACAGGCAATTCTTTTTTCTTGCTTGTTTCGATTTTGCATCTTCTTGATGAAAACACTAAATATAATGTCATCTTCTCGATGATTTTGCATCGTTCCCTTTATTTTCGGAGATAAATGGCATCGTCATAGTTTCTTAAACATACTCATTAGACCTGATCAATATTAATTTGAAGGCCACGTTTTCTGCTCGTCGATGCAATCTATTCCCTTTAACATTCTGTTTTTAATTGTGTAACAGAATTTTATGGCCGGTGCTGGAGCTCCAGCGCAGACTCAAGCTGGGGGTTCGGCAGACCAAGGGTACAACTCTTACGCTGCTCACGGTTCAGTCTACGCGTCACCCCCTGCAGCAAACCCAGCTCATGCAAGTCATGCTGGAGGCTATGGCTCTGTCTACAGTACGAATTACGGGTACTAAGCCAGCCTTTGTACGATGTAAAATTCTCCCCAACTTGGCGTGGTTGCGGCGATCACTCAAAACCTCCATCATTAGTAAGCTTTGCATCATGTAGCCTCCTTTTAAACAGATTCTACCTGAAAACCTTTTGAACCATTCTGCATTAGTAGCCTATTTGTATGATTTAGCACGAAATCTCTCTCTGAAATTTAGCACCAAGTTCTTTTATTTTACTTATAACATGTGATGATGAACGGACATTTTAAGAGGAGCTGCTACACTGTCGTACCCGAAAAATCGAAGAAGAATGAGATTAGAGTATCTATTAGATCCTTTTTAAAATGCAGGTGTTGAATGGCTGCTGCGGTTGGAGCTAACTAATGAGATTAGAGTATCTACCCCAAGGCAAGCCTCTCTCCTTTGCTTTTGATTCTTAGTGTTCTTATTTTTTTTGTTTTTGTTTCTGTTTCAAAGACTACCAAATAACATTTTTTTTTGTTCGAAATGATGATAAAATAAAGTCGTGATTATGATTTGTAAAATGGTGGGCGATTTTCGATTGACTTAGTAGTCGTAAGAGCTTTGGAGAAATAAGTTGGAATGGGTTCAATTCATGGAAGAAGTCGATACTCGAATGTTATTGGGTAAGGTATACTATAAGAACGAGTTTCATTTCTACTTTACGATAAGAAGTCGATACTTTACGAGTTTCATTCTTCTTGTATTCTT

mRNA sequence

TATACCATTGTGGCCGCACGCCTCCTCTCTCTCTCTCTCTATCTCTCTCTCTCTCTTCCTCGCCAAATCTCTAAAACCCTAGAAGGCCACATTCGAGGTTCCAGATTTTCCAGTTCGAATTACTCCGTTCTCTCATAATTTTTGTACCTCGCTTCGATTCTTCAGTAGCTTAATCTGTAAGGAACACATGGCTGATGTGGAGCAGAGCTACACGCATGTCGACGATAATGAAGTTGATGAGACTGCACAAATATATGAGAACGCACAATTATATGAGAATGCTCAGGTACAGGACAATACACTAGAACAGGAGAGTACAGAGGAGCCAGAAAACTTGCATGGACTTGAGAATGAACATGTACCAGAAACCTTAGAGTTGGAGCCAAAAGAGGTGCATAATGAGGATACTCTGGCTGTAGTAGGTGAGAAGAAGTGGCCTGGATGGCCAGGAGAGAGTGTTTTTCGAATGTTGGTTCCTGCTCAAAAGGTTGGTAGTATAATTGGTCGCAAAGGTGAGTTCATCAAGAAAATAGTCGAGGAAACGAGAGCTCGGATTAAAATTCTTGACGGTCCTCCTGGAACAGCTGAAAGAGCTGTAATGGTGTCTGCGAAGGAGGAACCCGATTCTGATTTTCCTCCTGCGGTGGATGGACTTCTGAGGGTTCATAAACGTATTGTTGATGGTCTGGAGGGTGATAATGCTAATGCTCCCAACATGGGAGGCAAAGTCTCAACGAGACTGCTAGTCGCAGCTTCACAAGCAGGAAGTTTAATCGGGAAGCAGGGAGGGACCGTTAAATCCATTCAAGAAGAGTCAAATTGTATAGTTAGAGTTCTTGGATCAGAAGATTTGCCTGTTTTTGCTCTCCAAGACGATAGAGTTGTTGAAGTACTTGGCGATCCAGCTGGTGTGCACAAAGCCGTGGAACTCATTGCTTCTCATCTGAGAAAGTTTCTGGTTGACCGAAGCATTATCCCGATGACAAATCAACAGATGGAGCACATGCCACCACCACACCAATCTTGGGGCCCACCTCAAGGAATGCCACCAAATGCTGGGGGTGGTCCTGGCTTCGGACCGAACCCTCCATACATGCCACCACCTCGTCAATTCGACAATTATTATCCACCTGCTGATATGCCATCGGCTATGGATAAGCAACCTCATCATGGAATTTCTGCTTATGGTAGGGAAGCTCCTATGGGTGTTCATCCGCAAACAAATGCACAGTCAGTGCCTTCAATTGTCACACAGACAACCCAACAAATGCAGATTCCATTATCTTTTGCTGACGCTGTCATTGGAACTGCTGGCGCTAGTATAAGCTACATTCGTCGTGCTAGCGGGGCTACCGTTACTATCCAAGAAACGAGGGGAGTCCCTGGAGAAATGACGGTAGAAATGAGTGGAACTGCATCTCAAAATTTTATGGCCGGTGCTGGAGCTCCAGCGCAGACTCAAGCTGGGGGTTCGGCAGACCAAGGGTACAACTCTTACGCTGCTCACGGTTCAGTCTACGCGTCACCCCCTGCAGCAAACCCAGCTCATGCAAGTCATGCTGGAGGCTATGGCTCTGTCTACAGTACGAATTACGGGTACTAAGCCAGCCTTTGTACGATGTAAAATTCTCCCCAACTTGGCGTGGTTGCGGCGATCACTCAAAACCTCCATCATTAGTAAGCTTTGCATCATGTAGCCTCCTTTTAAACAGATTCTACCTGAAAACCTTTTGAACCATTCTGCATTAGTAGCCTATTTGTATGATTTAGCACGAAATCTCTCTCTGAAATTTAGCACCAAGTTCTTTTATTTTACTTATAACATGTGATGATGAACGGACATTTTAAGAGGAGCTGCTACACTGTCGTACCCGAAAAATCGAAGAAGAATGAGATTAGAGTATCTATTAGATCCTTTTTAAAATGCAGGTGTTGAATGGCTGCTGCGGTTGGAGCTAACTAATGAGATTAGAGTATCTACCCCAAGGCAAGCCTCTCTC
CTTTGCTTTTGATTCTTAGTGTTCTTATTTTTTTTGTTTTTGTTTCTGTTTCAAAGACTACCAAATAACATTTTTTTTTGTTCGAAATGATGATAAAATAAAGTCGTGATTATGATTTGTAAAATGGTGGGCGATTTTCGATTGACTTAGTAGTCGTAAGAGCTTTGGAGAAATAAGTTGGAATGGGTTCAATTCATGGAAGAAGTCGATACTCGAATGTTATTGGGTAAGGTATACTATAAGAACGAGTTTCATTTCTACTTTACGATAAGAAGTCGATACTTTACGAGTTTCATTCTTCTTGTATTCTT

Coding sequence (CDS)

ATGGCTGATGTGGAGCAGAGCTACACGCATGTCGACGATAATGAAGTTGATGAGACTGCACAAATATATGAGAACGCACAATTATATGAGAATGCTCAGGTACAGGACAATACACTAGAACAGGAGAGTACAGAGGAGCCAGAAAACTTGCATGGACTTGAGAATGAACATGTACCAGAAACCTTAGAGTTGGAGCCAAAAGAGGTGCATAATGAGGATACTCTGGCTGTAGTAGGTGAGAAGAAGTGGCCTGGATGGCCAGGAGAGAGTGTTTTTCGAATGTTGGTTCCTGCTCAAAAGGTTGGTAGTATAATTGGTCGCAAAGGTGAGTTCATCAAGAAAATAGTCGAGGAAACGAGAGCTCGGATTAAAATTCTTGACGGTCCTCCTGGAACAGCTGAAAGAGCTGTAATGGTGTCTGCGAAGGAGGAACCCGATTCTGATTTTCCTCCTGCGGTGGATGGACTTCTGAGGGTTCATAAACGTATTGTTGATGGTCTGGAGGGTGATAATGCTAATGCTCCCAACATGGGAGGCAAAGTCTCAACGAGACTGCTAGTCGCAGCTTCACAAGCAGGAAGTTTAATCGGGAAGCAGGGAGGGACCGTTAAATCCATTCAAGAAGAGTCAAATTGTATAGTTAGAGTTCTTGGATCAGAAGATTTGCCTGTTTTTGCTCTCCAAGACGATAGAGTTGTTGAAGTACTTGGCGATCCAGCTGGTGTGCACAAAGCCGTGGAACTCATTGCTTCTCATCTGAGAAAGTTTCTGGTTGACCGAAGCATTATCCCGATGACAAATCAACAGATGGAGCACATGCCACCACCACACCAATCTTGGGGCCCACCTCAAGGAATGCCACCAAATGCTGGGGGTGGTCCTGGCTTCGGACCGAACCCTCCATACATGCCACCACCTCGTCAATTCGACAATTATTATCCACCTGCTGATATGCCATCGGCTATGGATAAGCAACCTCATCATGGAATTTCTGCTTATGGTAGGGAAGCTCCTATGGGTGTTCATCCGCAAACAAATGCACAGTCAGTGCCTTCAATTGTCACACAGACAACCCAACAAATGCAGATTCCATTATCTTTTGCTGACGCTGTCATTGGAACTGCTGGCGCTAGTATAAGCTACATTCGTCGTGCTAGCGGGGCTACCGTTACTATCCAAGAAACGAGGGGAGTCCCTGGAGAAATGACGGTAGAAATGAGTGGAACTGCATCTCAAAATTTTATGGCCGGTGCTGGAGCTCCAGCGCAGACTCAAGCTGGGGGTTCGGCAGACCAAGGGTACAACTCTTACGCTGCTCACGGTTCAGTCTACGCGTCACCCCCTGCAGCAAACCCAGCTCATGCAAGTCATGCTGGAGGCTATGGCTCTGTCTACAGTACGAATTACGGGTACTAA

Protein sequence

MADVEQSYTHVDDNEVDETAQIYENAQLYENAQVQDNTLEQESTEEPENLHGLENEHVPETLELEPKEVHNEDTLAVVGEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPGTAERAVMVSAKEEPDSDFPPAVDGLLRVHKRIVDGLEGDNANAPNMGGKVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDPAGVHKAVELIASHLRKFLVDRSIIPMTNQQMEHMPPPHQSWGPPQGMPPNAGGGPGFGPNPPYMPPPRQFDNYYPPADMPSAMDKQPHHGISAYGREAPMGVHPQTNAQSVPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSGTASQNFMAGAGAPAQTQAGGSADQGYNSYAAHGSVYASPPAANPAHASHAGGYGSVYSTNYGY
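
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch only: it assumes the standard nuclear genetic code, the function name `translate` is hypothetical, and only the first 45 and last 15 bases of the CDS are used as inputs to keep the example short.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3].upper()
        # Index into the 64-letter table from the three base positions.
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGGCTGATGTGGAGCAGAGCTACACGCATGTCGACGATAATGAA"  # first 45 bases of the CDS
cds_end = "AATTACGGGTACTAA"                                  # last 15 bases of the CDS
print(translate(cds_start))  # MADVEQSYTHVDDNE
print(translate(cds_end))    # NYGY
```

Both fragments agree with the corresponding ends of the listed protein sequence (MADVEQSYTHVDDNE... and ...NYGY).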
BLAST of Cp4.1LG07g09850 vs. Swiss-Prot
Match: FLK_ARATH (Flowering locus K homology domain OS=Arabidopsis thaliana GN=FLK PE=1 SV=1)

HSP 1 Score: 521.2 bits (1341), Expect = 1.2e-146
Identity = 302/474 (63.71%), Positives = 343/474 (72.36%), Query Frame = 1

Query: 18  ETAQIYENAQLYENAQVQDNTLEQESTEEPENLHGLENEHVPETLELEPKEVHNEDTLAV 77
           E  Q+ + A      Q Q +  +    E  + +   + E +PE LE   K    ED    
Sbjct: 117 EQFQLQDEAHDQAQYQAQGDVQDHNGDEVQDKVE--DEEGIPEHLESLQKSEPEEDATVG 176

Query: 78  VGEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPGTAERAV 137
             EK+WPGWPGE+VFRMLVPAQKVGSIIGRKG+ IKKIVEETRARIKILDGPPGT ERAV
Sbjct: 177 GEEKRWPGWPGETVFRMLVPAQKVGSIIGRKGDVIKKIVEETRARIKILDGPPGTTERAV 236

Query: 138 MVSAKEEPDSDFPPAVDGLLRVHKRIVDGLEGDNANAPNMGGKVSTRLLVAASQAGSLIG 197
           MVS KEEP+S  PP++DGLLRVH RIVDGL+G+ + AP    KVSTRLLV ASQAGSLIG
Sbjct: 237 MVSGKEEPESSLPPSMDGLLRVHMRIVDGLDGEASQAPPPS-KVSTRLLVPASQAGSLIG 296

Query: 198 KQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDPAGVHKAVELIASHLRKFL 257
           KQGGTVK+IQE S CIVRVLGSEDLPVFALQDDRVVEV+G+P  VH+A+ELIASHLRKFL
Sbjct: 297 KQGGTVKAIQEASACIVRVLGSEDLPVFALQDDRVVEVVGEPTSVHRALELIASHLRKFL 356

Query: 258 VDRSIIPM-------TNQQMEHMPPPHQSWGPPQGMPPNAGGGPGFGPN-PPYMPPPRQF 317
           VDRSIIP          +QM+HMPPPHQSWGPPQG  P+ GGG G+G N PPYM PP + 
Sbjct: 357 VDRSIIPFFENQMQKPTRQMDHMPPPHQSWGPPQGHAPSVGGG-GYGHNPPPYMQPPPRH 416

Query: 318 DNYYPPADM-PSAMDKQPHHGISAYGREAPMGVHPQTNAQSVPSIVTQTTQQMQIPLSFA 377
           D+YYPP +M    M+KQPH GISAYGRE PM VH    + + P +  Q TQQMQIPLS+A
Sbjct: 417 DSYYPPPEMRQPPMEKQPHQGISAYGREPPMNVHV---SSAPPMVAQQVTQQMQIPLSYA 476

Query: 378 DAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSGTAS---------QNFM--AG 437
           DAVIGT+G++ISY RR SGATVTIQETRGVPGEMTVE+SGT S         QNFM  AG
Sbjct: 477 DAVIGTSGSNISYTRRLSGATVTIQETRGVPGEMTVEVSGTGSQVQTAVQLIQNFMAEAG 536

Query: 438 AGAPAQTQAGGSADQGYNSYAAHGSVYASPPAANPAHASHAGGYGSVYSTNYGY 472
           A APAQ Q      QGYN YA HGSVYA+ P   P      GGY + YS+ YGY
Sbjct: 537 APAPAQPQTVAPEQQGYNPYATHGSVYAAAPTNPP------GGYATDYSSGYGY 577
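
The percentages in an HSP summary line follow directly from the reported counts. The sketch below recomputes them for the hit above; the function name `hsp_percentages` is hypothetical, and the regular expression is an assumption about the line layout seen on this page rather than a general BLAST parser.

```python
import re

def hsp_percentages(summary_line):
    """Recompute identity and positive percentages from an HSP summary line.

    Returns (identity_pct, positives_pct), each rounded to two decimals.
    The pattern tolerates a misspelled 'Positives' label.
    """
    pattern = r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)"
    m = re.search(pattern, summary_line)
    if m is None:
        raise ValueError(f"unrecognized summary line: {summary_line!r}")
    identical, length_a, positives, length_b = (int(g) for g in m.groups())
    return (round(100 * identical / length_a, 2),
            round(100 * positives / length_b, 2))

line = "Identity = 302/474 (63.71%), Positives = 343/474 (72.36%), Query Frame = 1"
print(hsp_percentages(line))  # (63.71, 72.36)
```

Here 302 identical and 343 similar positions over an alignment length of 474 reproduce the 63.71% and 72.36% shown in the report.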

BLAST of Cp4.1LG07g09850 vs. Swiss-Prot
Match: PEP_ARATH (RNA-binding KH domain-containing protein PEPPER OS=Arabidopsis thaliana GN=PEP PE=1 SV=1)

HSP 1 Score: 239.2 bits (609), Expect = 9.2e-62
Identity = 146/346 (42.20%), Positives = 212/346 (61.27%), Query Frame = 1

Query: 80  EKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPGTAERAVMV 139
           E++WPGWPG+ VFRM+VP  KVG+IIGRKG+FIKK+ EETRARIK+LDGP  T +R V++
Sbjct: 64  EERWPGWPGDCVFRMIVPVTKVGAIIGRKGDFIKKMCEETRARIKVLDGPVNTPDRIVLI 123

Query: 140 SAKEEPDSDFPPAVDGLLRVHKRIVDGLEGDNANAPNMGGKVST-RLLVAASQAGSLIGK 199
           S KEEP++   PA+D +LRV +R+    + D+ +  N G   S+ RLLVA++QA +LIGK
Sbjct: 124 SGKEEPEAYMSPAMDAVLRVFRRVSGLPDNDDDDVQNAGSVFSSVRLLVASTQAINLIGK 183

Query: 200 QGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDPAGVHKAVELIASHLRKFLV 259
           QG  +KSI E S   VR+L  E+ P +A QD+R+V++ G+   + KA+E I  HLR+FLV
Sbjct: 184 QGSLIKSIVENSGASVRILSEEETPFYAAQDERIVDLQGEALKILKALEAIVGHLRRFLV 243

Query: 260 DRSIIPMTNQ-------QMEHMPPPHQSWGPPQGMPPNAGGGPGFG----PNPPYMPPPR 319
           D +++P+  +       Q     P  +S      +  N    P F       P ++    
Sbjct: 244 DHTVVPLFEKQYLARVSQTRQEEPLAESKSSLHTISSNL-MEPDFSLLARREPLFLERDS 303

Query: 320 QFDNYYPPADMPSAMDKQPHHGISAYGREAPMGV-HPQTNAQSVPSIVTQTTQQMQIPLS 379
           + D+   P+            G+S Y ++  +   H    A+   + VTQ +Q MQIP S
Sbjct: 304 RVDSRVQPS------------GVSIYSQDPVLSARHSPGLARVSSAFVTQVSQTMQIPFS 363

Query: 380 FADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSGTASQ 413
           +A+ +IG  GA+I+YIRR SGAT+TI+E+   P ++TVE+ GT SQ
Sbjct: 364 YAEDIIGVEGANIAYIRRRSGATITIKESPH-PDQITVEIKGTTSQ 395

BLAST of Cp4.1LG07g09850 vs. Swiss-Prot
Match: Y4837_ARATH (KH domain-containing protein At4g18375 OS=Arabidopsis thaliana GN=At4g18375 PE=2 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 1.6e-13
Identity = 94/344 (27.33%), Positives = 157/344 (45.64%), Query Frame = 1

Query: 91  VFRMLVPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPGTAERAVMV--SAKEEPD-- 150
           V+R+L P   VG +IG+ G+ I  I   T+A+IK+ D   G ++R + +  S KE+ +  
Sbjct: 37  VYRILCPIDVVGGVIGKSGKVINAIRHNTKAKIKVFDQLHGCSQRVITIYCSVKEKQEEI 96

Query: 151 ----SDFPP---AVDGLLRVHKRIVDGLEGDNANA-PNMGGKVSTRLLVAASQAGSLIGK 210
               S+  P   A D LL+V+  IV   E +N     +       RLLV  SQ+ SLIGK
Sbjct: 97  GFTKSENEPLCCAQDALLKVYDAIVASDEENNTKTNVDRDDNKECRLLVPFSQSSSLIGK 156

Query: 211 QGGTVKSIQEESNCIVRVLG---SEDLPVFALQDDRVVEVLGDPAGVHKAVELIASHLRK 270
            G  +K I+  +   V+V+    S+   V A++ D VV + G+P  V +A+  +++ + K
Sbjct: 157 AGENIKRIRRRTRASVKVVSKDVSDPSHVCAMEYDNVVVISGEPESVKQALFAVSAIMYK 216

Query: 271 FLVDRSIIPMTNQQMEHMPPPHQSWGPPQGMPPNAGGGPGFGPNPPYMPPPRQFDNYYPP 330
            +  R  IP+ +   +    P  S   P  +  +     GF  N  ++            
Sbjct: 217 -INPRENIPLDSTSQD---VPAASVIVPSDLSNSVYPQTGFYSNQDHI--------LQQG 276

Query: 331 ADMPSAMDKQPHHGISAYGREA--PMGVH----PQTNAQSVPSIVTQTTQQMQIPLSFAD 390
           A +PS  +         Y   A  P+ V     P T+     S   +   ++  PL    
Sbjct: 277 AGVPSYFNALSVSDFQGYAETAANPVPVFASSLPVTHGFGGSSRSEELVFKVLCPLCNIM 336

Query: 391 AVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSGTASQN 414
            VIG  G++I  IR ASG+ + + ++R   G+    +  TA+++
Sbjct: 337 RVIGKGGSTIKRIREASGSCIEVNDSRTKCGDDECVIIVTATES 368

BLAST of Cp4.1LG07g09850 vs. Swiss-Prot
Match: HEN4_ARATH (KH domain-containing protein HEN4 OS=Arabidopsis thaliana GN=HEN4 PE=1 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 2.7e-13
Identity = 54/179 (30.17%), Positives = 92/179 (51.40%), Query Frame = 1

Query: 88  GESVFRMLVPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPGTAERAVMVSAKEEPDS 147
           G + FR+L P   VG++IG+ G  IK++ + T A+I++ + P G+ +R + + A+ +  S
Sbjct: 45  GHAAFRLLCPLSHVGAVIGKSGNVIKQLQQSTGAKIRVEEPPSGSPDRVITIIAQADSKS 104

Query: 148 DFPPA------VDGLLRVHKRIVDGLEGDNANAPNMGGK------VSTRLLVAASQAGSL 207
                       +G  +  +  V   +G       +         V  RLL  +S AG++
Sbjct: 105 RVKLGANNNGNAEGEKKEEEVEVSKAQGALIKVFELLAAEADSDTVVCRLLTESSHAGAV 164

Query: 208 IGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDPAGVHKAVELIASHLR 255
           IGK G  V SI++E+ C + +   E+LP+ A  DD +VEV G+   V KA+  I+  L+
Sbjct: 165 IGKGGQMVGSIRKETGCKISI-RIENLPICADTDDEMVEVEGNAIAVKKALVSISRCLQ 222

BLAST of Cp4.1LG07g09850 vs. Swiss-Prot
Match: PCBP1_MOUSE (Poly(rC)-binding protein 1 OS=Mus musculus GN=Pcbp1 PE=1 SV=1)

HSP 1 Score: 76.3 bits (186), Expect = 1.0e-12
Identity = 94/341 (27.57%), Positives = 154/341 (45.16%), Query Frame = 1

Query: 93  RMLVPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPGTAERAVMVSAKEEPDSDFPPA 152
           R+L+  ++VGSIIG+KGE +K+I EE+ ARI I +G     ER + ++            
Sbjct: 17  RLLMHGKEVGSIIGKKGESVKRIREESGARINISEG--NCPERIITLTG----------P 76

Query: 153 VDGLLRVHKRIVDGLEGD-NANAPNMGG----KVSTRLLVAASQAGSLIGKQGGTVKSIQ 212
            + + +    I+D LE D N++  N        V+ RL+V A+Q GSLIGK G  +K I+
Sbjct: 77  TNAIFKAFAMIIDKLEEDINSSMTNSTAASRPPVTLRLVVPATQCGSLIGKGGCKIKEIR 136

Query: 213 EESNCIVRVLGSEDLPVFALQDDRVVEVLGDPAGVHKAVELIASHLRKFLVDR---SIIP 272
           E +   V+V G     +     +R + + G P  V + V+ I   + + L       ++ 
Sbjct: 137 ESTGAQVQVAGD----MLPNSTERAITIAGVPQSVTECVKQICLVMLETLSQSPQGRVMT 196

Query: 273 MTNQQMEHMPPPHQSWGPPQGMPPNAGGGPGFG---PNPPYMPPPRQFDNYYPPADMP-- 332
           +  Q M    P   + G  Q    +A G P        PP      Q  +   P D+   
Sbjct: 197 IPYQPMPASSPVICAGG--QDRCSDAAGYPHATHDLEGPPLDAYSIQGQHTISPLDLAKL 256

Query: 333 SAMDKQPHHGISAYGREAPMGV---HPQTNAQ-SVPSIVTQTTQQMQIPLSFADAVIGTA 392
           + + +Q  H    +G     G+    P+     +     TQTT ++ IP +    +IG  
Sbjct: 257 NQVARQQSHFAMMHGGTGFAGIDSSSPEVKGYWASLDASTQTTHELTIPNNLIGCIIGRQ 316

Query: 393 GASISYIRRASGATVTIQETRGVPGEMTVEMSGTASQNFMA 417
           GA+I+ IR+ SGA + I           V ++G+A+   +A
Sbjct: 317 GANINEIRQMSGAQIKIANPVEGSSGRQVTITGSAASISLA 339

BLAST of Cp4.1LG07g09850 vs. TrEMBL
Match: A0A0A0LJ91_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G010230 PE=4 SV=1)

HSP 1 Score: 767.3 bits (1980), Expect = 1.1e-218
Identity = 420/492 (85.37%), Positives = 438/492 (89.02%), Query Frame = 1

Query: 1   MADVEQSYTHV-DDNEVDETAQIYENAQLYENAQVQDNTLEQESTEEPENLHGLENEHVP 60
           MADVEQ+YTHV DD++VDE      NAQ+YENAQV D TLEQE+ +E ENLH L+NEH+P
Sbjct: 27  MADVEQNYTHVVDDDQVDE------NAQIYENAQVHDITLEQENAQEVENLHELQNEHIP 86

Query: 61  ETLELEPKEVHNEDTLAVVGEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET 120
           ETLE EPK+VH ED L VV EKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET
Sbjct: 87  ETLESEPKQVHIEDPLTVVSEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET 146

Query: 121 RARIKILDGPPGTAERAVMVSAKEEPDSDFPPAVDGLLRVHKRIVDGLEGDNANAPNMGG 180
           RARIKILDGPPGTAERAVMVSAK+EPDS FPPAVDGLLRVHKRIVDGLEGDNA+APN G 
Sbjct: 147 RARIKILDGPPGTAERAVMVSAKDEPDSAFPPAVDGLLRVHKRIVDGLEGDNAHAPNAGS 206

Query: 181 KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP 240
           KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP
Sbjct: 207 KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP 266

Query: 241 AGVHKAVELIASHLRKFLVDRSIIP-------MTNQQM-EHM-PPPHQSWGPPQGMPPNA 300
           AGVHKAVELIASHLRKFLVDRSIIP       M+N QM +HM PPPHQ WGPPQGMPPNA
Sbjct: 267 AGVHKAVELIASHLRKFLVDRSIIPVFEMNMQMSNPQMDQHMPPPPHQPWGPPQGMPPNA 326

Query: 301 GGGPGFGPNPP-YM-PPPRQFDNYYPPADMPSAMDKQPHHGISAYGREAPMGVHPQTNAQ 360
           GGGPGFGPNPP YM PPPRQFDNYYPPA+M + M+KQPHHGISAYGREAPMGVH  +NAQ
Sbjct: 327 GGGPGFGPNPPQYMPPPPRQFDNYYPPAEMQAVMEKQPHHGISAYGREAPMGVHAASNAQ 386

Query: 361 SVPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG 420
           S PSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG
Sbjct: 387 SAPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG 446

Query: 421 TAS---------QNFMAGAGAPAQTQAGGSADQGYNSYAAHGSVYASPPAANPAHASHAG 472
           TAS         QNFMAGAGAPAQ QAG S DQGYNSYAAHGSVYASPP ANP  A+HAG
Sbjct: 447 TASQVQAAQQLIQNFMAGAGAPAQPQAGVSTDQGYNSYAAHGSVYASPP-ANP--AAHAG 506

BLAST of Cp4.1LG07g09850 vs. TrEMBL
Match: A0A067JUM6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23518 PE=4 SV=1)

HSP 1 Score: 655.2 bits (1689), Expect = 6.0e-185
Identity = 348/453 (76.82%), Positives = 378/453 (83.44%), Query Frame = 1

Query: 36  DNTLEQESTEEPENLHGLENEHVPETLELEPKEVHNEDTLAVVGEKKWPGWPGESVFRML 95
           D  L+Q      EN      + VPE  +++ K+ H+EDT+A  GEKKWPGWPGESVFRML
Sbjct: 108 DQALDQNVDHTAENNMDHAEDQVPEDFQVQEKQGHHEDTVAAGGEKKWPGWPGESVFRML 167

Query: 96  VPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPGTAERAVMVSAKEEPDSDFPPAVDG 155
           VPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPGT ERAVMVSAKEEPDS  PPA+DG
Sbjct: 168 VPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPGTTERAVMVSAKEEPDSSLPPAMDG 227

Query: 156 LLRVHKRIVDGLEGDNAN-APNMGGKVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIV 215
           LLRVHKRIVDGL+GD+++ A   G KVSTRLLV ASQAGSLIGKQGGTVKSIQE S C+V
Sbjct: 228 LLRVHKRIVDGLDGDSSHIASGTGTKVSTRLLVPASQAGSLIGKQGGTVKSIQEASGCVV 287

Query: 216 RVLGSEDLPVFALQDDRVVEVLGDPAGVHKAVELIASHLRKFLVDRSIIP-------MTN 275
           RVLG+EDLPVFALQDDRVVEVLGDPAGVHKAVELIASHLRKFLVDRSIIP       M+N
Sbjct: 288 RVLGAEDLPVFALQDDRVVEVLGDPAGVHKAVELIASHLRKFLVDRSIIPLFEMHMQMSN 347

Query: 276 QQMEHMPPPHQSWGPPQGMPPNAGGGPGFGPNPPYMPPPRQFDNYYPPADMPSAMDKQPH 335
            Q+EHMP PHQSWGPPQG+PP+AGGGPGFGP P YMPPPRQ +NYYPPAD+P  M+KQPH
Sbjct: 348 PQVEHMP-PHQSWGPPQGLPPSAGGGPGFGPTPQYMPPPRQIENYYPPADLPPPMEKQPH 407

Query: 336 HGISAYGREAPMGVHPQTNAQSVPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASG 395
            GISAYGR+APMGVH  +N+Q  PS++TQ TQQMQIPLS+ADAVIGTAGASISYIRRASG
Sbjct: 408 QGISAYGRDAPMGVHASSNSQGAPSMITQITQQMQIPLSYADAVIGTAGASISYIRRASG 467

Query: 396 ATVTIQETRGVPGEMTVEMSGTAS---------QNFMAGAGAPAQTQAGGSADQGYNSYA 455
           ATVTIQETRGVPGEMTVE+SGTAS         QNFMA A APAQ Q GGS DQ YN YA
Sbjct: 468 ATVTIQETRGVPGEMTVEISGTASQVQTAQQLIQNFMAEAAAPAQAQTGGSTDQAYNPYA 527

Query: 456 AHGSVYASPPAANPAHASHAGGYGSVYSTNYGY 472
           AHGSVYASPP +N  H  H GGYGSVY TNYGY
Sbjct: 528 AHGSVYASPP-SNQGHTGHTGGYGSVYGTNYGY 558

BLAST of Cp4.1LG07g09850 vs. TrEMBL
Match: M5VXP2_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003484mg PE=4 SV=1)

HSP 1 Score: 648.3 bits (1671), Expect = 7.3e-183
Identity = 356/478 (74.48%), Positives = 386/478 (80.75%), Query Frame = 1

Query: 20  AQIYENAQLYENAQVQDNTLEQESTEEPENLHGLENEHVPETLELEPKEVH--------N 79
           AQ  E A     A  Q+  L Q    E E    L      E +E++ +E+H        +
Sbjct: 98  AQEQEQALAQLQAHEQEQALAQLQAHEQEQ--ALAQFQAQEEVEVQEEELHVREHNQEQH 157

Query: 80  EDTLAVVGEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPG 139
           ED +   GEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPG
Sbjct: 158 EDAVVGGGEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPG 217

Query: 140 TAERAVMVSAKEEPDSDFPPAVDGLLRVHKRIVDGLEGDNANAP-NMGGKVSTRLLVAAS 199
           T ERAVMVSAKEEPDS FPPA+DGLLRVHKRI+DGL+GD+++AP  MGGKVSTRLLVAAS
Sbjct: 218 TTERAVMVSAKEEPDSSFPPAMDGLLRVHKRIIDGLDGDSSHAPPGMGGKVSTRLLVAAS 277

Query: 200 QAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDPAGVHKAVELIA 259
           QAGSLIGKQGGTVKSIQE SNCIVRVLG+EDLP+FALQDDRVVEV+GD  GVHKA+ELIA
Sbjct: 278 QAGSLIGKQGGTVKSIQESSNCIVRVLGAEDLPIFALQDDRVVEVVGDAVGVHKAIELIA 337

Query: 260 SHLRKFLVDRSIIP-------MTNQQMEHMPPPHQSWGPPQGMPPNAGGGPGFGP-NPPY 319
           SHLRKFLVDRSIIP       M N QMEH PP HQ WGPPQG+P NAGGGPGFGP NP Y
Sbjct: 338 SHLRKFLVDRSIIPIFEMHMQMANPQMEHAPP-HQQWGPPQGLPHNAGGGPGFGPPNPQY 397

Query: 320 MPPPRQFDNYYPPADMPSAMDKQPHHGISAYGREAPMGVHPQTNAQSVPSIVTQTTQQMQ 379
           MPPPRQ DNYYPPADMP  ++KQPHHGISAYGREAPMGVH  +NAQS PS+VTQ TQQMQ
Sbjct: 398 MPPPRQHDNYYPPADMPPPIEKQPHHGISAYGREAPMGVHQSSNAQSAPSMVTQITQQMQ 457

Query: 380 IPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSGTAS---------QN 439
           IPLS+ADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVE+SG+AS         QN
Sbjct: 458 IPLSYADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEISGSASQVQAAQQLIQN 517

Query: 440 FMAGAGAPAQTQAGGSADQGYNSYAAHGSVYASPPAANPAHASHAGGYGSVYSTNYGY 472
           FMA AGAP  TQ  GS DQGYNSYAAHGSVY+SPP +N  HA H GGYGSVY ++YGY
Sbjct: 518 FMADAGAPQPTQTAGSVDQGYNSYAAHGSVYSSPP-SNQGHAGHTGGYGSVYGSHYGY 571

BLAST of Cp4.1LG07g09850 vs. TrEMBL
Match: D7TXC7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0081g00440 PE=4 SV=1)

HSP 1 Score: 642.5 bits (1656), Expect = 4.0e-181
Identity = 340/443 (76.75%), Positives = 376/443 (84.88%), Query Frame = 1

Query: 45  EEPENLHGLENEHVPETLELEPKEVHNEDTLAVVGEKKWPGWPGESVFRMLVPAQKVGSI 104
           E  EN    + + +PE  + + K   ++D+    GEK+WPGWPGESVFRMLVPAQKVGSI
Sbjct: 72  EVEENFGEPDMDQLPEDSQPQQKRGRDDDSAIGGGEKRWPGWPGESVFRMLVPAQKVGSI 131

Query: 105 IGRKGEFIKKIVEETRARIKILDGPPGTAERAVMVSAKEEPDSDFPPAVDGLLRVHKRIV 164
           IGRKGEFIKKIVEETRARIKILDGPPGTAERAVMVSAKEEPDS  PPA+DGLL+VHKRIV
Sbjct: 132 IGRKGEFIKKIVEETRARIKILDGPPGTAERAVMVSAKEEPDSSLPPAMDGLLKVHKRIV 191

Query: 165 DGLEGDNANAPNMGGKVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPV 224
           DGLEGD+++ P  GGKVSTRLLVAASQAGSLIGKQGGTVKSIQE SNCIVRVLG+EDLP+
Sbjct: 192 DGLEGDSSHMPP-GGKVSTRLLVAASQAGSLIGKQGGTVKSIQEASNCIVRVLGAEDLPI 251

Query: 225 FALQDDRVVEVLGDPAGVHKAVELIASHLRKFLVDRSIIP-------MTNQQMEHMPPPH 284
           FALQDDRVVEV+G+P GVHKAVELIASHLRKFLVDRS+IP       M+N  +EHMPP H
Sbjct: 252 FALQDDRVVEVVGEPIGVHKAVELIASHLRKFLVDRSVIPLFEMQMQMSNPPIEHMPP-H 311

Query: 285 QSWGPPQGMPPNAGGGPGFGPNPPYMPPPRQFDNYYPPADMPSAMDKQPHHGISAYGREA 344
           Q WGPPQG+PPNA GGPGFGPNPPYMPPPRQ D+YYPP ++P  ++KQPH GISAYGRE 
Sbjct: 312 QPWGPPQGLPPNASGGPGFGPNPPYMPPPRQLDSYYPPPELPPPVEKQPHQGISAYGREV 371

Query: 345 PMGVHPQTNAQSVPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRG 404
           PMG H  ++AQ  PS++TQ TQQMQIPLS+ADAVIGTAGASISYIRRASGATVTIQETRG
Sbjct: 372 PMGGHAPSSAQPAPSMITQVTQQMQIPLSYADAVIGTAGASISYIRRASGATVTIQETRG 431

Query: 405 VPGEMTVEMSGTAS---------QNFMAGAGAPAQTQAGGSADQGYNSYAAHGSVYASPP 464
           VPGEMTVE++GTAS         QNFMA A APAQ QAGGSADQGYNSYAAHGS+YAS P
Sbjct: 432 VPGEMTVEINGTASQVQAAQQLIQNFMAEAAAPAQAQAGGSADQGYNSYAAHGSMYAS-P 491

Query: 465 AANPAHASHAGGYGSVYSTNYGY 472
           A+NPAHASH GGYG VY  NYGY
Sbjct: 492 ASNPAHASHTGGYGPVYGANYGY 511

BLAST of Cp4.1LG07g09850 vs. TrEMBL
Match: A0A061F946_THECC (RNA-binding KH domain-containing protein OS=Theobroma cacao GN=TCM_026308 PE=4 SV=1)

HSP 1 Score: 641.7 bits (1654), Expect = 6.8e-181
Identity = 353/501 (70.46%), Positives = 395/501 (78.84%), Query Frame = 1

Query: 1   MADVEQSYTHVDDNEVDETAQIYENAQLYENAQVQDN-------TLEQESTEEPE-NLH- 60
           MA+V+QS+   ++ +V E     EN  L +N  ++ N        LE     EP+ NL  
Sbjct: 1   MAEVDQSFVEHEEEQVQEPENSEENLNLEQNLSLEQNLNLEPNLNLEPNLNLEPDLNLEE 60

Query: 61  ----GLENEHVPETLELEPKEVHNEDTLAVVGEKKWPGWPGESVFRMLVPAQKVGSIIGR 120
                LE +++ +    +PK+ H ++ +    EKKWPGWPGESVFRMLVPAQKVGSIIGR
Sbjct: 61  NAEENLEEQNLQQESPHQPKQEHEDEAVVGGVEKKWPGWPGESVFRMLVPAQKVGSIIGR 120

Query: 121 KGEFIKKIVEETRARIKILDGPPGTAERAVMVSAKEEPDSDFPPAVDGLLRVHKRIVDGL 180
           KGEFIKKIVEETRARIKILDGPPGT ERAVMVSAKEEPDS  PPA+DGLLRVHKRIVDGL
Sbjct: 121 KGEFIKKIVEETRARIKILDGPPGTTERAVMVSAKEEPDSSLPPAMDGLLRVHKRIVDGL 180

Query: 181 EGDNANAPN-MGGKVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFA 240
           +GD+++AP  +G KVSTRLLV ASQAGSLIGKQG TVKSIQE S C+VRVLG+EDLPVFA
Sbjct: 181 DGDSSHAPTAVGTKVSTRLLVPASQAGSLIGKQGTTVKSIQESSGCVVRVLGAEDLPVFA 240

Query: 241 LQDDRVVEVLGDPAGVHKAVELIASHLRKFLVDRSIIP-------MTNQQMEHMPPPHQS 300
           LQDDRVVEV+G+ AGVHKAVELIASHLRKFLVDRSIIP       M+N QM+HMP PHQS
Sbjct: 241 LQDDRVVEVVGEAAGVHKAVELIASHLRKFLVDRSIIPLFEMHMQMSNPQMDHMP-PHQS 300

Query: 301 WGPPQGMPPNAGGGPGFGPNPPYMPPPRQFDNYYPPADMPSAMDKQPHHGISAYGREAPM 360
           WGPPQG+PPNA GG GFG NP YMPPPRQ DNYYPPADMP  ++KQPH GISAYGREAPM
Sbjct: 301 WGPPQGVPPNASGGAGFGHNPQYMPPPRQLDNYYPPADMPPPIEKQPHQGISAYGREAPM 360

Query: 361 GVHPQTNAQSVPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVP 420
           G H  +N QS PS++TQ TQQMQIPLS+ADAVIGTAGASISYIRRASGATVTIQETRGVP
Sbjct: 361 GAHASSNPQSAPSMITQVTQQMQIPLSYADAVIGTAGASISYIRRASGATVTIQETRGVP 420

Query: 421 GEMTVEMSGTAS---------QNFMAGAGAPAQTQAGGSADQGYNSYAAHGSVYASPPAA 472
           GEMTVE+SGTAS         QNFMA A APAQ Q GG+ DQ YN YAAH SVYASPP +
Sbjct: 421 GEMTVEISGTASQVQTAQQLIQNFMAEAAAPAQGQTGGATDQAYNPYAAHSSVYASPP-S 480

BLAST of Cp4.1LG07g09850 vs. TAIR10
Match: AT3G04610.1 (AT3G04610.1 RNA-binding KH domain-containing protein)

HSP 1 Score: 521.2 bits (1341), Expect = 6.8e-148
Identity = 302/474 (63.71%), Positives = 343/474 (72.36%), Query Frame = 1

Query: 18  ETAQIYENAQLYENAQVQDNTLEQESTEEPENLHGLENEHVPETLELEPKEVHNEDTLAV 77
           E  Q+ + A      Q Q +  +    E  + +   + E +PE LE   K    ED    
Sbjct: 117 EQFQLQDEAHDQAQYQAQGDVQDHNGDEVQDKVE--DEEGIPEHLESLQKSEPEEDATVG 176

Query: 78  VGEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPGTAERAV 137
             EK+WPGWPGE+VFRMLVPAQKVGSIIGRKG+ IKKIVEETRARIKILDGPPGT ERAV
Sbjct: 177 GEEKRWPGWPGETVFRMLVPAQKVGSIIGRKGDVIKKIVEETRARIKILDGPPGTTERAV 236

Query: 138 MVSAKEEPDSDFPPAVDGLLRVHKRIVDGLEGDNANAPNMGGKVSTRLLVAASQAGSLIG 197
           MVS KEEP+S  PP++DGLLRVH RIVDGL+G+ + AP    KVSTRLLV ASQAGSLIG
Sbjct: 237 MVSGKEEPESSLPPSMDGLLRVHMRIVDGLDGEASQAPPPS-KVSTRLLVPASQAGSLIG 296

Query: 198 KQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDPAGVHKAVELIASHLRKFL 257
           KQGGTVK+IQE S CIVRVLGSEDLPVFALQDDRVVEV+G+P  VH+A+ELIASHLRKFL
Sbjct: 297 KQGGTVKAIQEASACIVRVLGSEDLPVFALQDDRVVEVVGEPTSVHRALELIASHLRKFL 356

Query: 258 VDRSIIPM-------TNQQMEHMPPPHQSWGPPQGMPPNAGGGPGFGPN-PPYMPPPRQF 317
           VDRSIIP          +QM+HMPPPHQSWGPPQG  P+ GGG G+G N PPYM PP + 
Sbjct: 357 VDRSIIPFFENQMQKPTRQMDHMPPPHQSWGPPQGHAPSVGGG-GYGHNPPPYMQPPPRH 416

Query: 318 DNYYPPADM-PSAMDKQPHHGISAYGREAPMGVHPQTNAQSVPSIVTQTTQQMQIPLSFA 377
           D+YYPP +M    M+KQPH GISAYGRE PM VH    + + P +  Q TQQMQIPLS+A
Sbjct: 417 DSYYPPPEMRQPPMEKQPHQGISAYGREPPMNVHV---SSAPPMVAQQVTQQMQIPLSYA 476

Query: 378 DAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSGTAS---------QNFM--AG 437
           DAVIGT+G++ISY RR SGATVTIQETRGVPGEMTVE+SGT S         QNFM  AG
Sbjct: 477 DAVIGTSGSNISYTRRLSGATVTIQETRGVPGEMTVEVSGTGSQVQTAVQLIQNFMAEAG 536

Query: 438 AGAPAQTQAGGSADQGYNSYAAHGSVYASPPAANPAHASHAGGYGSVYSTNYGY 472
           A APAQ Q      QGYN YA HGSVYA+ P   P      GGY + YS+ YGY
Sbjct: 537 APAPAQPQTVAPEQQGYNPYATHGSVYAAAPTNPP------GGYATDYSSGYGY 577

BLAST of Cp4.1LG07g09850 vs. TAIR10
Match: AT4G26000.1 (AT4G26000.1 RNA-binding KH domain-containing protein)

HSP 1 Score: 239.2 bits (609), Expect = 5.2e-63
Identity = 146/346 (42.20%), Positives = 212/346 (61.27%), Query Frame = 1

Query: 80  EKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPGTAERAVMV 139
           E++WPGWPG+ VFRM+VP  KVG+IIGRKG+FIKK+ EETRARIK+LDGP  T +R V++
Sbjct: 64  EERWPGWPGDCVFRMIVPVTKVGAIIGRKGDFIKKMCEETRARIKVLDGPVNTPDRIVLI 123

Query: 140 SAKEEPDSDFPPAVDGLLRVHKRIVDGLEGDNANAPNMGGKVST-RLLVAASQAGSLIGK 199
           S KEEP++   PA+D +LRV +R+    + D+ +  N G   S+ RLLVA++QA +LIGK
Sbjct: 124 SGKEEPEAYMSPAMDAVLRVFRRVSGLPDNDDDDVQNAGSVFSSVRLLVASTQAINLIGK 183

Query: 200 QGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDPAGVHKAVELIASHLRKFLV 259
           QG  +KSI E S   VR+L  E+ P +A QD+R+V++ G+   + KA+E I  HLR+FLV
Sbjct: 184 QGSLIKSIVENSGASVRILSEEETPFYAAQDERIVDLQGEALKILKALEAIVGHLRRFLV 243

Query: 260 DRSIIPMTNQ-------QMEHMPPPHQSWGPPQGMPPNAGGGPGFG----PNPPYMPPPR 319
           D +++P+  +       Q     P  +S      +  N    P F       P ++    
Sbjct: 244 DHTVVPLFEKQYLARVSQTRQEEPLAESKSSLHTISSNL-MEPDFSLLARREPLFLERDS 303

Query: 320 QFDNYYPPADMPSAMDKQPHHGISAYGREAPMGV-HPQTNAQSVPSIVTQTTQQMQIPLS 379
           + D+   P+            G+S Y ++  +   H    A+   + VTQ +Q MQIP S
Sbjct: 304 RVDSRVQPS------------GVSIYSQDPVLSARHSPGLARVSSAFVTQVSQTMQIPFS 363

Query: 380 FADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSGTASQ 413
           +A+ +IG  GA+I+YIRR SGAT+TI+E+   P ++TVE+ GT SQ
Sbjct: 364 YAEDIIGVEGANIAYIRRRSGATITIKESPH-PDQITVEIKGTTSQ 395

BLAST of Cp4.1LG07g09850 vs. TAIR10
Match: AT1G51580.1 (AT1G51580.1 RNA-binding KH domain-containing protein)

HSP 1 Score: 107.8 bits (268), Expect = 1.8e-23
Identity = 106/374 (28.34%), Positives = 163/374 (43.58%), Query Frame = 1

Query: 78  VGEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPGTAERAV 137
           VG    P    E  FR+L PA KVGS+IG+ G  ++ +  E+ A IK+ D    + ER +
Sbjct: 264 VGPFNRPVVEEEVAFRLLCPADKVGSLIGKGGAVVRALQNESGASIKVSDPTHDSEERII 323

Query: 138 MVSAKEEPDSDFPPAVDGLLRVHKRIVD-GLEGDNANAPNMGGKVSTRLLVAASQAGSLI 197
           ++SA+E  +     A DG++RVH RIV+ G E   A        V  RLLV +   G L+
Sbjct: 324 VISARENLERRHSLAQDGVMRVHNRIVEIGFEPSAA--------VVARLLVHSPYIGRLL 383

Query: 198 GKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDPAGVHKAVELIASHLRKF 257
           GK G  +  ++  +   +RV   +    +  Q D +V+V+G+   V  A+  I   LR+ 
Sbjct: 384 GKGGHLISEMRRATGASIRVFAKDQATKYESQHDEIVQVIGNLKTVQDALFQILCRLREA 443

Query: 258 LVDRSIIPMTNQQMEHMPPPHQSWGPPQGMPPNAGGGPGFGPNPPYMPP--PRQF----D 317
           +    +             P Q  G P   PP     P  GP P   PP  PRQ+    D
Sbjct: 444 MFPGRL-------------PFQGMGGP---PP-----PFMGPYPEPPPPFGPRQYPASPD 503

Query: 318 NYYPPAD----------------------MPSAMDKQPHHGISAY-GREAPMGVHPQTNA 377
            Y+ P                         PS M   P  GI  + G   P  V+     
Sbjct: 504 RYHSPVGPFHERHCHGPGFDRPPGPGFDRPPSPMSWTPQPGIDGHPGGMVPPDVNHGFAL 563

Query: 378 QSVP-----SIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEM 417
           ++ P      ++T    ++ IP ++   V G   ++++YI++ SGA V + + +    E 
Sbjct: 564 RNEPIGSENPVMTSANVEIVIPQAYLGHVYGENCSNLNYIKQVSGANVVVHDPKAGTTEG 608

BLAST of Cp4.1LG07g09850 vs. TAIR10
Match: AT5G15270.2 (AT5G15270.2 RNA-binding KH domain-containing protein)

HSP 1 Score: 101.7 bits (252), Expect = 1.3e-21
Identity = 64/213 (30.05%), Positives = 116/213 (54.46%), Query Frame = 1

Query: 89  ESVFRMLVPAQKVGSIIGRKGEFIKKIVEETRARIKILDGPPGTAERAVMVSAKEEPDSD 148
           ++VFR L P +K+GS+IGR G+ +K++  +TR++I+I +  PG  ER + + +  +  + 
Sbjct: 49  DTVFRYLCPVKKIGSVIGRGGDIVKQLRNDTRSKIRIGEAIPGCDERVITIYSPSDETNA 108

Query: 149 F-------PPAVDGLLRVHKRIVDGLEGDNANAPNMGGKVSTRLLVAASQAGSLIGKQGG 208
           F        PA D L R+H R+V   +  + ++P    +V+ +LLV + Q G ++G+ G 
Sbjct: 109 FGDGEKVLSPAQDALFRIHDRVVAD-DARSEDSPEGEKQVTAKLLVPSDQIGCILGRGGQ 168

Query: 209 TVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDPAGVHKAVELIASHLRKFLVDRS 268
            V++I+ E+   +R++   ++P+ AL  D ++++ G+   V KA+  IAS L +      
Sbjct: 169 IVQNIRSETGAQIRIVKDRNMPLCALNSDELIQISGEVLIVKKALLQIASRLHE------ 228

Query: 269 IIPMTNQQMEHMPPPHQSWGPPQGMPPNAGGGP 295
             P  +Q +        S G P G   +  GGP
Sbjct: 229 -NPSRSQNL-----LSSSGGYPAGSLMSHAGGP 248

BLAST of Cp4.1LG07g09850 vs. TAIR10
Match: AT1G14170.3 (AT1G14170.3 RNA-binding KH domain-containing protein)

HSP 1 Score: 95.9 bits (237), Expect = 7.1e-20
Identity = 65/204 (31.86%), Positives = 112/204 (54.90%), Query Frame = 1

Query: 67  KEVHNEDTLAVVGEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEETRARIKIL 126
           + +H+E    V+  +       ++V+R L P +K GSIIG+ GE  K+I  ET++ ++I 
Sbjct: 28  RNLHDETDQNVIASE-------DTVYRYLCPVKKTGSIIGKGGEIAKQIRSETKSNMRIN 87

Query: 127 DGPPGTAERAVMVSAKEEPDSDFP-------PAVDGLLRVHKRIV------DGLEGDNAN 186
           +  PG  ER V + +  E  + F        PA+D L +VH  +V      DG + DN  
Sbjct: 88  EALPGCEERVVTMYSTNEELNHFGDDGELVCPALDALFKVHDMVVADADQDDGTDDDN-- 147

Query: 187 APNMGGK--VSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFA--LQD 246
             ++G K  V+ R+LV + Q G +IGK G  +++++ ++N  +RV+  + LP  A  L  
Sbjct: 148 --DLGEKQTVTVRMLVPSDQIGCVIGKGGQVIQNLRNDTNAQIRVI-KDHLPACALTLSH 207

Query: 247 DRVVEVLGDPAGVHKAVELIASHL 254
           D ++ ++G+P  V +A+  +AS L
Sbjct: 208 DELLLIIGEPLVVREALYQVASLL 219

BLAST of Cp4.1LG07g09850 vs. NCBI nr
Match: gi|659070698|ref|XP_008456230.1| (PREDICTED: poly(rC)-binding protein 3 isoform X1 [Cucumis melo])

HSP 1 Score: 774.2 bits (1998), Expect = 1.3e-220
Identity = 423/492 (85.98%), Positives = 440/492 (89.43%), Query Frame = 1

Query: 1   MADVEQSYTHV-DDNEVDETAQIYENAQLYENAQVQDNTLEQESTEEPENLHGLENEHVP 60
           MADVEQ+YTHV DD++VDE      NAQ+YENAQV D TLEQE+T+E ENLH L+NEH+P
Sbjct: 1   MADVEQNYTHVVDDDQVDE------NAQIYENAQVHDITLEQENTQEVENLHELQNEHIP 60

Query: 61  ETLELEPKEVHNEDTLAVVGEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET 120
           ETLE EPK+VHNED L VV EKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET
Sbjct: 61  ETLESEPKQVHNEDPLTVVSEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET 120

Query: 121 RARIKILDGPPGTAERAVMVSAKEEPDSDFPPAVDGLLRVHKRIVDGLEGDNANAPNMGG 180
           RARIKILDGPPGTAERAVMVSAK+EPDS FPPAVDGLLRVHKRIVDGLEGDNA+APN G 
Sbjct: 121 RARIKILDGPPGTAERAVMVSAKDEPDSAFPPAVDGLLRVHKRIVDGLEGDNAHAPNAGS 180

Query: 181 KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP 240
           KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP
Sbjct: 181 KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP 240

Query: 241 AGVHKAVELIASHLRKFLVDRSIIP-------MTNQQM-EHM-PPPHQSWGPPQGMPPNA 300
           AGVHKAVELIASHLRKFLVDRSIIP       M+N QM +HM PPPHQ WGPPQGMPPNA
Sbjct: 241 AGVHKAVELIASHLRKFLVDRSIIPVFEMNMQMSNPQMDQHMPPPPHQPWGPPQGMPPNA 300

Query: 301 GGGPGFGPNPP-YM-PPPRQFDNYYPPADMPSAMDKQPHHGISAYGREAPMGVHPQTNAQ 360
           GGGPGFGPNPP YM PPPRQFDNYYPPA+M + MDKQPHHGISAYGREAPMGVH  +NAQ
Sbjct: 301 GGGPGFGPNPPQYMPPPPRQFDNYYPPAEMQAVMDKQPHHGISAYGREAPMGVHAASNAQ 360

Query: 361 SVPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG 420
           S PSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG
Sbjct: 361 SAPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG 420

Query: 421 TAS---------QNFMAGAGAPAQTQAGGSADQGYNSYAAHGSVYASPPAANPAHASHAG 472
           TAS         QNFMAGAGAPAQ QAG S DQGYNSYAAHGSVYASPP ANP  A+HAG
Sbjct: 421 TASQVQAAQQLIQNFMAGAGAPAQPQAGVSTDQGYNSYAAHGSVYASPP-ANP--AAHAG 480

BLAST of Cp4.1LG07g09850 vs. NCBI nr
Match: gi|659070700|ref|XP_008456234.1| (PREDICTED: poly(rC)-binding protein 3 isoform X2 [Cucumis melo])

HSP 1 Score: 767.7 bits (1981), Expect = 1.2e-218
Identity = 422/492 (85.77%), Positives = 439/492 (89.23%), Query Frame = 1

Query: 1   MADVEQSYTHV-DDNEVDETAQIYENAQLYENAQVQDNTLEQESTEEPENLHGLENEHVP 60
           MADVEQ+YTHV DD++VDE      NAQ+YENAQV D TLEQE+T+E ENLH L+NEH+P
Sbjct: 1   MADVEQNYTHVVDDDQVDE------NAQIYENAQVHDITLEQENTQEVENLHELQNEHIP 60

Query: 61  ETLELEPKEVHNEDTLAVVGEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET 120
           ETLE EPK+VHNED L VV EKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET
Sbjct: 61  ETLESEPKQVHNEDPLTVVSEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET 120

Query: 121 RARIKILDGPPGTAERAVMVSAKEEPDSDFPPAVDGLLRVHKRIVDGLEGDNANAPNMGG 180
           RARIKILDGPPGTAERAVMVSAK+EPDS FPPAVDGLLRVHKRIVDGLEGDNA+APN G 
Sbjct: 121 RARIKILDGPPGTAERAVMVSAKDEPDSAFPPAVDGLLRVHKRIVDGLEGDNAHAPNAGS 180

Query: 181 KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP 240
           KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGS DLPVFALQDDRVVEVLGDP
Sbjct: 181 KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGS-DLPVFALQDDRVVEVLGDP 240

Query: 241 AGVHKAVELIASHLRKFLVDRSIIP-------MTNQQM-EHM-PPPHQSWGPPQGMPPNA 300
           AGVHKAVELIASHLRKFLVDRSIIP       M+N QM +HM PPPHQ WGPPQGMPPNA
Sbjct: 241 AGVHKAVELIASHLRKFLVDRSIIPVFEMNMQMSNPQMDQHMPPPPHQPWGPPQGMPPNA 300

Query: 301 GGGPGFGPNPP-YM-PPPRQFDNYYPPADMPSAMDKQPHHGISAYGREAPMGVHPQTNAQ 360
           GGGPGFGPNPP YM PPPRQFDNYYPPA+M + MDKQPHHGISAYGREAPMGVH  +NAQ
Sbjct: 301 GGGPGFGPNPPQYMPPPPRQFDNYYPPAEMQAVMDKQPHHGISAYGREAPMGVHAASNAQ 360

Query: 361 SVPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG 420
           S PSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG
Sbjct: 361 SAPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG 420

Query: 421 TAS---------QNFMAGAGAPAQTQAGGSADQGYNSYAAHGSVYASPPAANPAHASHAG 472
           TAS         QNFMAGAGAPAQ QAG S DQGYNSYAAHGSVYASPP ANP  A+HAG
Sbjct: 421 TASQVQAAQQLIQNFMAGAGAPAQPQAGVSTDQGYNSYAAHGSVYASPP-ANP--AAHAG 480

BLAST of Cp4.1LG07g09850 vs. NCBI nr
Match: gi|449442959|ref|XP_004139248.1| (PREDICTED: flowering locus K homology domain isoform X1 [Cucumis sativus])

HSP 1 Score: 767.3 bits (1980), Expect = 1.5e-218
Identity = 420/492 (85.37%), Positives = 438/492 (89.02%), Query Frame = 1

Query: 1   MADVEQSYTHV-DDNEVDETAQIYENAQLYENAQVQDNTLEQESTEEPENLHGLENEHVP 60
           MADVEQ+YTHV DD++VDE      NAQ+YENAQV D TLEQE+ +E ENLH L+NEH+P
Sbjct: 1   MADVEQNYTHVVDDDQVDE------NAQIYENAQVHDITLEQENAQEVENLHELQNEHIP 60

Query: 61  ETLELEPKEVHNEDTLAVVGEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET 120
           ETLE EPK+VH ED L VV EKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET
Sbjct: 61  ETLESEPKQVHIEDPLTVVSEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET 120

Query: 121 RARIKILDGPPGTAERAVMVSAKEEPDSDFPPAVDGLLRVHKRIVDGLEGDNANAPNMGG 180
           RARIKILDGPPGTAERAVMVSAK+EPDS FPPAVDGLLRVHKRIVDGLEGDNA+APN G 
Sbjct: 121 RARIKILDGPPGTAERAVMVSAKDEPDSAFPPAVDGLLRVHKRIVDGLEGDNAHAPNAGS 180

Query: 181 KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP 240
           KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP
Sbjct: 181 KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP 240

Query: 241 AGVHKAVELIASHLRKFLVDRSIIP-------MTNQQM-EHM-PPPHQSWGPPQGMPPNA 300
           AGVHKAVELIASHLRKFLVDRSIIP       M+N QM +HM PPPHQ WGPPQGMPPNA
Sbjct: 241 AGVHKAVELIASHLRKFLVDRSIIPVFEMNMQMSNPQMDQHMPPPPHQPWGPPQGMPPNA 300

Query: 301 GGGPGFGPNPP-YM-PPPRQFDNYYPPADMPSAMDKQPHHGISAYGREAPMGVHPQTNAQ 360
           GGGPGFGPNPP YM PPPRQFDNYYPPA+M + M+KQPHHGISAYGREAPMGVH  +NAQ
Sbjct: 301 GGGPGFGPNPPQYMPPPPRQFDNYYPPAEMQAVMEKQPHHGISAYGREAPMGVHAASNAQ 360

Query: 361 SVPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG 420
           S PSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG
Sbjct: 361 SAPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG 420

Query: 421 TAS---------QNFMAGAGAPAQTQAGGSADQGYNSYAAHGSVYASPPAANPAHASHAG 472
           TAS         QNFMAGAGAPAQ QAG S DQGYNSYAAHGSVYASPP ANP  A+HAG
Sbjct: 421 TASQVQAAQQLIQNFMAGAGAPAQPQAGVSTDQGYNSYAAHGSVYASPP-ANP--AAHAG 480

BLAST of Cp4.1LG07g09850 vs. NCBI nr
Match: gi|700205668|gb|KGN60787.1| (hypothetical protein Csa_2G010230 [Cucumis sativus])

HSP 1 Score: 767.3 bits (1980), Expect = 1.5e-218
Identity = 420/492 (85.37%), Positives = 438/492 (89.02%), Query Frame = 1

Query: 1   MADVEQSYTHV-DDNEVDETAQIYENAQLYENAQVQDNTLEQESTEEPENLHGLENEHVP 60
           MADVEQ+YTHV DD++VDE      NAQ+YENAQV D TLEQE+ +E ENLH L+NEH+P
Sbjct: 27  MADVEQNYTHVVDDDQVDE------NAQIYENAQVHDITLEQENAQEVENLHELQNEHIP 86

Query: 61  ETLELEPKEVHNEDTLAVVGEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET 120
           ETLE EPK+VH ED L VV EKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET
Sbjct: 87  ETLESEPKQVHIEDPLTVVSEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET 146

Query: 121 RARIKILDGPPGTAERAVMVSAKEEPDSDFPPAVDGLLRVHKRIVDGLEGDNANAPNMGG 180
           RARIKILDGPPGTAERAVMVSAK+EPDS FPPAVDGLLRVHKRIVDGLEGDNA+APN G 
Sbjct: 147 RARIKILDGPPGTAERAVMVSAKDEPDSAFPPAVDGLLRVHKRIVDGLEGDNAHAPNAGS 206

Query: 181 KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP 240
           KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP
Sbjct: 207 KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP 266

Query: 241 AGVHKAVELIASHLRKFLVDRSIIP-------MTNQQM-EHM-PPPHQSWGPPQGMPPNA 300
           AGVHKAVELIASHLRKFLVDRSIIP       M+N QM +HM PPPHQ WGPPQGMPPNA
Sbjct: 267 AGVHKAVELIASHLRKFLVDRSIIPVFEMNMQMSNPQMDQHMPPPPHQPWGPPQGMPPNA 326

Query: 301 GGGPGFGPNPP-YM-PPPRQFDNYYPPADMPSAMDKQPHHGISAYGREAPMGVHPQTNAQ 360
           GGGPGFGPNPP YM PPPRQFDNYYPPA+M + M+KQPHHGISAYGREAPMGVH  +NAQ
Sbjct: 327 GGGPGFGPNPPQYMPPPPRQFDNYYPPAEMQAVMEKQPHHGISAYGREAPMGVHAASNAQ 386

Query: 361 SVPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG 420
           S PSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG
Sbjct: 387 SAPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG 446

Query: 421 TAS---------QNFMAGAGAPAQTQAGGSADQGYNSYAAHGSVYASPPAANPAHASHAG 472
           TAS         QNFMAGAGAPAQ QAG S DQGYNSYAAHGSVYASPP ANP  A+HAG
Sbjct: 447 TASQVQAAQQLIQNFMAGAGAPAQPQAGVSTDQGYNSYAAHGSVYASPP-ANP--AAHAG 506

BLAST of Cp4.1LG07g09850 vs. NCBI nr
Match: gi|778666444|ref|XP_011648742.1| (PREDICTED: flowering locus K homology domain isoform X2 [Cucumis sativus])

HSP 1 Score: 760.8 bits (1963), Expect = 1.4e-216
Identity = 419/492 (85.16%), Positives = 437/492 (88.82%), Query Frame = 1

Query: 1   MADVEQSYTHV-DDNEVDETAQIYENAQLYENAQVQDNTLEQESTEEPENLHGLENEHVP 60
           MADVEQ+YTHV DD++VDE      NAQ+YENAQV D TLEQE+ +E ENLH L+NEH+P
Sbjct: 1   MADVEQNYTHVVDDDQVDE------NAQIYENAQVHDITLEQENAQEVENLHELQNEHIP 60

Query: 61  ETLELEPKEVHNEDTLAVVGEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET 120
           ETLE EPK+VH ED L VV EKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET
Sbjct: 61  ETLESEPKQVHIEDPLTVVSEKKWPGWPGESVFRMLVPAQKVGSIIGRKGEFIKKIVEET 120

Query: 121 RARIKILDGPPGTAERAVMVSAKEEPDSDFPPAVDGLLRVHKRIVDGLEGDNANAPNMGG 180
           RARIKILDGPPGTAERAVMVSAK+EPDS FPPAVDGLLRVHKRIVDGLEGDNA+APN G 
Sbjct: 121 RARIKILDGPPGTAERAVMVSAKDEPDSAFPPAVDGLLRVHKRIVDGLEGDNAHAPNAGS 180

Query: 181 KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGSEDLPVFALQDDRVVEVLGDP 240
           KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGS DLPVFALQDDRVVEVLGDP
Sbjct: 181 KVSTRLLVAASQAGSLIGKQGGTVKSIQEESNCIVRVLGS-DLPVFALQDDRVVEVLGDP 240

Query: 241 AGVHKAVELIASHLRKFLVDRSIIP-------MTNQQM-EHM-PPPHQSWGPPQGMPPNA 300
           AGVHKAVELIASHLRKFLVDRSIIP       M+N QM +HM PPPHQ WGPPQGMPPNA
Sbjct: 241 AGVHKAVELIASHLRKFLVDRSIIPVFEMNMQMSNPQMDQHMPPPPHQPWGPPQGMPPNA 300

Query: 301 GGGPGFGPNPP-YM-PPPRQFDNYYPPADMPSAMDKQPHHGISAYGREAPMGVHPQTNAQ 360
           GGGPGFGPNPP YM PPPRQFDNYYPPA+M + M+KQPHHGISAYGREAPMGVH  +NAQ
Sbjct: 301 GGGPGFGPNPPQYMPPPPRQFDNYYPPAEMQAVMEKQPHHGISAYGREAPMGVHAASNAQ 360

Query: 361 SVPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG 420
           S PSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG
Sbjct: 361 SAPSIVTQTTQQMQIPLSFADAVIGTAGASISYIRRASGATVTIQETRGVPGEMTVEMSG 420

Query: 421 TAS---------QNFMAGAGAPAQTQAGGSADQGYNSYAAHGSVYASPPAANPAHASHAG 472
           TAS         QNFMAGAGAPAQ QAG S DQGYNSYAAHGSVYASPP ANP  A+HAG
Sbjct: 421 TASQVQAAQQLIQNFMAGAGAPAQPQAGVSTDQGYNSYAAHGSVYASPP-ANP--AAHAG 480

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
FLK_ARATH    1.2e-146  63.71     Flowering locus K homology domain OS=Arabidopsis thaliana GN=FLK PE=1 SV=1
PEP_ARATH    9.2e-62   42.20     RNA-binding KH domain-containing protein PEPPER OS=Arabidopsis thaliana GN=PEP P...
Y4837_ARATH  1.6e-13   27.33     KH domain-containing protein At4g18375 OS=Arabidopsis thaliana GN=At4g18375 PE=2...
HEN4_ARATH   2.7e-13   30.17     KH domain-containing protein HEN4 OS=Arabidopsis thaliana GN=HEN4 PE=1 SV=1
PCBP1_MOUSE  1.0e-12   27.57     Poly(rC)-binding protein 1 OS=Mus musculus GN=Pcbp1 PE=1 SV=1
Match Name        E-value   Identity  Description
A0A0A0LJ91_CUCSA  1.1e-218  85.37     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G010230 PE=4 SV=1
A0A067JUM6_JATCU  6.0e-185  76.82     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23518 PE=4 SV=1
M5VXP2_PRUPE      7.3e-183  74.48     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003484mg PE=4 SV=1
D7TXC7_VITVI      4.0e-181  76.75     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0081g00440 PE=4 SV=...
A0A061F946_THECC  6.8e-181  70.46     RNA-binding KH domain-containing protein OS=Theobroma cacao GN=TCM_026308 PE=4 S...
Match Name   E-value   Identity  Description
AT3G04610.1  6.8e-148  63.71     RNA-binding KH domain-containing protein
AT4G26000.1  5.2e-63   42.20     RNA-binding KH domain-containing protein
AT1G51580.1  1.8e-23   28.34     RNA-binding KH domain-containing protein
AT5G15270.2  1.3e-21   30.05     RNA-binding KH domain-containing protein
AT1G14170.3  7.1e-20   31.86     RNA-binding KH domain-containing protein
Match Name                        E-value   Identity  Description
gi|659070698|ref|XP_008456230.1|  1.3e-220  85.98     PREDICTED: poly(rC)-binding protein 3 isoform X1 [Cucumis melo]
gi|659070700|ref|XP_008456234.1|  1.2e-218  85.77     PREDICTED: poly(rC)-binding protein 3 isoform X2 [Cucumis melo]
gi|449442959|ref|XP_004139248.1|  1.5e-218  85.37     PREDICTED: flowering locus K homology domain isoform X1 [Cucumis sativus]
gi|700205668|gb|KGN60787.1|       1.5e-218  85.37     hypothetical protein Csa_2G010230 [Cucumis sativus]
gi|778666444|ref|XP_011648742.1|  1.4e-216  85.16     PREDICTED: flowering locus K homology domain isoform X2 [Cucumis sativus]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0003723  RNA binding
GO:0003676  nucleic acid binding
Vocabulary: INTERPRO
Term       Definition
IPR004088  KH_dom_type_1
IPR004087  KH_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009911      positive regulation of flower development
biological_process  GO:0008150      biological_process
cellular_component  GO:0005737      cytoplasm
cellular_component  GO:0005634      nucleus
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003723      RNA binding
molecular_function  GO:0003676      nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG07g09850.1  Cp4.1LG07g09850.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment
IPR004087  K Homology domain  SMART  SM00322  kh_6
    coord: 88..168   score: 2.5E-8
    coord: 355..425  score: 0.13
    coord: 179..254  score: 5.8
IPR004088  K Homology domain, type 1  GENE3D  G3DSA:3.30.1370.10
    coord: 179..255  score: 5.0E-18
    coord: 356..416  score: 7.5E-10
    coord: 80..143   score: 1.5
IPR004088  K Homology domain, type 1  PFAM  PF00013  KH_1
    coord: 358..412  score: 7.2E-7
    coord: 183..250  score: 8.7E-14
    coord: 92..142   score: 4.6
IPR004088  K Homology domain, type 1  PROFILE  PS50084  KH_TYPE_1
    coord: 89..156   score: 13.9
    coord: 356..409  score: 11.007
    coord: 180..249  score: 1
IPR004088  K Homology domain, type 1  unknown  SSF54791  Eukaryotic type KH-domain (KH-domain type I)
    coord: 83..143   score: 6.04E-14
    coord: 348..416  score: 1.17E-9
    coord: 181..256  score: 7.01
None  No IPR available  PANTHER  PTHR10288  KH DOMAIN CONTAINING RNA BINDING PROTEIN
    coord: 1..471    score: 7.5E
None  No IPR available  PANTHER  PTHR10288:SF155  SUBFAMILY NOT NAMED
    coord: 1..471    score: 7.5E
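
The `coord: start..end` entries above give the position of each predicted domain on the protein. As a rough sketch of how they could be turned into domain spans, the snippet below pulls the three PFAM KH_1 ranges out of text and computes their lengths. It assumes the coordinates are 1-based and inclusive, which this page does not state; `COORD_RE` and `domain_spans` are hypothetical names.

```python
import re

# Matches entries such as "coord: 358..412" as they appear in the table.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(text: str) -> list[tuple[int, int, int]]:
    """Return (start, end, length) tuples sorted by start position.

    Length assumes 1-based inclusive coordinates (an assumption).
    """
    spans = [(int(a), int(b), int(b) - int(a) + 1) for a, b in COORD_RE.findall(text)]
    return sorted(spans)


# The three PFAM PF00013 (KH_1) hits listed for this gene.
pfam_kh1 = "coord: 358..412 coord: 183..250 coord: 92..142"
kh_domains = domain_spans(pfam_kh1)
for start, end, length in kh_domains:
    print(f"KH_1 {start}..{end} ({length} residues)")
```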

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG07g09850  Cp4.1LG11g09890  Cucurbita pepo (Zucchini)  cpecpeB139