ClCG03G011600 (gene) Watermelon (Charleston Gray)

Name: ClCG03G011600
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Nucleolar protein 8
Location: CG_Chr03 : 22851369 .. 22854543 (-)
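
The location string above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page). The names `parse_location` and `span_length` are hypothetical helpers introduced for illustration only.

```python
import re

# Hypothetical pattern for the "chromosome : start .. end (strand)" format used on this page.
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return (chromosome, start, end, strand) parsed from a location string."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("CG_Chr03 : 22851369 .. 22854543 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 3175 positions on the minus strand of CG_Chr03.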
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
GAAAATTTGAGTGTAACTTTCTCAGCGAATTGCATTTGGGCGACTGGTGGAAACCTGGAACTTCGAGCTCTAGGAAGAAGAAGAAAGAAGAAGGAAGAAAGAAGAAAGAAGAAAAGCCATGGAAGAAGAAGAAGAAAGCGCCTCCAAAAAGATGAGAATTTACGTCGGAGGACTGGGTGCCGCCACGACGGAAGACGATCTCAGAAAGGTTTTTCAGAGCGTCGGCGGCGTGGTGGAGGCTGTTGATTTCGTTCGTACCAAATCCCGCTGCTTCGCTTATGTCGACTTCTTTCCGTCATCCCAATCTTCCCTCTCCAAACTTTTCAGCACTGTAAGTTTCTCTCCTCGATTCCAATGCCAAACTCTCATTTCGTTGTTTCCTTCTTCTCTCGATTGGCGAGCGACGGCGTTCATTACCTTTCTTTTTCAAGAATGTTCATGTGCTTATCTCCGTAACTGGTTGCGATTTTAACCTAATGTTAGTTCGATTGATGTACTTTTCGTACCTCTTCATGAATTGAACGGCCTGTTCCTAGTTTAGCTGCTGAAAATTTCGATATTCTTATACACAGTACAATGGATGTGCTTGGAAAGGAGGAAAGTTAAGGCTTGAGAAAGCGAAGGAAAATTATCTTGCTCGTTTGAGACGGGAATGGGAGGAAGATGCTCAAATTACGGATAGTAATGTTGGTGCAGGCATGAAGGTTGTTGCTCCAGAATTTACTGAATATGTCACCAAGTCGGAGCACATTCAGATTTTCTTTCCAAGTTTAGGAGAGGTTGTTCTTTACTTATTGAGTTTTCATTTTGATATGATTCAATTAATGGTTCATTGAAGTCTTGAATATGAAATTTGTTACAGGTGAAGTCTTTGCCAATTAGTGGAACAGGGACGCACAAATATGACTTTCCACATGTTGAGGTGCCTCCTCTTCCTGTGCATTTTTGTGACTGTGAAGAACATAATGTTTCTGCTCCCACTGGCAATTTCAAGGACACAAAAACAAGAGATTTGAATGCTGAGGATGGTGGAATGGATGAAGATGAAATCAAGATGATGAATGCAGTGTTGAACAAGCTCTTTGAGAGGCAAGAAGCTTCTCAATCTAATTGTAATGGGTCCATGGCACACAATGATAAACATAACTCTACGACATTGATTGATAATCAACTACTTGAAGATATTAAAGAGGACAGTGATGAAGATAACCTTGTGCTTAATGTGGTGGCTAGTAACTGCAATTCCAAATCTATGCCATTGAACAGTGGAAATAAAAGCTTCAAAGCTCATGGGAACAGTAAGGTTAAGAATCATGCCACCATCACCACAATTACTGTTCTATCTGTTTGTTTATTGAGAGGCAATTTTTATTACAAGGGTTTGTTGCAGTGTTTTGGTTTTTTACATTTAATTAGAATTTATTATCTTAAACTTTGAGTTGAGGTATACTTATGAAAGTGGACCGTTTCAGGGTGCGGCCAGGGACCAGAAAAATAATAGTAGAGTTCAAAGCAAGAAAAGGAAATCTGTTACTAGTGAGGAATTTGATGGTAATGAATCTGTACCCAGCATCTCTACCAGCTATGGGGGCACTGATCCATCATATGATCCAGCTAGATCCTCAAGACCTCAAGCTCCTGATCGAGGTCCACTGATTCAACCTTCACGTTCTCAGAAATCTTCATGGAAAACACTTATTCATGATAAGAATAACGTTTCATTTAGCATCTCAGACATACTGTCTTCAGTTACTTCAGCAAATGAAGGGCAAGCAGAAGCAGAAGCAGATTATCTTAATCTAGCTCATTCAACTTCTATCAGAAATAGTGACCTTGCAACTGCCGCAGAATTAGGAAGCAAAACAGAAGAAATTCAATCCCAGAAGATCAATGTTTCATTCACCGTTACAGACGTGCTACCTGCAGTTCCTTCAGCAGATCAAGAGGAAGCTGCTTCTGCTGATCTGAATCTAGCTCATTCAACTCCTAACAGAAACACT
GACTTTGCAGCTGACCCAATATCAAAAAGCAAATCAGAAGAAATAAAATCTGTGGAGAGCTTCCCAGAAGCCGTATGTGCCGTTCCAAATGTCACCTCGAATAAAGGCAGAGGTTCTTCATGGCGGCAAAAATCTTCATGGACACAATTGGTCAGTGAGGAAATCACCTCCTTCAGTATTACGCAAATTTTACCAAATAATCCTTCTGAAAAGCAGGTACAAGGGGAATCTGATGTTATCAATGTTAATCTCTCTGCTCGGAGCGAAACTAATGCTTCAAAACAACGGGACAGTCAATGTATTGCTGAAGATGGTTCTGCTGCAATTGTAATTAGAAAAGATGAAACTGCCTGGAATAATGTCAAGAAGAATGAACCACCAGCAGTGGAAGAGAATAAGCCTTCTCCAGCCGAAATTATTGATAGTAATTTGCCACAAGTAGGTTCATTTGATGTAAACAGTGGAGAAACTTGCCCGTTTATGAGAAATTCTCGGTCGGTAGCAGAGTGGACAAAGATCAAAGCTGCGCTTTCTGGTGGTTCAAAGAAAAAAAAGCAGAGACAATAGATCACTCAATATCAAAGGTGTGTAACTCTTGAATCTTGCTTGAATGAGGGGCCGTGATTGTATCTGAACATAAATTGGTTTCATCTATCTAGGAAAAGTTTGATGCAGCTTTGAGTAGGTTCATGTTTTTGACTAGGCTTAAGACAGGGCTCTTGGAAATATTTTGCTTTAAACATAAAATGGTAATTTCATTTATGCATACAAAAATGAGATTGAATCTTGTTTTGACTAATTCTAAATTATTGTTTTTTATCTTATAGAAATCCCAAATTTGTAGTTGTATTTCTTATCTAGATATAATAGGATTGAGTGTGGTGCTCTTAGCCCATTAAAGGCATTTTTCAAAATTTTTGGGCAAAATATACATTTAGTCTTTTAGGTTTAAGATAGGTGTCTATTTGGTCTCTAAGGTTTTGAATTAATTTTAATTTACTTGTCATTGTCTTTCCTTTCCCTCTCCATCTCTTTCTCTTTGTCCTCTTTTCACACCGAGCTTCATGTCCAGAATGCACACCGACTTCTCCAATGGCTCGTGTCTAGTGACGTTGCCGGAGCTCACCTGGGATAAGGGGGTCCAAAGTCAATCCAGATAGGGGAGGTGTTGAAAG

mRNA sequence

GAAAATTTGAGTGTAACTTTCTCAGCGAATTGCATTTGGGCGACTGGTGGAAACCTGGAACTTCGAGCTCTAGGAAGAAGAAGAAAGAAGAAGGAAGAAAGAAGAAAGAAGAAAAGCCATGGAAGAAGAAGAAGAAAGCGCCTCCAAAAAGATGAGAATTTACGTCGGAGGACTGGGTGCCGCCACGACGGAAGACGATCTCAGAAAGGTTTTTCAGAGCGTCGGCGGCGTGGTGGAGGCTGTTGATTTCGTTCGTACCAAATCCCGCTGCTTCGCTTATGTCGACTTCTTTCCGTCATCCCAATCTTCCCTCTCCAAACTTTTCAGCACTTACAATGGATGTGCTTGGAAAGGAGGAAAGTTAAGGCTTGAGAAAGCGAAGGAAAATTATCTTGCTCGTTTGAGACGGGAATGGGAGGAAGATGCTCAAATTACGGATAGTAATGTTGGTGCAGGCATGAAGGTTGTTGCTCCAGAATTTACTGAATATGTCACCAAGTCGGAGCACATTCAGATTTTCTTTCCAAGTTTAGGAGAGGTGAAGTCTTTGCCAATTAGTGGAACAGGGACGCACAAATATGACTTTCCACATGTTGAGGTGCCTCCTCTTCCTGTGCATTTTTGTGACTGTGAAGAACATAATGTTTCTGCTCCCACTGGCAATTTCAAGGACACAAAAACAAGAGATTTGAATGCTGAGGATGGTGGAATGGATGAAGATGAAATCAAGATGATGAATGCAGTGTTGAACAAGCTCTTTGAGAGGCAAGAAGCTTCTCAATCTAATTGTAATGGGTCCATGGCACACAATGATAAACATAACTCTACGACATTGATTGATAATCAACTACTTGAAGATATTAAAGAGGACAGTGATGAAGATAACCTTGTGCTTAATGTGGTGGCTAGTAACTGCAATTCCAAATCTATGCCATTGAACAGTGGAAATAAAAGCTTCAAAGCTCATGGGAACAGTAAGGTTAAGAATCATGCCACCATCACCACAATTACTGTTCTATCTGGTGCGGCCAGGGACCAGAAAAATAATAGTAGAGTTCAAAGCAAGAAAAGGAAATCTGTTACTAGTGAGGAATTTGATGGTAATGAATCTGTACCCAGCATCTCTACCAGCTATGGGGGCACTGATCCATCATATGATCCAGCTAGATCCTCAAGACCTCAAGCTCCTGATCGAGGTCCACTGATTCAACCTTCACGTTCTCAGAAATCTTCATGGAAAACACTTATTCATGATAAGAATAACGTTTCATTTAGCATCTCAGACATACTGTCTTCAGTTACTTCAGCAAATGAAGGGCAAGCAGAAGCAGAAGCAGATTATCTTAATCTAGCTCATTCAACTTCTATCAGAAATAGTGACCTTGCAACTGCCGCAGAATTAGGAAGCAAAACAGAAGAAATTCAATCCCAGAAGATCAATGTTTCATTCACCGTTACAGACGTGCTACCTGCAGTTCCTTCAGCAGATCAAGAGGAAGCTGCTTCTGCTGATCTGAATCTAGCTCATTCAACTCCTAACAGAAACACTGACTTTGCAGCTGACCCAATATCAAAAAGCAAATCAGAAGAAATAAAATCTGTGGAGAGCTTCCCAGAAGCCGTATGTGCCGTTCCAAATGTCACCTCGAATAAAGGCAGAGGTTCTTCATGGCGGCAAAAATCTTCATGGACACAATTGGTCAGTGAGGAAATCACCTCCTTCAGTATTACGCAAATTTTACCAAATAATCCTTCTGAAAAGCAGGTACAAGGGGAATCTGATGTTATCAATGTTAATCTCTCTGCTCGGAGCGAAACTAATGCTTCAAAACAACGGGACAGTCAATGTATTGCTGAAGATGGTTCTGCTGCAATTGTAATTAGAAAAGATGAAACTGCCTGGAATAATGTCAAGAAGAATGAACCACCAGCAGTGGAAGAGAATAAGCCTTCTCCAGCCGAAATTATTGATAGTAATTTGCCACAAGTAGGTTCATTTG
ATGTAAACAGTGGAGAAACTTGCCCGTTTATGAGAAATTCTCGGTCGGTAGCAGAGTGGACAAAGATCAAAGCTGCGCTTTCTGGTGGTTCAAAGAAAAAAAAGCAGAGACAATAGATCACTCAATATCAAAGAATGCACACCGACTTCTCCAATGGCTCGTGTCTAGTGACGTTGCCGGAGCTCACCTGGGATAAGGGGGTCCAAAGTCAATCCAGATAGGGGAGGTGTTGAAAG

Coding sequence (CDS)

ATGGAAGAAGAAGAAGAAAGCGCCTCCAAAAAGATGAGAATTTACGTCGGAGGACTGGGTGCCGCCACGACGGAAGACGATCTCAGAAAGGTTTTTCAGAGCGTCGGCGGCGTGGTGGAGGCTGTTGATTTCGTTCGTACCAAATCCCGCTGCTTCGCTTATGTCGACTTCTTTCCGTCATCCCAATCTTCCCTCTCCAAACTTTTCAGCACTTACAATGGATGTGCTTGGAAAGGAGGAAAGTTAAGGCTTGAGAAAGCGAAGGAAAATTATCTTGCTCGTTTGAGACGGGAATGGGAGGAAGATGCTCAAATTACGGATAGTAATGTTGGTGCAGGCATGAAGGTTGTTGCTCCAGAATTTACTGAATATGTCACCAAGTCGGAGCACATTCAGATTTTCTTTCCAAGTTTAGGAGAGGTGAAGTCTTTGCCAATTAGTGGAACAGGGACGCACAAATATGACTTTCCACATGTTGAGGTGCCTCCTCTTCCTGTGCATTTTTGTGACTGTGAAGAACATAATGTTTCTGCTCCCACTGGCAATTTCAAGGACACAAAAACAAGAGATTTGAATGCTGAGGATGGTGGAATGGATGAAGATGAAATCAAGATGATGAATGCAGTGTTGAACAAGCTCTTTGAGAGGCAAGAAGCTTCTCAATCTAATTGTAATGGGTCCATGGCACACAATGATAAACATAACTCTACGACATTGATTGATAATCAACTACTTGAAGATATTAAAGAGGACAGTGATGAAGATAACCTTGTGCTTAATGTGGTGGCTAGTAACTGCAATTCCAAATCTATGCCATTGAACAGTGGAAATAAAAGCTTCAAAGCTCATGGGAACAGTAAGGTTAAGAATCATGCCACCATCACCACAATTACTGTTCTATCTGGTGCGGCCAGGGACCAGAAAAATAATAGTAGAGTTCAAAGCAAGAAAAGGAAATCTGTTACTAGTGAGGAATTTGATGGTAATGAATCTGTACCCAGCATCTCTACCAGCTATGGGGGCACTGATCCATCATATGATCCAGCTAGATCCTCAAGACCTCAAGCTCCTGATCGAGGTCCACTGATTCAACCTTCACGTTCTCAGAAATCTTCATGGAAAACACTTATTCATGATAAGAATAACGTTTCATTTAGCATCTCAGACATACTGTCTTCAGTTACTTCAGCAAATGAAGGGCAAGCAGAAGCAGAAGCAGATTATCTTAATCTAGCTCATTCAACTTCTATCAGAAATAGTGACCTTGCAACTGCCGCAGAATTAGGAAGCAAAACAGAAGAAATTCAATCCCAGAAGATCAATGTTTCATTCACCGTTACAGACGTGCTACCTGCAGTTCCTTCAGCAGATCAAGAGGAAGCTGCTTCTGCTGATCTGAATCTAGCTCATTCAACTCCTAACAGAAACACTGACTTTGCAGCTGACCCAATATCAAAAAGCAAATCAGAAGAAATAAAATCTGTGGAGAGCTTCCCAGAAGCCGTATGTGCCGTTCCAAATGTCACCTCGAATAAAGGCAGAGGTTCTTCATGGCGGCAAAAATCTTCATGGACACAATTGGTCAGTGAGGAAATCACCTCCTTCAGTATTACGCAAATTTTACCAAATAATCCTTCTGAAAAGCAGGTACAAGGGGAATCTGATGTTATCAATGTTAATCTCTCTGCTCGGAGCGAAACTAATGCTTCAAAACAACGGGACAGTCAATGTATTGCTGAAGATGGTTCTGCTGCAATTGTAATTAGAAAAGATGAAACTGCCTGGAATAATGTCAAGAAGAATGAACCACCAGCAGTGGAAGAGAATAAGCCTTCTCCAGCCGAAATTATTGATAGTAATTTGCCACAAGTAGGTTCATTTGATGTAAACAGTGGAGAAACTTGCCCGTTTATGAGAAATTCTCGGTCGGTAGCAGAGTGGACAAAGATCAAAGCTGCGCTTTCTGGTGGTTCAAAGAAAAAAAAGCAGAGACAATAG

Protein sequence

MEEEEESASKKMRIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVRTKSRCFAYVDFFPSSQSSLSKLFSTYNGCAWKGGKLRLEKAKENYLARLRREWEEDAQITDSNVGAGMKVVAPEFTEYVTKSEHIQIFFPSLGEVKSLPISGTGTHKYDFPHVEVPPLPVHFCDCEEHNVSAPTGNFKDTKTRDLNAEDGGMDEDEIKMMNAVLNKLFERQEASQSNCNGSMAHNDKHNSTTLIDNQLLEDIKEDSDEDNLVLNVVASNCNSKSMPLNSGNKSFKAHGNSKVKNHATITTITVLSGAARDQKNNSRVQSKKRKSVTSEEFDGNESVPSISTSYGGTDPSYDPARSSRPQAPDRGPLIQPSRSQKSSWKTLIHDKNNVSFSISDILSSVTSANEGQAEAEADYLNLAHSTSIRNSDLATAAELGSKTEEIQSQKINVSFTVTDVLPAVPSADQEEAASADLNLAHSTPNRNTDFAADPISKSKSEEIKSVESFPEAVCAVPNVTSNKGRGSSWRQKSSWTQLVSEEITSFSITQILPNNPSEKQVQGESDVINVNLSARSETNASKQRDSQCIAEDGSAAIVIRKDETAWNNVKKNEPPAVEENKPSPAEIIDSNLPQVGSFDVNSGETCPFMRNSRSVAEWTKIKAALSGGSKKKKQRQ
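
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch only, assuming the standard nuclear genetic code; the function name `translate` and the table construction are illustrative and not part of the database. Only the first 60 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides of the CDS listed above.
cds_prefix = "ATGGAAGAAGAAGAAGAAAGCGCCTCCAAAAAGATGAGAATTTACGTCGGAGGACTGGGT"
print(translate(cds_prefix))  # MEEEEESASKKMRIYVGGLG
```

The output matches the first 20 residues of the protein sequence; passing the full CDS string to the same function should reproduce the complete protein up to the terminal stop codon.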
BLAST of ClCG03G011600 vs. Swiss-Prot
Match: SRP40_YEAST (Suppressor protein SRP40 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=SRP40 PE=1 SV=2)

HSP 1 Score: 67.8 bits (164), Expect = 5.2e-10
Identity = 72/336 (21.43%), Positives = 139/336 (41.37%), Query Frame = 1

Query: 312 RVQSKKRKSVTSEEFDGNESVPSISTSYGGTDPSYDPARSSRPQAPDRGPLIQPSRSQKS 371
           +V    + SV  +E +  +S  S S+S   +  S   + SS             S S  S
Sbjct: 7   KVDEVPKLSVKEKEIE-EKSSSSSSSSSSSSSSSSSSSSSSSSSGESSSSSSSSSSSSSS 66

Query: 372 SWKTLIHDKNNVSFSISDILSSVTSANEGQAEAEADYLNLAHSTSIRNSDLATAAELGSK 431
                   +++ S S S   SS +S +E  +E+++     + S+S  + + ++ +E   +
Sbjct: 67  DSSDSSDSESSSSSSSSSSSSSSSSDSESSSESDSSSSGSSSSSSSSSDESSSESESEDE 126

Query: 432 TEEIQSQKINVSFTVTDVLPAVP--SADQEEAASADLNLAHSTPNRNTDFAADPISKSKS 491
           T++   +  N     T      P  S+  E ++S   + + S     +D  +   S S S
Sbjct: 127 TKKRARESDNEDAKETKKAKTEPESSSSSESSSSGSSSSSESESGSESDSDSSSSSSSSS 186

Query: 492 EEIKSVESFPEAVCAVPNVTSNKGRGSSWRQKSSWTQLVSEEITSFSITQILPNNPSEKQ 551
           +     ES  ++  +  +  S+    SS    SS +   S   +S S +    ++ S+  
Sbjct: 187 DSESDSESDSQSSSSSSSSDSSSDSDSSSSDSSSDSDSSSSSSSSSSDSDSDSDSSSDSD 246

Query: 552 VQGESDVINVNLSARSETNASKQRDSQCIAEDGSAAIVIRKDETAWNNVKKNEPPAVEEN 611
             G SD  + + S+  E+ +S   DS   ++ GS++ +  K+ TA  +  +  P +  E+
Sbjct: 247 SSGSSDSSSSSDSSSDESTSSDSSDSDSDSDSGSSSELETKEATADESKAEETPASSNES 306

Query: 612 KPSPAEIIDSNLPQV--GSFDVNSGETCPFMRNSRS 644
            PS +    +N   +  G+ ++  G+   F R  RS
Sbjct: 307 TPSASSSSSANKLNIPAGTDEIKEGQRKHFSRVDRS 341
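
The percentages in an HSP summary line are the identical and positive-scoring column counts divided by the alignment length. The sketch below recomputes them from the summary line of the HSP above; `recompute_percentages` is a hypothetical helper, and the regular expression is an assumption about this report's layout rather than a general BLAST parser.

```python
import re

# Assumed layout of the HSP summary line in this report.
# The optional "i" tolerates the spelling "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


line = "Identity = 72/336 (21.43%), Positives = 139/336 (41.37%), Query Frame = 1"
print(recompute_percentages(line))  # (21.43, 41.37)
```

The denominator (336) is the alignment length including gap columns, which is why it can exceed the number of query residues covered (312 to 644).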

BLAST of ClCG03G011600 vs. Swiss-Prot
Match: NOL8_MOUSE (Nucleolar protein 8 OS=Mus musculus GN=Nol8 PE=1 SV=2)

HSP 1 Score: 57.8 bits (138), Expect = 5.3e-07
Identity = 34/94 (36.17%), Positives = 49/94 (52.13%), Query Frame = 1

Query: 13  RIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVR-----TKSRCFAYVDFFPSSQSSLSK 72
           R++VGGLG   +E DL+  F   G V +     R        + FAYV+    +++ L K
Sbjct: 9   RLFVGGLGQGISETDLQNQFGRFGEVSDVEIITRKDDQGNSQKVFAYVNI-QITEADLKK 68

Query: 73  LFSTYNGCAWKGGKLRLEKAKENYLARLRREWEE 102
             S  N   WKGG L+++ AKE++L RL +E E+
Sbjct: 69  CMSILNKTKWKGGTLQIQLAKESFLHRLAQERED 101

BLAST of ClCG03G011600 vs. Swiss-Prot
Match: NOL8_HUMAN (Nucleolar protein 8 OS=Homo sapiens GN=NOL8 PE=1 SV=1)

HSP 1 Score: 55.5 bits (132), Expect = 2.7e-06
Identity = 33/93 (35.48%), Positives = 47/93 (50.54%), Query Frame = 1

Query: 13  RIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVR-----TKSRCFAYVDFFPSSQSSLSK 72
           R+YVGGL    +E DL+  F   G V +     R        + FAY++    +++ L K
Sbjct: 9   RLYVGGLSQDISEADLQNQFSRFGEVSDVEIITRKDDQGNPQKVFAYINI-SVAEADLKK 68

Query: 73  LFSTYNGCAWKGGKLRLEKAKENYLARLRREWE 101
             S  N   WKGG L+++ AKE++L RL +E E
Sbjct: 69  CMSVLNKTKWKGGTLQIQLAKESFLHRLAQERE 100

BLAST of ClCG03G011600 vs. Swiss-Prot
Match: SRAP_STAAN (Serine-rich adhesin for platelets OS=Staphylococcus aureus (strain N315) GN=sraP PE=1 SV=1)

HSP 1 Score: 54.7 bits (130), Expect = 4.5e-06
Identity = 75/354 (21.19%), Positives = 144/354 (40.68%), Query Frame = 1

Query: 235  NSTTLIDNQLLEDIKEDSDEDNLVLNVVASNC----------NSKSMPLNSGNKSFKAHG 294
            NST+L  +  L D   DS  D+L  ++  S+            S S+  ++      +  
Sbjct: 1293 NSTSLSTS--LSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSLSGSTSESESDSTS 1352

Query: 295  NSKVKNHATITTITV---LSGAARDQKNNSRVQSKKRKSVTSEEFDGNESVPSISTSYGG 354
            +S+ K+ +T  +I++    SG+     + S   S       S   +    V S S S   
Sbjct: 1353 SSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMN-QSGVDSNSASQSA 1412

Query: 355  TDPSYDPARSSRPQAPDRGPLIQPSRSQKSSWKTLIHDKNNVSFSISDILSSVTSANEGQ 414
            ++ +      S  Q+         S+S+ +S  T + D  ++S S S   S+ TSA+   
Sbjct: 1413 SNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTSTSASLSG 1472

Query: 415  AEAEADYLNLAHSTSIRNSDLATAAELGSKTEEIQSQKINVSFTVTDVLPA----VPSAD 474
            +E+E+D  +++ S S   S+ A+ + L   T    S   + S ++++   A      S  
Sbjct: 1473 SESESDSQSISTSASESTSESASTS-LSDSTSTSNSGSASTSTSLSNSASASESDSSSTS 1532

Query: 475  QEEAASADLNLAHSTPNRNTDFAADPISKSKSEEIKSVESFPEAVCAVPNVTSNKGRGSS 534
              ++ SA +  + S     +   +D +S S S  + ++ S   +V    + ++++    S
Sbjct: 1533 LSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTSESTSES 1592

Query: 535  WRQKSSWTQLVSEEITSFSITQILPNNPSEKQVQGESDVINVNLSARSETNASK 572
                S+ T L   + TS S +     + S       S   + + S R+ T+ S+
Sbjct: 1593 ---DSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQ 1639


HSP 2 Score: 52.0 bits (123), Expect = 2.9e-05
Identity = 76/351 (21.65%), Positives = 146/351 (41.60%), Query Frame = 1

Query: 225  NGSMAHNDKHNSTTLIDNQLLEDIKEDSDEDNLVLNVVASNCNSKSMPLN---SGNKSFK 284
            +GS++ +D   S ++  +  +   +  S  ++L  +   S+ +SKS+ L+   SG+ S  
Sbjct: 1018 SGSLSASD---SKSMSVSSSMSTSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTS 1077

Query: 285  AHGNSKVKNHATITTITVLSGAARDQKNNSRVQSKKRKSVTSEEFDGNESVP-SISTSYG 344
               ++ V+   + +T   +S +  D  + S   S       S     +ES+  S STS  
Sbjct: 1078 TSTSASVRTSESQSTSGSMSASQSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTS 1137

Query: 345  GTDPSYDPARSSRPQAPDRGPLIQPSRSQKSSWKTLIHDKNNVSFSISDILSSVTSANEG 404
            G+  +                 +  S S+++S  T + D  ++S S SD +S  TS ++ 
Sbjct: 1138 GSVST--------------STSLSTSNSERTS--TSVSDSTSLSTSESDSISESTSTSDS 1197

Query: 405  QAEAEADYLNLAHSTSIRNSDLATAAELGSKTEEIQSQKINVSFTVTDVLPAVPSADQEE 464
             +EA    ++ + STSI  S+  + ++      E QS    +S ++++      S     
Sbjct: 1198 ISEA----ISASESTSISLSESNSTSD-----SESQSASAFLSESLSESTSESTSESVSS 1257

Query: 465  AASADLNLAHSTPNRNTDFAADPISKSKSEEIKSVESFPEAVCAVPNVTSNKGRGSSWRQ 524
            + S   +L+ ST    +   +   S S S  I +  S  E+              S+++ 
Sbjct: 1258 STSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISEST-------------STFKS 1317

Query: 525  KSSWTQLVSEEITSFSITQILPNNPSEKQVQGESDVINVNLSARSETNASK 572
            +S  T L     TS S +  L  + S+     +SD ++ ++S     + SK
Sbjct: 1318 ESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSK 1327

BLAST of ClCG03G011600 vs. Swiss-Prot
Match: SRAP_STAAM (Serine-rich adhesin for platelets OS=Staphylococcus aureus (strain Mu50 / ATCC 700699) GN=sraP PE=3 SV=1)

HSP 1 Score: 54.7 bits (130), Expect = 4.5e-06
Identity = 75/354 (21.19%), Positives = 144/354 (40.68%), Query Frame = 1

Query: 235  NSTTLIDNQLLEDIKEDSDEDNLVLNVVASNC----------NSKSMPLNSGNKSFKAHG 294
            NST+L  +  L D   DS  D+L  ++  S+            S S+  ++      +  
Sbjct: 1293 NSTSLSTS--LSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSLSGSTSESESDSTS 1352

Query: 295  NSKVKNHATITTITV---LSGAARDQKNNSRVQSKKRKSVTSEEFDGNESVPSISTSYGG 354
            +S+ K+ +T  +I++    SG+     + S   S       S   +    V S S S   
Sbjct: 1353 SSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMN-QSGVDSNSASQSA 1412

Query: 355  TDPSYDPARSSRPQAPDRGPLIQPSRSQKSSWKTLIHDKNNVSFSISDILSSVTSANEGQ 414
            ++ +      S  Q+         S+S+ +S  T + D  ++S S S   S+ TSA+   
Sbjct: 1413 SNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTSTSASLSG 1472

Query: 415  AEAEADYLNLAHSTSIRNSDLATAAELGSKTEEIQSQKINVSFTVTDVLPA----VPSAD 474
            +E+E+D  +++ S S   S+ A+ + L   T    S   + S ++++   A      S  
Sbjct: 1473 SESESDSQSISTSASESTSESASTS-LSDSTSTSNSGSASTSTSLSNSASASESDSSSTS 1532

Query: 475  QEEAASADLNLAHSTPNRNTDFAADPISKSKSEEIKSVESFPEAVCAVPNVTSNKGRGSS 534
              ++ SA +  + S     +   +D +S S S  + ++ S   +V    + ++++    S
Sbjct: 1533 LSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTSESTSES 1592

Query: 535  WRQKSSWTQLVSEEITSFSITQILPNNPSEKQVQGESDVINVNLSARSETNASK 572
                S+ T L   + TS S +     + S       S   + + S R+ T+ S+
Sbjct: 1593 ---DSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQ 1639


HSP 2 Score: 52.0 bits (123), Expect = 2.9e-05
Identity = 76/351 (21.65%), Positives = 146/351 (41.60%), Query Frame = 1

Query: 225  NGSMAHNDKHNSTTLIDNQLLEDIKEDSDEDNLVLNVVASNCNSKSMPLN---SGNKSFK 284
            +GS++ +D   S ++  +  +   +  S  ++L  +   S+ +SKS+ L+   SG+ S  
Sbjct: 1018 SGSLSASD---SKSMSVSSSMSTSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTS 1077

Query: 285  AHGNSKVKNHATITTITVLSGAARDQKNNSRVQSKKRKSVTSEEFDGNESVP-SISTSYG 344
               ++ V+   + +T   +S +  D  + S   S       S     +ES+  S STS  
Sbjct: 1078 TSTSASVRTSESQSTSGSMSASQSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTS 1137

Query: 345  GTDPSYDPARSSRPQAPDRGPLIQPSRSQKSSWKTLIHDKNNVSFSISDILSSVTSANEG 404
            G+  +                 +  S S+++S  T + D  ++S S SD +S  TS ++ 
Sbjct: 1138 GSVST--------------STSLSTSNSERTS--TSVSDSTSLSTSESDSISESTSTSDS 1197

Query: 405  QAEAEADYLNLAHSTSIRNSDLATAAELGSKTEEIQSQKINVSFTVTDVLPAVPSADQEE 464
             +EA    ++ + STSI  S+  + ++      E QS    +S ++++      S     
Sbjct: 1198 ISEA----ISASESTSISLSESNSTSD-----SESQSASAFLSESLSESTSESTSESVSS 1257

Query: 465  AASADLNLAHSTPNRNTDFAADPISKSKSEEIKSVESFPEAVCAVPNVTSNKGRGSSWRQ 524
            + S   +L+ ST    +   +   S S S  I +  S  E+              S+++ 
Sbjct: 1258 STSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISEST-------------STFKS 1317

Query: 525  KSSWTQLVSEEITSFSITQILPNNPSEKQVQGESDVINVNLSARSETNASK 572
            +S  T L     TS S +  L  + S+     +SD ++ ++S     + SK
Sbjct: 1318 ESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSK 1327

BLAST of ClCG03G011600 vs. TrEMBL
Match: A0A0A0LXQ1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G651690 PE=4 SV=1)

HSP 1 Score: 993.8 bits (2568), Expect = 1.0e-286
Identity = 521/672 (77.53%), Positives = 563/672 (83.78%), Query Frame = 1

Query: 3   EEEESASKKMRIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVRTKSRCFAYVDFFPSSQ 62
           E+ +SAS+ MRIYVGGLGAA TEDDLRKVF SVGGVVEAVDFVRTKSR FAYVDFFPSSQ
Sbjct: 2   EKGQSASENMRIYVGGLGAAMTEDDLRKVFHSVGGVVEAVDFVRTKSRSFAYVDFFPSSQ 61

Query: 63  SSLSKLFSTYNGCAWKGGKLRLEKAKENYLARLRREWEEDAQITDSNVGAGMKVVAPEFT 122
           SSLSKLFSTYNGCAWKGGKLRLEKAKENYLARL REWEEDAQI D+NVGA M++VAPE T
Sbjct: 62  SSLSKLFSTYNGCAWKGGKLRLEKAKENYLARLNREWEEDAQIRDNNVGADMELVAPEST 121

Query: 123 EYVTKSEHIQIFFPSLGEVKSLPISGTGTHKYDFPHVEVPPLPVHFCDCEEHNVSAPTGN 182
           E+VTKSEHI IFFPSLGEVK LPISGTGTHKYDFPHVEVPP PVHFCDCEEHN S+P GN
Sbjct: 122 EHVTKSEHINIFFPSLGEVKPLPISGTGTHKYDFPHVEVPPFPVHFCDCEEHNASSPIGN 181

Query: 183 FKDTKTRDLNAEDGGMDEDEIKMMNAVLNKLFERQEASQSNCNGSMAHNDKHNSTTLIDN 242
            K TKTRDLNAE+GGMDEDEIKMMNAVL+KLFER+EASQSNCN SMA NDKHNSTT  DN
Sbjct: 182 SKYTKTRDLNAENGGMDEDEIKMMNAVLSKLFERKEASQSNCNDSMALNDKHNSTTSTDN 241

Query: 243 QLLEDIKEDSDEDNLVLNVVASNCNSKSMPLNSGNKSFKAHGNSKVKNHATITTITVLSG 302
           QLLED K DSDEDNLVLNV+ASNCNSK+M LN GNK FKAHGNSK               
Sbjct: 242 QLLEDNKVDSDEDNLVLNVMASNCNSKTMALNRGNKIFKAHGNSK--------------D 301

Query: 303 AARDQKNNSRVQSKKRKSVTSEEFDGNESVPSISTSYGGTDPSYDPARSSRPQAPDRGPL 362
           A RDQKNN RVQSKKRKS  SEEFDGNESVPSI TS  GTDPSYDPARSSRPQAPDRGP 
Sbjct: 302 AVRDQKNNCRVQSKKRKSFISEEFDGNESVPSIFTSNRGTDPSYDPARSSRPQAPDRGPP 361

Query: 363 IQPSRSQKSSWKTLIHDKNNVSFSISDILSSVTSANEGQAEAEADYLNLAHSTSIRNSDL 422
           +Q  RSQKSSWKTLI DK+NVSF ISDILSSV SANE   +AEAD LN+AHST  RNS+L
Sbjct: 362 VQSLRSQKSSWKTLIRDKSNVSFCISDILSSVPSANE--EKAEADDLNIAHSTPNRNSNL 421

Query: 423 ATAAELGSKTEEIQSQKINVSFTVTDVLPAV--------PSADQEEAASADLNLAHSTPN 482
           A+ A LGS+ +EIQS KINV F++TDVLP V         SADQE+AASADLNLAHSTPN
Sbjct: 422 ASTAVLGSEIDEIQSGKINVPFSITDVLPLVLSADQEKAASADQEKAASADLNLAHSTPN 481

Query: 483 RNTDFAADPISKSKSEEIKSVESFPEAVCAVPNVTSNKGRGSSWRQKSSWTQLVSEEITS 542
            NTD  ADPISKSKSEE++SVESF +A C VPNVT NKGRGSSWR+KSSWTQLVSEE TS
Sbjct: 482 INTDVGADPISKSKSEEMESVESFQDAQCTVPNVTLNKGRGSSWRKKSSWTQLVSEEFTS 541

Query: 543 FSITQILPNNPSEKQVQGESDVINVNLSARSETNASKQRDSQCIAEDGSAAIVIRKDETA 602
           FSITQILPN+ SE QVQGES  IN N SA SETNA +++DS+CIA+D S A VI K E  
Sbjct: 542 FSITQILPNSTSENQVQGESGDINANFSAWSETNAPRKQDSECIAKDESTAFVIGKGEIG 601

Query: 603 WNNVKKNEPPAVEENKPSPAEIIDSNLP-QVGSFDVNSGETCPFMRNSRSVAEWTKIKAA 662
            N+VK+NEP AV+E +  P +I +SN P Q GSFD  SG+TCPFMRNS+SVAEWTKIKAA
Sbjct: 602 CNDVKQNEPQAVQECETCPTQITESNFPQQEGSFDEISGDTCPFMRNSQSVAEWTKIKAA 657

Query: 663 LSGGSKKKKQRQ 666
           LSGGSKKKKQRQ
Sbjct: 662 LSGGSKKKKQRQ 657

BLAST of ClCG03G011600 vs. TrEMBL
Match: A0A061GQW7_THECC (RNA-binding family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_039820 PE=4 SV=1)

HSP 1 Score: 334.0 bits (855), Expect = 4.3e-88
Identity = 234/692 (33.82%), Positives = 368/692 (53.18%), Query Frame = 1

Query: 7   SASKKMRIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVRTKSRCFAYVDFFPSSQSSLS 66
           +A+   RI+VGGLG + + DDLRKVF +VG  VE +D +R K R FAYVD  PSS +SLS
Sbjct: 2   AATASTRIHVGGLGQSVSSDDLRKVFSAVG-TVEGLDIIRAKGRSFAYVDILPSSSNSLS 61

Query: 67  KLFSTYNGCAWKGGKLRLEKAKENYLARLRREW-EEDAQITDSNVGAGMKVVAPEFTEYV 126
           KLF+TYNGC WKGGKL+L KAKE+YL RL+REW +E+ +     + +           +V
Sbjct: 62  KLFNTYNGCVWKGGKLKLGKAKEHYLTRLKREWAKEEEEAHHQPMPSSSDEPYNGNKVHV 121

Query: 127 TKSEHIQIFFPSLGEVKSLPISGTGTHKYDFPHVEVPPLPVHFCDCEEHNVSAPTGNFKD 186
           ++  H++IFFP L  VKSLP+SGTG HKY F  VEV  LP+HFCDCEEH     +G+F  
Sbjct: 122 SQQGHLRIFFPRLTRVKSLPLSGTGKHKYSFQRVEVSALPIHFCDCEEH-----SGHFNA 181

Query: 187 TKTRD-LNAED--GGMDEDEIKMMNAVLNKLFERQEASQSNCNGSMAHNDKHNSTTLIDN 246
            + ++  N E+  G M+E+E+ MM++V+NKLFER  A+ SN + ++  +++ + T LI+ 
Sbjct: 182 VRRKEGQNHEEINGVMNEEEVSMMSSVMNKLFER--ANISNTSSAILADEREDFTKLIEG 241

Query: 247 QLLEDIKEDSDEDNLVLNVVASNCNSKSMPLNSGNKSFKAHGNSKVKNHAT-ITTITVLS 306
            L ++  E++D+D+L++NVV+ + N  +M   SG++  KA    K     T I+    + 
Sbjct: 242 PLSDE--EETDDDDLIINVVSDSNNRAAM---SGSREKKAVSTEKTGLGETHISNYGAIR 301

Query: 307 GAARDQKNNSRVQSKKRKSVTSEEFDGNESVPSISTSYGGTDPSYDPARSSRPQAPDRGP 366
            A + Q+NN+    KKRK + ++E D ++ V         ++     A     +A +   
Sbjct: 302 SACKVQENNTLHPRKKRKPLPNKEEDKHQLVSLFHGQRRNSEFDDSNADFEENEANEDNL 361

Query: 367 LIQ-PSRSQKSSWKTLIHDKNNVS--FSISDILSSVTSANEGQAEAEADYLNLAHSTSIR 426
           +I   S + K S +T + DK ++   F  S+  +S     + + + + D + L +     
Sbjct: 362 MINIVSMANKRSGRTKL-DKVSLKQRFKSSEKQTSEDGPIQNEHKVQKDDILLPNRNEKG 421

Query: 427 NSDL----------ATAAELGSKTEE-------------IQSQKINVSFTVTDVLPAVPS 486
           N              T AE G K                +   + N +F+++++L  V +
Sbjct: 422 NVQTQSNESVVVAQTTGAECGLKQSNTSCSWSQKSSWRALVGDRSNSAFSLSNILQNVGT 481

Query: 487 ADQEEAASADLNLAHSTPNRNTDFAADPISKSK--SEEIKSVESFPEAVCAVPNVTSNKG 546
             +++  S    +  +  +RN + A     +      EI  VE  P         +SN G
Sbjct: 482 TKEKQQISDGCKVNKTLDSRNGNLAKPKNLEGMLGKTEIVDVEPQPN---QPKTASSNSG 541

Query: 547 RGSSWRQKSSWTQLVSEEITSFSITQILPNNPSEKQVQGESDVINVNLSARSETNASKQR 606
           RGSSW  KSSW QLVSE  +SFSI++I+P + ++++         V  +  + +N +K  
Sbjct: 542 RGSSWLHKSSWMQLVSENSSSFSISEIVPGSTTKQECTKPIYEDVVYSADGNHSNKTKSH 601

Query: 607 DSQCIAEDGSAAIVIRKDETAWNNVKKNEPPAVEENKPSPAEIID---SNLPQVGSFDVN 663
            S+     GS A+ +RK+     ++ ++    V  N  +    ++   S   +    D +
Sbjct: 602 KSEPTVY-GSPALGVRKEGDTVRSIPESNQQTVVGNTDASVPTVEKCNSEPDKAFGGDTS 661

BLAST of ClCG03G011600 vs. TrEMBL
Match: A0A151RVC0_CAJCA (Nucleolar protein 8 OS=Cajanus cajan GN=KK1_031900 PE=4 SV=1)

HSP 1 Score: 325.1 bits (832), Expect = 2.0e-85
Identity = 230/671 (34.28%), Positives = 350/671 (52.16%), Query Frame = 1

Query: 1   MEEEEESASKKMRIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVRTKSRCFAYVDFFPS 60
           MEEE +     +RI+VGGLG A +++DLR +F S+G  V+ +  +RTK R FAY+DF  +
Sbjct: 1   MEEEAKETQSAVRIFVGGLGEAVSDEDLRTLFSSLG-TVQTIRTIRTKGRSFAYLDFL-T 60

Query: 61  SQSSLSKLFSTYNGCAWKGGKLRLEKAKENYLARLRREWEEDAQITDSNVGAGMKVVAPE 120
              SLSKLFS YNGC WKGG+L+LEKAKE+YL RL+REWE++A        A      P 
Sbjct: 61  DPKSLSKLFSKYNGCLWKGGRLKLEKAKEDYLTRLKREWEQEAL-------ADATQPPPV 120

Query: 121 FTEYVTKSEHIQIFFPSLGEVKSLPISGTGTHKYDFPHVEVPPLPVHFCDCEEHNVSAPT 180
             + V KS  + +FFP L +VKS+P +GTG HKY F +++VPPLPVHFCDCEEH   +P 
Sbjct: 121 VPQEVPKS--LSVFFPRLRKVKSIPFNGTGKHKYSFQNIKVPPLPVHFCDCEEH--CSPF 180

Query: 181 GNFKDTKTRDLNAEDGGMDEDEIKMMNAVLNKLFERQEASQSNCNGSMAHNDKHNSTTLI 240
            N ++ ++ D  AE GGM+++EI +MNAV+NKLFE+++ S +   G     D   S   +
Sbjct: 181 VNEREKQSIDGAAESGGMNDEEINIMNAVMNKLFEKEKVSNAKNLGE--EKDSFESPDAL 240

Query: 241 DNQLLEDIKEDSDEDNLVLNVVASNCNSKSMPLNSGNKSF-KAHGNSKVKNHATITTITV 300
            +   ED    +DED+L++N+       K+     GN+   K   N +  N   +     
Sbjct: 241 HSDECED--SATDEDDLIINMQ----TRKNKTALIGNQELEKILENQEWSNKTKVD---- 300

Query: 301 LSGAARDQKNNSRVQSKKRKSVTSEEFDGNESVP--SISTSYGGTDP----SYDPARSSR 360
                +++ N S  Q +KR +    +    +S+P   +ST+ GG       S +    ++
Sbjct: 301 -----KEEPNKSTPQVQKRNNSNPAKNKKRKSLPKLEVSTTSGGKSNMQTLSDEEGSDAQ 360

Query: 361 PQ--APDRGPLIQPSRSQKSSWKTLIHDKNNVSFSISDILSSVTSANEGQAEAEADYLNL 420
           P     D   L + S SQKSSW+ L+ D  N SFS S IL  + S   G+ +  +++L+ 
Sbjct: 361 PTELEDDVEELTKVSWSQKSSWRELLGDGGNTSFSASLILPELDS---GKNQQSSEHLST 420

Query: 421 AHSTSIRNSDLATAAELGSKTEEIQSQKINVSFTVTDVLPAVPSADQEEAASADLNLAHS 480
           + ST     ++ +   L SK+   Q         + ++    P+ +Q     A+      
Sbjct: 421 SISTDSETENMESDGHLWSKSTNTQ--------VIKELAEVQPTNEQVIKELAE------ 480

Query: 481 TPNRNTDFAADPISKSKSEEIKSVESFPEAVCAVPNVTSNKGRGSSWRQKSSWTQLVSEE 540
                    A P +K   E++  +++       VPN T   GRG+SWR+K SWTQLVSE 
Sbjct: 481 ---------AQPANKQVIEDLNEMQNNNN---VVPNKT---GRGASWRRKQSWTQLVSEN 540

Query: 541 ITSFSITQILPNNPSEKQVQGESDVINVNLSARSETNASKQRDSQCIAEDGSAAIVIRKD 600
            +SFSI+ ILP     + +  E  ++   +S   + N   Q     +  DG  +  I  +
Sbjct: 541 NSSFSISHILPGITFPEPMAKE-PIMEPAISNDCKHNDVAQDTIDKVLSDGFNSREIIPE 600

Query: 601 ETAWNNVKKNEPPAVEENKPSPAEIIDSNLPQVGSFDVNSGETCPFMRNSRSVAEWTKIK 660
                      P +V E K      ++++  +    +V  GETC FMR++ S+ +W K K
Sbjct: 601 RIQHIGANDIVPGSVAEEK------VETSPRERSIENVEVGETCRFMRSAASLKDWAKAK 602

Query: 661 AALSGGSKKKK 663
           AA+S   K+K+
Sbjct: 661 AAISRSLKRKR 602

BLAST of ClCG03G011600 vs. TrEMBL
Match: V4KX69_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10012942mg PE=4 SV=1)

HSP 1 Score: 324.7 bits (831), Expect = 2.6e-85
Identity = 240/687 (34.93%), Positives = 347/687 (50.51%), Query Frame = 1

Query: 12  MRIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVRTKSRCFAYVDFFPSSQSSLSKLFST 71
           +R++VGGLG +  +DDL K+F  +G V ++V+FVRTK R FAY+DF PSS  SL+KLFST
Sbjct: 10  VRLHVGGLGESVGKDDLLKIFSPMGSV-DSVEFVRTKGRSFAYIDFSPSSDKSLTKLFST 69

Query: 72  YNGCAWKGGKLRLEKAKENYLARLRREWEEDAQITDSNVGAGMKVVAPEFTEYVTKSEHI 131
           YNGC WKGGKLRLEKAKE+YLARLRR WEE A   D  + A          E  T S  I
Sbjct: 70  YNGCIWKGGKLRLEKAKEHYLARLRRGWEESASTPDDTIKA---------PEKFTPSTQI 129

Query: 132 QIFFPSLGEVKSLPISGTGTHKYDFPHVEVP-PLPVHFCDCEEHNVSAPTGNFKDTKTRD 191
            IF P L +VKSLP+SGTG HKY F  V VP  LP  FCDCEEH+ S      ++T  RD
Sbjct: 130 NIFIPRLRKVKSLPLSGTGKHKYSFQRVSVPSSLPKTFCDCEEHSASLTP---RETHVRD 189

Query: 192 LNAEDGGMDEDEIKMMNAVLNKLFERQEASQSNCNGSMAHNDKHNSTTLIDNQLLEDIKE 251
           L A + G++E+E+ +MN+V++KLFE  +            ND +++    DN +L D   
Sbjct: 190 LEALNVGINEEEVNIMNSVMSKLFENND------------NDNNDNN---DNLILND--- 249

Query: 252 DSDEDNLVLNVVASNCNSKSMPLNSGNKSFKAHGNSKVKNHATITTITVLSGAA--RDQK 311
             + DNL++NVV++  +     L+  ++  K             +T+   SGA     +K
Sbjct: 250 --NNDNLIINVVSNGNDMMDSELDILSRKRK-------------STLNEASGAGYIEGRK 309

Query: 312 NNSRVQSKKRKSVTSEEFDGNESVPSISTSY--GGTDPSYDPARSSRPQAPDRGPLIQPS 371
            N+    KKR+S+  E+    ES  +IS        DP+      SR     R      S
Sbjct: 310 GNNVHPKKKRQSIILEKNGRQESSQTISEKKKPSEVDPNKSTDEPSRKIGVKRS-TDNIS 369

Query: 372 RSQKSSWKTLIHDKNNVSFSISDILSSVTSANEGQ-AEAEADYLNLAHSTSIRNSDLATA 431
            SQKSSWK L+ + N+  F++S  L  V S    Q A    D   L    +    D+  +
Sbjct: 370 WSQKSSWKALVANGNSNHFTVSSFLPGVGSNKAVQSASGNTDLAGLPSQENSEEFDVPNS 429

Query: 432 AELGSKTEEIQSQKINVSFTVTDVLPAVPSAD-----------QEEAASADLNLAHSTPN 491
            E  + T+  ++++  ++ T+  +   V + D             EA   + +  +   N
Sbjct: 430 TERPTVTKIKKTKRKRITSTI--IAENVAAEDDIEKNDIVTDISVEAEPLEASTENDCEN 489

Query: 492 RNTDFAADP-ISKSKSEEIKSVE---------SFPEAVCAVPNVTSNKGRGSSWRQKSSW 551
            N +   D  +++  + E +S++            EA  A     S    GSSW Q++SW
Sbjct: 490 DNLNVETDENVAEDLNAEKESLDLKDNVDHDVDKDEAGKASLEARSKSTGGSSWLQRASW 549

Query: 552 TQLVSEEITSFSITQILPNNPSEKQV------QGESDVINVNLSARSETNASKQRDSQCI 611
           TQLVS+   SFSITQ+ P+  S+K         G+    N N S     +  KQRD+ C 
Sbjct: 550 TQLVSDRSASFSITQLFPDLASDKNEAARVNNNGDGQFSNFNQS----ESGMKQRDNSCS 609

Query: 612 AEDGSAAIVIRKDETAWNNVKKNEPPAVEENKPSPAEIIDSNLPQVGSFDVNSGETCPFM 666
                AA  ++ D T   ++++N      +N     ++  + +P      + SGETC FM
Sbjct: 610 TASFEAA-GVQVDSTPVRSLEENRQSLKGKNVREGCKLA-AKMPI--RRKIGSGETCTFM 639

BLAST of ClCG03G011600 vs. TrEMBL
Match: V7AY66_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_009G217200g PE=4 SV=1)

HSP 1 Score: 318.5 bits (815), Expect = 1.9e-83
Identity = 244/673 (36.26%), Positives = 344/673 (51.11%), Query Frame = 1

Query: 1   MEEEEESASKKMRIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVRTKSRCFAYVDFFPS 60
           M EE E     +RI+VGGL  A + +DLR +F S+G  V+AV  +RTK R FAY+DF  S
Sbjct: 1   MAEEAEETINAVRIFVGGLAEAVSAEDLRSLFSSLG-TVQAVQTIRTKGRSFAYIDF-QS 60

Query: 61  SQSSLSKLFSTYNGCAWKGGKLRLEKAKENYLARLRREWEEDAQITDSNVGAGMKVVAPE 120
           +  SLSKLFS YNGC WKGGKLRLEKAKE+YLAR++REWE DA + D+            
Sbjct: 61  NPKSLSKLFSKYNGCLWKGGKLRLEKAKEDYLARMKREWEHDA-LDDATQPPPSSPKDTT 120

Query: 121 FTEYVTKSEHIQIFFPSLGEVKSLPISGTGTHKYDFPHVEVPPLPVHFCDCEEHNVSAPT 180
                + ++H+ IFFP L +VK++P SGTG HKY F +++VPPLPVHFCDCEEH   +P 
Sbjct: 121 GDSSKSNTKHLNIFFPRLRKVKTIPFSGTGKHKYSFQNIKVPPLPVHFCDCEEH--CSPF 180

Query: 181 GNFKDTKTRDLNAEDGGMDEDEIKMMNAVLNKLFERQEASQSNCNGSMAHNDKHNSTTLI 240
              K     +  AE GGM+++EI +MNAV+NKL ++++ S +   G     D + S    
Sbjct: 181 VTEKGKLYINGAAERGGMNDEEISIMNAVMNKLLQKEKVSSAEKLGK--EKDSYKS---- 240

Query: 241 DNQLLEDIKEDSDEDNLVLNVVASNCNSKSMPLNSGNKSFKAHGNSKVKNHATITTITV- 300
            + L  D    +DED+L++N+      +  +    GN+  +    +K ++   IT I   
Sbjct: 241 PDALQSDEDSATDEDDLIINMEMKTNKTALI----GNQELERILENK-ESWLNITKIAKE 300

Query: 301 ---LSGAARDQKNNSRV-QSKKRKSVTSEEFDGNESVPSISTSYGGTDPSYDPARSSRPQ 360
               S    +++NNS + ++KKRKS+ + E + N       T+ GG     +   S  P+
Sbjct: 301 EPNKSMPQVEKRNNSNLNKNKKRKSIPALEMESN-----AETTPGGKGNMQNKVGSGDPE 360

Query: 361 APDRGPLIQPSRSQKSSWKTLIHDKNNVSFSISDILSSVTSANEGQAEAEADYLNLAHST 420
               G L + S SQKSSWK L+    N SFS S IL  + S+   Q    +D L    ST
Sbjct: 361 -DVFGELTKVSWSQKSSWKELLGGGGNTSFSASLILPKLDSSKNLQT---SDDLCAPLST 420

Query: 421 SIRNSDLATAAELGSKTEEIQSQKINVSFTVTDVLPAVPSADQEEAASADLNLAHSTPNR 480
           + +  ++    EL S+    Q  K                 DQ EA   +  +       
Sbjct: 421 NNKTENMERDRELWSRPTNTQVIK-----------------DQTEAQPTNTQVI------ 480

Query: 481 NTDFAADPISKSKSEEIKSVESFPEAVCAVPNVTSNKGRGSSWRQKSSWTQLVSEEITSF 540
               AA P +K   + IK V      V   PN T   GRG+SW QK SWTQ+VSE   SF
Sbjct: 481 KDHTAAQPTNK---QVIKDVTENQHDV--APNKT---GRGASWLQKQSWTQMVSENNNSF 540

Query: 541 SITQILPNNPSEKQVQGESDVINVNLSARSETNASKQRDSQCIAEDGSAAIVIRKDETAW 600
           SI+ ILP   +  + + +  ++   +S   + N   +     +  DG      +  ET  
Sbjct: 541 SISHILP-GITFPEPKAKDPIVEPAISNYFKHNGVAKDTINNVVSDG-----FKSRETIQ 600

Query: 601 NN---VKKNE---PPAVEEN-KPSPAEIIDSNLPQVGSFDVNSGETCPFMRNSRSVAEWT 660
            N   +  N+    P VEE  + SP E    N        +  GETC FMR++ S+ EW 
Sbjct: 601 ENSQYIGANDIAFAPVVEEKVETSPREKSTEN--------IEIGETCSFMRSAASMKEWA 603

Query: 661 KIKAALSGGSKKK 662
           K KAA+SG  K+K
Sbjct: 661 KAKAAVSGSLKRK 603

BLAST of ClCG03G011600 vs. TAIR10
Match: AT5G58130.1 (AT5G58130.1 RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 247.7 bits (631), Expect = 2.1e-65
Identity = 173/492 (35.16%), Positives = 251/492 (51.02%), Query Frame = 1

Query: 4   EEESASKKMRIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVRTKSRCFAYVDFFPSSQS 63
           EE+S+   +R++VGGLG +   DDL K+F  +G  V+AV+FVRTK R FAY+DF PSS +
Sbjct: 2   EEKSSGGGVRLHVGGLGESVGRDDLLKIFSPMG-TVDAVEFVRTKGRSFAYIDFSPSSTN 61

Query: 64  SLSKLFSTYNGCAWKGGKLRLEKAKENYLARLRREWEEDAQITDSNVGAGMKVVAPEFTE 123
           SL+KLFSTYNGC WKGG+LRLEKAKE+YLARL+REWE  +  +D+       + AP  + 
Sbjct: 62  SLTKLFSTYNGCVWKGGRLRLEKAKEHYLARLKREWEAASSTSDNT------IKAPSDSP 121

Query: 124 YVTKSEHIQIFFPSLGEVKSLPISGTGTHKYDFPHVEV-PPLPVHFCDCEEHNVSAPTGN 183
             T   H+ IFFP L +VK +P+SGTG HKY F  V V   LP  FCDCEEH+ S+ T  
Sbjct: 122 PAT---HLNIFFPRLRKVKPMPLSGTGKHKYSFQRVPVSSSLPRSFCDCEEHSNSSLTP- 181

Query: 184 FKDTKTRDLNAEDGGMDEDEIKMMNAVLNKLFERQEASQSNCNGSMAHNDKHNSTTLIDN 243
            ++    DL A + G  E E+ +MN+V+NKLFE+                          
Sbjct: 182 -REIHLHDLEAVNVGRQEAEVNVMNSVMNKLFEKNNVDPE-------------------- 241

Query: 244 QLLEDIKEDSDEDNLVLNVVASNCNSKSMPLNSGNKSFKAHGNSKVKNHATITTITVLSG 303
              ED + ++D+DNL++N VAS+ N     L+  ++  K+  N K  +           G
Sbjct: 242 ---EDNEIEADQDNLIIN-VASSGNDMDSALDMLSRKRKSILNKKTPSE---------EG 301

Query: 304 AARDQKNNSRVQSKKRKSVTSEEFDGNESVPSISTSYGGTDPSYDPARSSRPQAPDRGPL 363
            +  +K N    SK R++++ EE    ES  +I      ++   D +     +  D    
Sbjct: 302 YSEGRKGNLTHPSKNRQTISLEETGRQESSQAIRGKKKPSEVVPDKSSDEPSRTKDLEQS 361

Query: 364 IQP-SRSQKSSWKTLIHDKNNVSFSISDILSSVTSANEGQAEAEADYLNLAHSTSIRNSD 423
           I   S SQKSSWK+L+ + N+  FS+S  L  V S+              A   + RN+D
Sbjct: 362 IDNISWSQKSSWKSLMANGNSNDFSVSSFLPGVGSSK-------------AVQPAPRNTD 418

Query: 424 LATAAELGSKTEEIQSQKINVSFTVTDVLPAVPSADQEEAASADLNLAHSTPNRNTDFAA 483
           LA     G  + E   +K       + ++            + DL ++      ++D  A
Sbjct: 422 LA-----GLPSRENLKKKTKRKRVTSTIM------------AEDLPVSDDIKRDDSDTMA 418

Query: 484 DPISKSKSEEIK 494
           D I +  S+ ++
Sbjct: 482 DDIERDDSDAVE 418


HSP 2 Score: 83.6 bits (205), Expect = 5.1e-16
Identity = 62/198 (31.31%), Positives = 97/198 (48.99%), Query Frame = 1

Query: 471 STPNRNTDFAADPISKSKSEEIKSVESFPEAVCAVPNVTSNKGRG-SSWRQKSSWTQLVS 530
           S   ++ + A D  ++ +S  +K      E     P   SNK  G SSW QK+SWTQLVS
Sbjct: 553 SNVEKHENVAEDLNAEKESLVVKENVVDEEEAGKGPLKASNKSTGGSSWLQKASWTQLVS 612

Query: 531 EEITS-FSITQILPNNPSEK-QVQGESDVINVNLSARSETNASKQRDSQCIAEDGSAAIV 590
           ++ TS FSITQ+ P+  S+K +  G  + +    S  ++T ++ ++     +  G  A  
Sbjct: 613 DKNTSSFSITQLFPDLTSDKGEAAGVINNVGNQFSNSNQTASAMKQTDYASSSGGFVAAG 672

Query: 591 IRKDETAWNNVKKNEPPAVEENKPSPAEIIDSNLPQVGSFDVNSGETCPFMRNSRSVAEW 650
           +  D T   ++ +N      +N    A++      ++    V SG+TC FMR+S S+ EW
Sbjct: 673 VPVDSTPVRSLDENRQRLNGKNVSEGAKL--GAKKKIIKRKVGSGDTCTFMRSSTSLKEW 732

Query: 651 TKIKAALSGGSKKKKQRQ 666
            K K ALS   +KK   +
Sbjct: 733 AKAKKALSEPRRKKNSEE 748


HSP 3 Score: 39.3 bits (90), Expect = 1.1e-02
Identity = 42/184 (22.83%), Positives = 75/184 (40.76%), Query Frame = 1

Query: 225 NGSMAHNDKHNST---TLIDNQLLEDIKED-SDEDNLVLNVVASNCNSKSMPLN--SGNK 284
           + SMA +D  ++    T ID+   +   +D   +D+  L    S+ + +++PL   +  +
Sbjct: 486 SNSMAESDDGDNVEDDTAIDSMCDDTANDDVGSDDSGSLADTVSDTSVEAVPLEFVANTE 545

Query: 285 SFKAHGNSKVKNHATITTITVLSGAARDQKNNSRVQSKKRKSVTSEEFDGNESVPSISTS 344
                G S V+ H  +           +  N  +     +++V  EE  G   + + + S
Sbjct: 546 GDSVDGKSNVEKHENVA----------EDLNAEKESLVVKENVVDEEEAGKGPLKASNKS 605

Query: 345 YGGTDPSYDPARSSRPQAPDRGPLIQPSRSQKSSWKTLIHDKNNVSFSISDILSSVTSAN 403
            GG+                       S  QK+SW  L+ DKN  SFSI+ +   +TS +
Sbjct: 606 TGGS-----------------------SWLQKASWTQLVSDKNTSSFSITQLFPDLTS-D 635
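
The percentages on each HSP statistics line follow directly from the counts printed beside them (matches divided by alignment length). As a minimal sketch, assuming the line layout used in these reports, the values can be re-derived as follows; the names `HSP_STATS` and `parse_hsp_stats` are illustrative and not part of any BLAST tool.

```python
import re

# Loose pattern for one HSP statistics line. The second label is matched
# loosely (Pos\w+) because its spelling varies between report renderings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed percentages for one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    ident, length, pos, pos_length = map(int, (ident, length, pos, pos_length))
    return {
        "identical": ident,
        "positive": pos,
        "aligned_length": length,
        # Recomputed from the counts, rounded like the report (two decimals).
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / pos_length, 2),
        # Percentages as printed in the report, kept for comparison.
        "reported": (float(ident_pct), float(pos_pct)),
    }


line = "Identity = 173/492 (35.16%), Positives = 251/492 (51.02%), Query Frame = 1"
stats = parse_hsp_stats(line)
print(stats["identity_pct"], stats["positives_pct"])  # 35.16 51.02
```

Comparing the recomputed values with `stats["reported"]` gives a quick consistency check when these lines are copied by hand.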

BLAST of ClCG03G011600 vs. NCBI nr
Match: gi|659085871|ref|XP_008443653.1| (PREDICTED: uncharacterized protein LOC103487200 [Cucumis melo])

HSP 1 Score: 1029.2 bits (2660), Expect = 3.1e-297
Identity = 535/664 (80.57%), Positives = 571/664 (85.99%), Query Frame = 1

Query: 3   EEEESASKKMRIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVRTKSRCFAYVDFFPSSQ 62
           E  +SAS+KMRIYVGGLGAA TEDDLRKVF SVGGVVEAVDFVRTKSR FAYVDFFPSSQ
Sbjct: 2   ERGQSASEKMRIYVGGLGAAMTEDDLRKVFHSVGGVVEAVDFVRTKSRSFAYVDFFPSSQ 61

Query: 63  SSLSKLFSTYNGCAWKGGKLRLEKAKENYLARLRREWEEDAQITDSNVGAGMKVVAPEFT 122
           SSLSKLFSTYNGCAWKGGKLRLEKAKENYLARL+REWEEDAQI DSNVGA M+VVAPE T
Sbjct: 62  SSLSKLFSTYNGCAWKGGKLRLEKAKENYLARLKREWEEDAQIRDSNVGADMEVVAPEST 121

Query: 123 EYVTKSEHIQIFFPSLGEVKSLPISGTGTHKYDFPHVEVPPLPVHFCDCEEHNVSAPTGN 182
           ++VTKSEHI IFFPSLGEVKSLPISGTGTHKYDFPHVEVPP PVHFCDCEEH+VS+P GN
Sbjct: 122 QHVTKSEHINIFFPSLGEVKSLPISGTGTHKYDFPHVEVPPFPVHFCDCEEHDVSSPIGN 181

Query: 183 FKDTKTRDLNAEDGGMDEDEIKMMNAVLNKLFERQEASQSNCNGSMAHNDKHNSTTLIDN 242
            KDTKTRDLNAE+GGM EDEI+MMNAV+NKLFER+EASQSNCNGSMA NDKHNST L DN
Sbjct: 182 SKDTKTRDLNAENGGMAEDEIEMMNAVMNKLFEREEASQSNCNGSMALNDKHNSTMLTDN 241

Query: 243 QLLEDIKEDSDEDNLVLNVVASNCNSKSMPLNSGNKSFKAHGNSKVKNHATITTITVLSG 302
           QLLED K D DEDNLVLNV+ASNCNSKSM LNSGNK FKAHGNSK               
Sbjct: 242 QLLEDNKVDCDEDNLVLNVMASNCNSKSMALNSGNKIFKAHGNSK--------------D 301

Query: 303 AARDQKNNSRVQSKKRKSVTSEEFDGNESVPSISTSYGGTDPSYDPARSSRPQAPDRGPL 362
           A RDQKNN RVQ KKRKS  SEEFDGNESVPSI TS GGTDPSYDPARSSRPQAPDRGP 
Sbjct: 302 AVRDQKNNCRVQGKKRKSFLSEEFDGNESVPSIFTSNGGTDPSYDPARSSRPQAPDRGPP 361

Query: 363 IQPSRSQKSSWKTLIHDKNNVSFSISDILSSVTSANEGQAEAEADYLNLAHSTSIRNSDL 422
           +Q  RSQKS WKTLI DK+NVSF ISDIL SV SANE   ++EAD L++AHST  +NSDL
Sbjct: 362 VQSLRSQKSLWKTLIRDKSNVSFCISDILCSVPSANE--EKSEADDLSIAHSTPNKNSDL 421

Query: 423 ATAAELGSKTEEIQSQKINVSFTVTDVLPAVPSADQEEAASADLNLAHSTPNRNTDFAAD 482
           A AA LGSKT+EIQS KINVSF +T+VLP+VPSADQEEAASADLNLAHSTPN NTD  AD
Sbjct: 422 ARAAVLGSKTDEIQSGKINVSFNITEVLPSVPSADQEEAASADLNLAHSTPNINTDVGAD 481

Query: 483 PISKSKSEEIKSVESFPEAVCAVPNVTSNKGRGSSWRQKSSWTQLVSEEITSFSITQILP 542
           PISKSKSEE+KSVESF +A C VPNV SNKGRGSSWRQKSSWTQLVSEEITSFSITQILP
Sbjct: 482 PISKSKSEEMKSVESFLDAQCTVPNVNSNKGRGSSWRQKSSWTQLVSEEITSFSITQILP 541

Query: 543 NNPSEKQVQGESDVINVNLSARSETNASKQRDSQCIAEDGSAAIVIRKDETAWNNVKKNE 602
           NN S KQVQGE+   N N S  SETNA K++DS+CIAED S A VI KDE   N+VKKNE
Sbjct: 542 NNTSGKQVQGEAGASNANFSLWSETNAPKKQDSECIAEDESTAFVIGKDEIDSNDVKKNE 601

Query: 603 PPAVEENKPSPAEIIDSNLPQV-GSFDVNSGETCPFMRNSRSVAEWTKIKAALSGGSKKK 662
           P AV+E +  P +II+SNLPQ  GSFDV SGETCPFMRNS+SVAEWTKIKAALSGGSKKK
Sbjct: 602 PQAVQECETCPTQIIESNLPQQGGSFDVISGETCPFMRNSQSVAEWTKIKAALSGGSKKK 649

Query: 663 KQRQ 666
           KQRQ
Sbjct: 662 KQRQ 649

BLAST of ClCG03G011600 vs. NCBI nr
Match: gi|778664026|ref|XP_004139156.2| (PREDICTED: uncharacterized protein LOC101203716 [Cucumis sativus])

HSP 1 Score: 993.8 bits (2568), Expect = 1.4e-286
Identity = 521/672 (77.53%), Positives = 563/672 (83.78%), Query Frame = 1

Query: 3   EEEESASKKMRIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVRTKSRCFAYVDFFPSSQ 62
           E+ +SAS+ MRIYVGGLGAA TEDDLRKVF SVGGVVEAVDFVRTKSR FAYVDFFPSSQ
Sbjct: 2   EKGQSASENMRIYVGGLGAAMTEDDLRKVFHSVGGVVEAVDFVRTKSRSFAYVDFFPSSQ 61

Query: 63  SSLSKLFSTYNGCAWKGGKLRLEKAKENYLARLRREWEEDAQITDSNVGAGMKVVAPEFT 122
           SSLSKLFSTYNGCAWKGGKLRLEKAKENYLARL REWEEDAQI D+NVGA M++VAPE T
Sbjct: 62  SSLSKLFSTYNGCAWKGGKLRLEKAKENYLARLNREWEEDAQIRDNNVGADMELVAPEST 121

Query: 123 EYVTKSEHIQIFFPSLGEVKSLPISGTGTHKYDFPHVEVPPLPVHFCDCEEHNVSAPTGN 182
           E+VTKSEHI IFFPSLGEVK LPISGTGTHKYDFPHVEVPP PVHFCDCEEHN S+P GN
Sbjct: 122 EHVTKSEHINIFFPSLGEVKPLPISGTGTHKYDFPHVEVPPFPVHFCDCEEHNASSPIGN 181

Query: 183 FKDTKTRDLNAEDGGMDEDEIKMMNAVLNKLFERQEASQSNCNGSMAHNDKHNSTTLIDN 242
            K TKTRDLNAE+GGMDEDEIKMMNAVL+KLFER+EASQSNCN SMA NDKHNSTT  DN
Sbjct: 182 SKYTKTRDLNAENGGMDEDEIKMMNAVLSKLFERKEASQSNCNDSMALNDKHNSTTSTDN 241

Query: 243 QLLEDIKEDSDEDNLVLNVVASNCNSKSMPLNSGNKSFKAHGNSKVKNHATITTITVLSG 302
           QLLED K DSDEDNLVLNV+ASNCNSK+M LN GNK FKAHGNSK               
Sbjct: 242 QLLEDNKVDSDEDNLVLNVMASNCNSKTMALNRGNKIFKAHGNSK--------------D 301

Query: 303 AARDQKNNSRVQSKKRKSVTSEEFDGNESVPSISTSYGGTDPSYDPARSSRPQAPDRGPL 362
           A RDQKNN RVQSKKRKS  SEEFDGNESVPSI TS  GTDPSYDPARSSRPQAPDRGP 
Sbjct: 302 AVRDQKNNCRVQSKKRKSFISEEFDGNESVPSIFTSNRGTDPSYDPARSSRPQAPDRGPP 361

Query: 363 IQPSRSQKSSWKTLIHDKNNVSFSISDILSSVTSANEGQAEAEADYLNLAHSTSIRNSDL 422
           +Q  RSQKSSWKTLI DK+NVSF ISDILSSV SANE   +AEAD LN+AHST  RNS+L
Sbjct: 362 VQSLRSQKSSWKTLIRDKSNVSFCISDILSSVPSANE--EKAEADDLNIAHSTPNRNSNL 421

Query: 423 ATAAELGSKTEEIQSQKINVSFTVTDVLPAV--------PSADQEEAASADLNLAHSTPN 482
           A+ A LGS+ +EIQS KINV F++TDVLP V         SADQE+AASADLNLAHSTPN
Sbjct: 422 ASTAVLGSEIDEIQSGKINVPFSITDVLPLVLSADQEKAASADQEKAASADLNLAHSTPN 481

Query: 483 RNTDFAADPISKSKSEEIKSVESFPEAVCAVPNVTSNKGRGSSWRQKSSWTQLVSEEITS 542
            NTD  ADPISKSKSEE++SVESF +A C VPNVT NKGRGSSWR+KSSWTQLVSEE TS
Sbjct: 482 INTDVGADPISKSKSEEMESVESFQDAQCTVPNVTLNKGRGSSWRKKSSWTQLVSEEFTS 541

Query: 543 FSITQILPNNPSEKQVQGESDVINVNLSARSETNASKQRDSQCIAEDGSAAIVIRKDETA 602
           FSITQILPN+ SE QVQGES  IN N SA SETNA +++DS+CIA+D S A VI K E  
Sbjct: 542 FSITQILPNSTSENQVQGESGDINANFSAWSETNAPRKQDSECIAKDESTAFVIGKGEIG 601

Query: 603 WNNVKKNEPPAVEENKPSPAEIIDSNLP-QVGSFDVNSGETCPFMRNSRSVAEWTKIKAA 662
            N+VK+NEP AV+E +  P +I +SN P Q GSFD  SG+TCPFMRNS+SVAEWTKIKAA
Sbjct: 602 CNDVKQNEPQAVQECETCPTQITESNFPQQEGSFDEISGDTCPFMRNSQSVAEWTKIKAA 657

Query: 663 LSGGSKKKKQRQ 666
           LSGGSKKKKQRQ
Sbjct: 662 LSGGSKKKKQRQ 657

BLAST of ClCG03G011600 vs. NCBI nr
Match: gi|590582319|ref|XP_007014591.1| (RNA-binding family protein, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 334.0 bits (855), Expect = 6.2e-88
Identity = 234/692 (33.82%), Positives = 368/692 (53.18%), Query Frame = 1

Query: 7   SASKKMRIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVRTKSRCFAYVDFFPSSQSSLS 66
           +A+   RI+VGGLG + + DDLRKVF +VG  VE +D +R K R FAYVD  PSS +SLS
Sbjct: 2   AATASTRIHVGGLGQSVSSDDLRKVFSAVG-TVEGLDIIRAKGRSFAYVDILPSSSNSLS 61

Query: 67  KLFSTYNGCAWKGGKLRLEKAKENYLARLRREW-EEDAQITDSNVGAGMKVVAPEFTEYV 126
           KLF+TYNGC WKGGKL+L KAKE+YL RL+REW +E+ +     + +           +V
Sbjct: 62  KLFNTYNGCVWKGGKLKLGKAKEHYLTRLKREWAKEEEEAHHQPMPSSSDEPYNGNKVHV 121

Query: 127 TKSEHIQIFFPSLGEVKSLPISGTGTHKYDFPHVEVPPLPVHFCDCEEHNVSAPTGNFKD 186
           ++  H++IFFP L  VKSLP+SGTG HKY F  VEV  LP+HFCDCEEH     +G+F  
Sbjct: 122 SQQGHLRIFFPRLTRVKSLPLSGTGKHKYSFQRVEVSALPIHFCDCEEH-----SGHFNA 181

Query: 187 TKTRD-LNAED--GGMDEDEIKMMNAVLNKLFERQEASQSNCNGSMAHNDKHNSTTLIDN 246
            + ++  N E+  G M+E+E+ MM++V+NKLFER  A+ SN + ++  +++ + T LI+ 
Sbjct: 182 VRRKEGQNHEEINGVMNEEEVSMMSSVMNKLFER--ANISNTSSAILADEREDFTKLIEG 241

Query: 247 QLLEDIKEDSDEDNLVLNVVASNCNSKSMPLNSGNKSFKAHGNSKVKNHAT-ITTITVLS 306
            L ++  E++D+D+L++NVV+ + N  +M   SG++  KA    K     T I+    + 
Sbjct: 242 PLSDE--EETDDDDLIINVVSDSNNRAAM---SGSREKKAVSTEKTGLGETHISNYGAIR 301

Query: 307 GAARDQKNNSRVQSKKRKSVTSEEFDGNESVPSISTSYGGTDPSYDPARSSRPQAPDRGP 366
            A + Q+NN+    KKRK + ++E D ++ V         ++     A     +A +   
Sbjct: 302 SACKVQENNTLHPRKKRKPLPNKEEDKHQLVSLFHGQRRNSEFDDSNADFEENEANEDNL 361

Query: 367 LIQ-PSRSQKSSWKTLIHDKNNVS--FSISDILSSVTSANEGQAEAEADYLNLAHSTSIR 426
           +I   S + K S +T + DK ++   F  S+  +S     + + + + D + L +     
Sbjct: 362 MINIVSMANKRSGRTKL-DKVSLKQRFKSSEKQTSEDGPIQNEHKVQKDDILLPNRNEKG 421

Query: 427 NSDL----------ATAAELGSKTEE-------------IQSQKINVSFTVTDVLPAVPS 486
           N              T AE G K                +   + N +F+++++L  V +
Sbjct: 422 NVQTQSNESVVVAQTTGAECGLKQSNTSCSWSQKSSWRALVGDRSNSAFSLSNILQNVGT 481

Query: 487 ADQEEAASADLNLAHSTPNRNTDFAADPISKSK--SEEIKSVESFPEAVCAVPNVTSNKG 546
             +++  S    +  +  +RN + A     +      EI  VE  P         +SN G
Sbjct: 482 TKEKQQISDGCKVNKTLDSRNGNLAKPKNLEGMLGKTEIVDVEPQPN---QPKTASSNSG 541

Query: 547 RGSSWRQKSSWTQLVSEEITSFSITQILPNNPSEKQVQGESDVINVNLSARSETNASKQR 606
           RGSSW  KSSW QLVSE  +SFSI++I+P + ++++         V  +  + +N +K  
Sbjct: 542 RGSSWLHKSSWMQLVSENSSSFSISEIVPGSTTKQECTKPIYEDVVYSADGNHSNKTKSH 601

Query: 607 DSQCIAEDGSAAIVIRKDETAWNNVKKNEPPAVEENKPSPAEIID---SNLPQVGSFDVN 663
            S+     GS A+ +RK+     ++ ++    V  N  +    ++   S   +    D +
Sbjct: 602 KSEPTVY-GSPALGVRKEGDTVRSIPESNQQTVVGNTDASVPTVEKCNSEPDKAFGGDTS 661

BLAST of ClCG03G011600 vs. NCBI nr
Match: gi|1012335157|gb|KYP46495.1| (Nucleolar protein 8 [Cajanus cajan])

HSP 1 Score: 325.1 bits (832), Expect = 2.9e-85
Identity = 230/671 (34.28%), Positives = 350/671 (52.16%), Query Frame = 1

Query: 1   MEEEEESASKKMRIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVRTKSRCFAYVDFFPS 60
           MEEE +     +RI+VGGLG A +++DLR +F S+G  V+ +  +RTK R FAY+DF  +
Sbjct: 1   MEEEAKETQSAVRIFVGGLGEAVSDEDLRTLFSSLG-TVQTIRTIRTKGRSFAYLDFL-T 60

Query: 61  SQSSLSKLFSTYNGCAWKGGKLRLEKAKENYLARLRREWEEDAQITDSNVGAGMKVVAPE 120
              SLSKLFS YNGC WKGG+L+LEKAKE+YL RL+REWE++A        A      P 
Sbjct: 61  DPKSLSKLFSKYNGCLWKGGRLKLEKAKEDYLTRLKREWEQEAL-------ADATQPPPV 120

Query: 121 FTEYVTKSEHIQIFFPSLGEVKSLPISGTGTHKYDFPHVEVPPLPVHFCDCEEHNVSAPT 180
             + V KS  + +FFP L +VKS+P +GTG HKY F +++VPPLPVHFCDCEEH   +P 
Sbjct: 121 VPQEVPKS--LSVFFPRLRKVKSIPFNGTGKHKYSFQNIKVPPLPVHFCDCEEH--CSPF 180

Query: 181 GNFKDTKTRDLNAEDGGMDEDEIKMMNAVLNKLFERQEASQSNCNGSMAHNDKHNSTTLI 240
            N ++ ++ D  AE GGM+++EI +MNAV+NKLFE+++ S +   G     D   S   +
Sbjct: 181 VNEREKQSIDGAAESGGMNDEEINIMNAVMNKLFEKEKVSNAKNLGE--EKDSFESPDAL 240

Query: 241 DNQLLEDIKEDSDEDNLVLNVVASNCNSKSMPLNSGNKSF-KAHGNSKVKNHATITTITV 300
            +   ED    +DED+L++N+       K+     GN+   K   N +  N   +     
Sbjct: 241 HSDECED--SATDEDDLIINMQ----TRKNKTALIGNQELEKILENQEWSNKTKVD---- 300

Query: 301 LSGAARDQKNNSRVQSKKRKSVTSEEFDGNESVP--SISTSYGGTDP----SYDPARSSR 360
                +++ N S  Q +KR +    +    +S+P   +ST+ GG       S +    ++
Sbjct: 301 -----KEEPNKSTPQVQKRNNSNPAKNKKRKSLPKLEVSTTSGGKSNMQTLSDEEGSDAQ 360

Query: 361 PQ--APDRGPLIQPSRSQKSSWKTLIHDKNNVSFSISDILSSVTSANEGQAEAEADYLNL 420
           P     D   L + S SQKSSW+ L+ D  N SFS S IL  + S   G+ +  +++L+ 
Sbjct: 361 PTELEDDVEELTKVSWSQKSSWRELLGDGGNTSFSASLILPELDS---GKNQQSSEHLST 420

Query: 421 AHSTSIRNSDLATAAELGSKTEEIQSQKINVSFTVTDVLPAVPSADQEEAASADLNLAHS 480
           + ST     ++ +   L SK+   Q         + ++    P+ +Q     A+      
Sbjct: 421 SISTDSETENMESDGHLWSKSTNTQ--------VIKELAEVQPTNEQVIKELAE------ 480

Query: 481 TPNRNTDFAADPISKSKSEEIKSVESFPEAVCAVPNVTSNKGRGSSWRQKSSWTQLVSEE 540
                    A P +K   E++  +++       VPN T   GRG+SWR+K SWTQLVSE 
Sbjct: 481 ---------AQPANKQVIEDLNEMQNNNN---VVPNKT---GRGASWRRKQSWTQLVSEN 540

Query: 541 ITSFSITQILPNNPSEKQVQGESDVINVNLSARSETNASKQRDSQCIAEDGSAAIVIRKD 600
            +SFSI+ ILP     + +  E  ++   +S   + N   Q     +  DG  +  I  +
Sbjct: 541 NSSFSISHILPGITFPEPMAKE-PIMEPAISNDCKHNDVAQDTIDKVLSDGFNSREIIPE 600

Query: 601 ETAWNNVKKNEPPAVEENKPSPAEIIDSNLPQVGSFDVNSGETCPFMRNSRSVAEWTKIK 660
                      P +V E K      ++++  +    +V  GETC FMR++ S+ +W K K
Sbjct: 601 RIQHIGANDIVPGSVAEEK------VETSPRERSIENVEVGETCRFMRSAASLKDWAKAK 602

Query: 661 AALSGGSKKKK 663
           AA+S   K+K+
Sbjct: 661 AAISRSLKRKR 602

BLAST of ClCG03G011600 vs. NCBI nr
Match: gi|567177821|ref|XP_006401127.1| (hypothetical protein EUTSA_v10012942mg [Eutrema salsugineum])

HSP 1 Score: 324.7 bits (831), Expect = 3.7e-85
Identity = 240/687 (34.93%), Positives = 347/687 (50.51%), Query Frame = 1

Query: 12  MRIYVGGLGAATTEDDLRKVFQSVGGVVEAVDFVRTKSRCFAYVDFFPSSQSSLSKLFST 71
           +R++VGGLG +  +DDL K+F  +G V ++V+FVRTK R FAY+DF PSS  SL+KLFST
Sbjct: 10  VRLHVGGLGESVGKDDLLKIFSPMGSV-DSVEFVRTKGRSFAYIDFSPSSDKSLTKLFST 69

Query: 72  YNGCAWKGGKLRLEKAKENYLARLRREWEEDAQITDSNVGAGMKVVAPEFTEYVTKSEHI 131
           YNGC WKGGKLRLEKAKE+YLARLRR WEE A   D  + A          E  T S  I
Sbjct: 70  YNGCIWKGGKLRLEKAKEHYLARLRRGWEESASTPDDTIKA---------PEKFTPSTQI 129

Query: 132 QIFFPSLGEVKSLPISGTGTHKYDFPHVEVP-PLPVHFCDCEEHNVSAPTGNFKDTKTRD 191
            IF P L +VKSLP+SGTG HKY F  V VP  LP  FCDCEEH+ S      ++T  RD
Sbjct: 130 NIFIPRLRKVKSLPLSGTGKHKYSFQRVSVPSSLPKTFCDCEEHSASLTP---RETHVRD 189

Query: 192 LNAEDGGMDEDEIKMMNAVLNKLFERQEASQSNCNGSMAHNDKHNSTTLIDNQLLEDIKE 251
           L A + G++E+E+ +MN+V++KLFE  +            ND +++    DN +L D   
Sbjct: 190 LEALNVGINEEEVNIMNSVMSKLFENND------------NDNNDNN---DNLILND--- 249

Query: 252 DSDEDNLVLNVVASNCNSKSMPLNSGNKSFKAHGNSKVKNHATITTITVLSGAA--RDQK 311
             + DNL++NVV++  +     L+  ++  K             +T+   SGA     +K
Sbjct: 250 --NNDNLIINVVSNGNDMMDSELDILSRKRK-------------STLNEASGAGYIEGRK 309

Query: 312 NNSRVQSKKRKSVTSEEFDGNESVPSISTSY--GGTDPSYDPARSSRPQAPDRGPLIQPS 371
            N+    KKR+S+  E+    ES  +IS        DP+      SR     R      S
Sbjct: 310 GNNVHPKKKRQSIILEKNGRQESSQTISEKKKPSEVDPNKSTDEPSRKIGVKRS-TDNIS 369

Query: 372 RSQKSSWKTLIHDKNNVSFSISDILSSVTSANEGQ-AEAEADYLNLAHSTSIRNSDLATA 431
            SQKSSWK L+ + N+  F++S  L  V S    Q A    D   L    +    D+  +
Sbjct: 370 WSQKSSWKALVANGNSNHFTVSSFLPGVGSNKAVQSASGNTDLAGLPSQENSEEFDVPNS 429

Query: 432 AELGSKTEEIQSQKINVSFTVTDVLPAVPSAD-----------QEEAASADLNLAHSTPN 491
            E  + T+  ++++  ++ T+  +   V + D             EA   + +  +   N
Sbjct: 430 TERPTVTKIKKTKRKRITSTI--IAENVAAEDDIEKNDIVTDISVEAEPLEASTENDCEN 489

Query: 492 RNTDFAADP-ISKSKSEEIKSVE---------SFPEAVCAVPNVTSNKGRGSSWRQKSSW 551
            N +   D  +++  + E +S++            EA  A     S    GSSW Q++SW
Sbjct: 490 DNLNVETDENVAEDLNAEKESLDLKDNVDHDVDKDEAGKASLEARSKSTGGSSWLQRASW 549

Query: 552 TQLVSEEITSFSITQILPNNPSEKQV------QGESDVINVNLSARSETNASKQRDSQCI 611
           TQLVS+   SFSITQ+ P+  S+K         G+    N N S     +  KQRD+ C 
Sbjct: 550 TQLVSDRSASFSITQLFPDLASDKNEAARVNNNGDGQFSNFNQS----ESGMKQRDNSCS 609

Query: 612 AEDGSAAIVIRKDETAWNNVKKNEPPAVEENKPSPAEIIDSNLPQVGSFDVNSGETCPFM 666
                AA  ++ D T   ++++N      +N     ++  + +P      + SGETC FM
Sbjct: 610 TASFEAA-GVQVDSTPVRSLEENRQSLKGKNVREGCKLA-AKMPI--RRKIGSGETCTFM 639

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
SRP40_YEAST5.2e-1021.43Suppressor protein SRP40 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c... [more]
NOL8_MOUSE5.3e-0736.17Nucleolar protein 8 OS=Mus musculus GN=Nol8 PE=1 SV=2[more]
NOL8_HUMAN2.7e-0635.48Nucleolar protein 8 OS=Homo sapiens GN=NOL8 PE=1 SV=1[more]
SRAP_STAAN4.5e-0621.19Serine-rich adhesin for platelets OS=Staphylococcus aureus (strain N315) GN=sraP... [more]
SRAP_STAAM4.5e-0621.19Serine-rich adhesin for platelets OS=Staphylococcus aureus (strain Mu50 / ATCC 7... [more]
Match NameE-valueIdentityDescription
A0A0A0LXQ1_CUCSA1.0e-28677.53Uncharacterized protein OS=Cucumis sativus GN=Csa_1G651690 PE=4 SV=1[more]
A0A061GQW7_THECC4.3e-8833.82RNA-binding family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_039820 ... [more]
A0A151RVC0_CAJCA2.0e-8534.28Nucleolar protein 8 OS=Cajanus cajan GN=KK1_031900 PE=4 SV=1[more]
V4KX69_EUTSA2.6e-8534.93Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10012942mg PE=4 SV=1[more]
V7AY66_PHAVU1.9e-8336.26Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_009G217200g PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G58130.12.1e-6535.16 RNA-binding (RRM/RBD/RNP motifs) family protein[more]
Match NameE-valueIdentityDescription
gi|659085871|ref|XP_008443653.1|3.1e-29780.57PREDICTED: uncharacterized protein LOC103487200 [Cucumis melo][more]
gi|778664026|ref|XP_004139156.2|1.4e-28677.53PREDICTED: uncharacterized protein LOC101203716 [Cucumis sativus][more]
gi|590582319|ref|XP_007014591.1|6.2e-8833.82RNA-binding family protein, putative isoform 1 [Theobroma cacao][more]
gi|1012335157|gb|KYP46495.1|2.9e-8534.28Nucleolar protein 8 [Cajanus cajan][more]
gi|567177821|ref|XP_006401127.1|3.7e-8534.93hypothetical protein EUTSA_v10012942mg [Eutrema salsugineum][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000504  RRM_dom
IPR012677  Nucleotide-bd_a/b_plait_sf
Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
GO:0000166  nucleotide binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0008152      metabolic process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0000166      nucleotide binding
molecular_function  GO:0003723      RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG03G011600.1  ClCG03G011600.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description                             Source   Source Term        Source Description                Alignment
IPR000504  RNA recognition motif domain                PFAM     PF00076            RRM_1                             coord: 14..58; score: 7.
IPR000504  RNA recognition motif domain                SMART    SM00360            rrm1_1                            coord: 13..85; score: 1.
IPR000504  RNA recognition motif domain                PROFILE  PS50102            RRM                               coord: 12..89; score: 12
IPR012677  Nucleotide-binding alpha-beta plait domain  GENE3D   G3DSA:3.30.70.330                                    coord: 3..90; score: 1.1
IPR012677  Nucleotide-binding alpha-beta plait domain  unknown  SSF54928           RNA-binding domain, RBD           coord: 9..91; score: 4.43
None       No IPR available                            PANTHER  PTHR23099          TRANSCRIPTIONAL REGULATOR         coord: 3..257; score: 3.2
None       No IPR available                            PANTHER  PTHR23099:SF0      ACIDIC REPEAT-CONTAINING PROTEIN  coord: 3..257; score: 3.2
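
The domain-level hits in the table above all fall within the first 91 residues, so the span they share can be computed directly from the `coord` values. The following is a minimal sketch, assuming the coordinates are 1-based inclusive residue positions on the predicted protein; the names `HITS`, `parse_coord` and `common_span` are illustrative and do not belong to InterProScan.

```python
import re

# Coordinates copied from the InterPro table (source term -> "start..end").
# The two PANTHER rows are family-level matches and are left out here.
HITS = {
    "PF00076": "14..58",
    "SM00360": "13..85",
    "PS50102": "12..89",
    "G3DSA:3.30.70.330": "3..90",
    "SSF54928": "9..91",
}


def parse_coord(text):
    """Turn a 'start..end' string into a (start, end) tuple of ints."""
    m = re.fullmatch(r"(\d+)\.\.(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised coordinate: {text!r}")
    return int(m.group(1)), int(m.group(2))


def common_span(ranges):
    """Intersection of inclusive ranges, or None if they do not all overlap."""
    start = max(s for s, _ in ranges)
    end = min(e for _, e in ranges)
    return (start, end) if start <= end else None


ranges = [parse_coord(c) for c in HITS.values()]
core = common_span(ranges)
print(core)  # (14, 58)
```

Under these assumptions the span shared by all five hits is the PFAM match itself, which is the shortest of them.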