Csa1G002100 (gene) Cucumber (Chinese Long) v2

NameCsa1G002100
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionU3 small nucleolar ribonucleoprotein protein mpp10, putative; contains IPR007151 (Mpp10 protein)
LocationChr1 : 342277 .. 348282 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGCCCCCCAACCCTTCCCTCCGGTGAGTTCCGCCAGTTCTTAATTTCAACTCTTTCCGACGTTTTCCGGCTGCCTCAACGTGAATCAAGCGTTTTCGTCAGCTAATTTTTGACGTTGTCTACTGTCTTCCGACGACGTTTGCTACTAATTCATGTATACGTTACTCACTTCTTAAATTCTTCAAATATTCATATCTAAGCATATTGAAACGCCAACCAGCAAGCGTCGAGGCTGTTTGGCAGCTTTCAGTAGGTAATAACCCGCATTAATCGCTTTTGGGCTCAATTTAAGTTGTTTGAACCGTCCCTAACACATGGAAGCTATTGCTGTAGTTTTGGGGTGGTTTCGAATTCATTTGGATGTTTTTCAGCAAAAACTGAAGCCATTTTAGGTGGTAAAGTACTGTTTAGACCCTTTTGAAACCTCTGTAAGTGCTGTAATTTAGTTGCTAAAAAACTTCAAACTCTTATGCTTGTAGTGTGGTCTCGAGTTGTAGTGTTTGAATTAAGTTTAAGATCAAATTTGGGCAAGAACTTCCCTACTAGACACTCATTGGGACTGTTTAACGTTTCGAACTGAGTTTAGATATTAAACCTTGTTTTTATTGATTCAAAATCCTTTCAAATAATTTAGGAGCACTTATAGTAATTGGGATTCCATCTAGTGGATTTGTTTGTTGTGCATGTATATTGTGTTTTTTCGAATGGCCTGAAAATAGGGGGAGTTACCCACAAATCAAAATATCAAATCATTGAGCAACTCAAAATTGATTTAATCATTATTTTACATTGGTTTCGTTGTACTGAATTTATATATTGACTCTCAGTTCGAATAGAAATTAGAAAAAACAAATCTCATGGACTGTGGAAACACATGTAAAAGCTAGTTAACCCATAATATTGATGGTTTTTTCTTTTTGCTGCACCGAACATCCTTGAACTCCTCGCTAGGAATTTGATCTTGTTTCTTATTAGGCACCTGGGATTTGGAATGAGGAGCATAATGTCGAAAGATTAGTAATATGATGGAAATCCAATTGGTATTAACGTAGAAATATTATTGATAGTAGAAAGGGAATGAATTCCCGATTATAAGAATCGACTCAGACCTTCTGCAACCTATCTTTCAAGGTAGAACAGAGAGGGACTCTTTACTAGAATCTAACTAACTTTTCCGCCCTTCTATTTCATATATTTATATTAATTCCTAATTTCTTCCCCACTAACTAACTCACACGTGTTAATTCTTTTTCTTCTCTCTCCTGCTATATTGATGTATGATAGGAGGTCTATCATAATATTGGGTAATTTTAGTTTTTTTTCTCTAGTTAAATACACAATTTATTTTGTTTATGGATGGAGTGGAAGTGTGGAATTGAAAACAAATTTTCCAAAGTTTGGGATCCTGTCTAGTCTATCCTCAAAGGACAAGATTTCAAATTCCTTGCCTTTTTTCGGTAGGTCTTTACTGAAGTTTGGGATCCTGTCTGTTCTCCATGAAGGACACTCCACTATTTTTGATAAAGCTCAAGCATCTTTTTCTCACCTTCTATTTTTATTGAGCTTTCTAGCTACCATTGGAAGAAGTGTTTATTATTTATCACTAAGTTGTATTTCTGTTGACACGTTCTGTGATATGTAACAGCGTGCAAGGCCAGGGGATCTAATCGAATCACTGTGTTCACTTTCGGGTATGGCTTCTGTTAATTTGATTAGTTTTGATCGTTTATATCTCTATTTTCAAGAAATGTAAAGTAGTCTAACATTTATTTATTGCAGTTAATGGACTTATTTGAAATTTCAAATTATTATGAGATGTTGAATGAGGGAATTTTCTCTTTACAGAGCTCAATGGAGGCAGGGAAAGTGGTTCTTCCTAACATTGAAGCTGGTCTTAAACCACTTCACTCTCTCAAATCTACAGATCCGCCACTCTGGCTTGCTCCAAGCCCTTCCCTATCTCAAGTGGCTCGCCTTGCTTCACAGAGTTTATTTTCTATGCTGAAACCATTCAATCCTAAATCCCCATTTGATCATCTCCTGGTTGATGGATTTGATGCTGAGCAGATATGGCAACAAATTGACCTCCAATCACAACCTCTTTTGGCGAGTGTTCGCCGGGATTTGAAGCGGTTTGAGAAGAATCCAGAAGCAATTTCGAACCTGAAGGTTTCTTTGGAGGATAAGAAGAAGGTTATTCAAGAGATGGGTGTAGAGTCGGGAGAAGAAAGTGATGATTTTGAGGAGGATATGAAGGAGCTTGATGAAGAAGAAGAAGAAGACGATGAGGAAGATGAGGAGGAGGAAGAGGAGGACTGTGATGACAGAGAAGATGGAGACACTGAGGAAGGAGAGAAGGAAAAGAGTGATGATGAAGTTGAGGGAGAAGAAGGTAATGGTGGAATTGAGGATGGATTTTTGAAGCTAAAAGAGCTGGAGGAATTTATGGAGGAGGATGAGGTAAGAGAATATGGTTTACAGAAGAAGAAAGATGGTAAGAAGGAAAAGAAACCGAGGAAGACAGAAGAAGAATCTGATGACGACGAAGATGATGAGGTGAAGGCACATTGCTTAGTCAAACTGATCATAACATTTTTTTTACTAATGAGGGTTTATTTCTAATATTGTTTCTTGTTCCAGAATTTGTTTATTGAAGATATTTGCTGTTTTGGTAATTTAAAGTTGTTTCTTTCTTCCTTTGTAGCTTGAGGAGTTTGACCTCCATGGTGAGGAGGATGAAGATTCTAGCAAACTGGACAATGCGAGGTATGTGGAGTCATTTGTTCTTCCGCACGCTTATAAATGACAAACTCCTTAAACTGATGGAAGTACTCGATTGCATTGTTCATAGTTTCACTGTCATTTTACTAGTCTAGCGTAGTAGTAGTCTTCATTTTTAAAATAGTTGATTTATGGTTACTATCTAATGCAATAGACGACTCTGATACTAGTAGAATTTTTATGCACTTAAACTGACAGTAATCAGTAACTGATTTTAGTACAACTTATGTATATGGTTTGGAAAAGAAAACTTGAAGTTGTTGAGAAGGATACTTTGCATGCCAAAAGGATCCACTTACTTTTCACTTTTCAGAATACCCATTTTTGTTTAGTTTTCCTATTGCAGCAGGCTGACCATCATCTCTTCACTCTGCTAATGTAATTCTCCATTGCATCAATTTTGTTGTCGTTAGTGTTTGATGTCTCATGATCTCTTTGTCCATTTAATTAAGTGGTAAAAAAGTTTGTCATCCAATGACATCTCATTAAAGGCCTCTATTAGAAAACTTTCAGAATGTTAAATCTTCACTTTACTTTTTTCCTTTAAACTTGTGAATTGAGTAATTCATTAATATTGTGTCATGTTAATAATTTTTTTGTTAATTGTTTTTTAATCAGTTTCAATAAAATTATGGATATTCTATTTTTTTTTCTTTAAAATTTTAGCTTTCAATAGATGCCAAACAACACCTTTTTTTTTTTACAGATACGAAGACTTTTTTGGTGCTAAAAAGAAGAATCATGTGAGAAGAAATAAATTGACAAATGGATCAGAGTCTGAACTCTCGGATTCAGGTGATGAAGAGGAAGAAAATGAAGCATATACAGAACCGGTACATATCTTCCTCCACAACGTCAGCTTCTAATTCTAAATTGATAGGTGCTTTTCTTCTTGTGAATACTCATTTTAGTTTTTGATTTTGAGTTGCATACATAGAAGTCAGAAAATCTCTCAACTCATCAAAAGAAACTTAAGAAGCTTCAGTCTGAAATAGAGATGATGGAGAAAGCAAACTTGGAGCCAAAAACGTGGACCATGCAGGGAGAGGTAAAGTCACCATTTTATTGATTTTCTTACTTTCTATGAGATACGATCATTTCAAGGCATTGTAAGCTACATCTTTTATAACAGGTAACCGCCGCGAAAAGACCTAAGAATAGTGCCCTGGAAGTTGATCTAGATTTTGAGCACAATGTGAGACCACCACCTGTAATCACAGAAGAGGTTACAGCAACATTGGAAGAGATGATTCAGAAAAGAATCCTTGAGGTACGTTTCTCTTCTCTAAACGAGTAATTTTTAATGTTTTTGATGTTATTGTTTGTTCATATGTTCTTAAACTTTTGTTTGGTTAATACAATAGGTTTTTCTTCTTACTTAATCATAGTTTATCTATTGAATAAAGACCGGGCGTCTGGATAACTGCATTCGGAGCTGACAGATCAAGACGTTGAAGAGCTTGATTTAAAGTGGCCGTATGCATCTGATAAAAATATTTGACTGGTGTTGTCTTATGCTTTGTTTTCAGGGTCGTTTTGACGAAGTTCAGAAAGCACCTAAACGACCTACTAAAGCACCTAGAGAAATCAAAGAGTTGGTCAGTATTCTTTTATGGTACTCTTTTCTCTGTGTTTCACGTACATAATTCAGAAATATATACAATGTGTATGTGTTATGTGAATTCATTCATGGCGCTTTTCTTGCTCGAGGATTTTGATATTCTTTTCTCTTCTCTTAACAGGATGAGAATAAAAGCAAGAAAGGTCTTGGTGAACTATACGAGGTTTGTGTTCCTTTTAAAAAAAGATGCATCATTAACTTAACCCCCTTTGCTGCCATGCCATATTTTCCAACTGCATGTCTTTCAGTTATGGAATTAATCGATTTCTTAATCTCATTCTTTCCTTCCAGGAAGAGTATGTTGAAAAGACTAATTTGGCTACAGCCCCACCGTCATTCACCGATGAAGCAAAGACAGAGGTACAGTGCATCAAATGTTCTTATTGGTCTTTCTAACTATTTGAATTTGGATAATCAGGAACATTTACTCTAAAACCTTTCCCATTGACGATATCCTTACCTCGTCTTTTTCTCTAATCAGGCGAGCATTCTTTTCAAGAAACTTTGCTCCAAGTTGGATGCTCTCTCTCATTACCACTACGCTCCAAAACCTGTATGCTCCTCATACATCTTTTTGAATTTTATACTGTATTGACCATTTTTCTGGCCATTTTCTCATCTAAATGTTAATTTCTTGCTCATAAAGGTTATAGAGGATATGTCTATATCAACGAATGTCCCTGCTCTAGCAATGGAAGAGGTAACTGCTCTTCTAGTTCAATTCCCGTGTATAAATTACGTAGCTGACTACACAAGCTAATGTTTTAATTATGTTCTGTGGCTAGGTTGCTCCTGTTGCTGTCTCCGATGCGGCAATGCTTGCTCCAGAGGAAGTCTTTGCAGGAAAAGGTGAAATCAAAGAAGCAGCAGAACTTACACAATCTGACAGGAAGAGGAGGAGGGCCAGCAAGAAAAGGAAATATAAAGGTAAAAATAAAACAAAAATAGATAGGCTCACGACATGGTCGTATAGGAATGAATTGTTCGCTTGATTCTAATAGTCTGAGTCTTACAGCCATGGTTGCCAAAAGAGACGCAAAGAAATCAGGAAATACTACAGCTCCAAATGCCAATGAAGGTAATGCTTGTTTAATTTTTTCAATGTTTTCATGGAACCATTGCAACCTCATCTCTAACTATTTTCAATATTTTAGGCCAATGAAGATTCATATTATTATTCAACAATTCTTGAAAGTCTTTTCCATCAATGCCTCAAGATCCCAAGAACAGAAAAATTACAGAAAATGGAATTCGCTAACTTAATTTTGATGAGCCGAAAACCGAATAGCTCGAGGACAGTCTTGGATGTTGAATTTCCACATTTTTTCACTTGTGGCATCCAAGAAAAGTACGAAAATGGTATCGTTATGATATTTTTTTTGCAGCCATTTCCATCATAAGAGTTACCAAGGACGAATTAGAAGCTAAGATGTAAAAAAATCCAAATTTTGTGCTTTTGCTTCCCCTTCCTTTTATTGCATTTTTTTCCATTAAAAAAATGTACTTACATACAGTTAAGATCAATTAAAGCGAGCATGGATTTATTTACCAATTTACGTTGAAATTCATGGATGTATTCAGTCACAATAATTCTACTAAGTCAGA

mRNA sequence

ATGGAGGCAGGGAAAGTGGTTCTTCCTAACATTGAAGCTGGTCTTAAACCACTTCACTCTCTCAAATCTACAGATCCGCCACTCTGGCTTGCTCCAAGCCCTTCCCTATCTCAAGTGGCTCGCCTTGCTTCACAGAGTTTATTTTCTATGCTGAAACCATTCAATCCTAAATCCCCATTTGATCATCTCCTGGTTGATGGATTTGATGCTGAGCAGATATGGCAACAAATTGACCTCCAATCACAACCTCTTTTGGCGAGTGTTCGCCGGGATTTGAAGCGGTTTGAGAAGAATCCAGAAGCAATTTCGAACCTGAAGGTTTCTTTGGAGGATAAGAAGAAGGTTATTCAAGAGATGGGTGTAGAGTCGGGAGAAGAAAGTGATGATTTTGAGGAGGATATGAAGGAGCTTGATGAAGAAGAAGAAGAAGACGATGAGGAAGATGAGGAGGAGGAAGAGGAGGACTGTGATGACAGAGAAGATGGAGACACTGAGGAAGGAGAGAAGGAAAAGAGTGATGATGAAGTTGAGGGAGAAGAAGGTAATGGTGGAATTGAGGATGGATTTTTGAAGCTAAAAGAGCTGGAGGAATTTATGGAGGAGGATGAGGTAAGAGAATATGGTTTACAGAAGAAGAAAGATGGTAAGAAGGAAAAGAAACCGAGGAAGACAGAAGAAGAATCTGATGACGACGAAGATGATGAGCTTGAGGAGTTTGACCTCCATGGTGAGGAGGATGAAGATTCTAGCAAACTGGACAATGCGAGATACGAAGACTTTTTTGGTGCTAAAAAGAAGAATCATGTGAGAAGAAATAAATTGACAAATGGATCAGAGTCTGAACTCTCGGATTCAGGTGATGAAGAGGAAGAAAATGAAGCATATACAGAACCGAAGTCAGAAAATCTCTCAACTCATCAAAAGAAACTTAAGAAGCTTCAGTCTGAAATAGAGATGATGGAGAAAGCAAACTTGGAGCCAAAAACGTGGACCATGCAGGGAGAGGTAACCGCCGCGAAAAGACCTAAGAATAGTGCCCTGGAAGTTGATCTAGATTTTGAGCACAATGTGAGACCACCACCTGTAATCACAGAAGAGGTTACAGCAACATTGGAAGAGATGATTCAGAAAAGAATCCTTGAGGGTCGTTTTGACGAAGTTCAGAAAGCACCTAAACGACCTACTAAAGCACCTAGAGAAATCAAAGAGTTGGATGAGAATAAAAGCAAGAAAGGTCTTGGTGAACTATACGAGGAAGAGTATGTTGAAAAGACTAATTTGGCTACAGCCCCACCGTCATTCACCGATGAAGCAAAGACAGAGGCGAGCATTCTTTTCAAGAAACTTTGCTCCAAGTTGGATGCTCTCTCTCATTACCACTACGCTCCAAAACCTGTTATAGAGGATATGTCTATATCAACGAATGTCCCTGCTCTAGCAATGGAAGAGGTTGCTCCTGTTGCTGTCTCCGATGCGGCAATGCTTGCTCCAGAGGAAGTCTTTGCAGGAAAAGGTGAAATCAAAGAAGCAGCAGAACTTACACAATCTGACAGGAAGAGGAGGAGGGCCAGCAAGAAAAGGAAATATAAAGCCATGGTTGCCAAAAGAGACGCAAAGAAATCAGGAAATACTACAGCTCCAAATGCCAATGAAGGCCAATGA

Coding sequence (CDS)

ATGGAGGCAGGGAAAGTGGTTCTTCCTAACATTGAAGCTGGTCTTAAACCACTTCACTCTCTCAAATCTACAGATCCGCCACTCTGGCTTGCTCCAAGCCCTTCCCTATCTCAAGTGGCTCGCCTTGCTTCACAGAGTTTATTTTCTATGCTGAAACCATTCAATCCTAAATCCCCATTTGATCATCTCCTGGTTGATGGATTTGATGCTGAGCAGATATGGCAACAAATTGACCTCCAATCACAACCTCTTTTGGCGAGTGTTCGCCGGGATTTGAAGCGGTTTGAGAAGAATCCAGAAGCAATTTCGAACCTGAAGGTTTCTTTGGAGGATAAGAAGAAGGTTATTCAAGAGATGGGTGTAGAGTCGGGAGAAGAAAGTGATGATTTTGAGGAGGATATGAAGGAGCTTGATGAAGAAGAAGAAGAAGACGATGAGGAAGATGAGGAGGAGGAAGAGGAGGACTGTGATGACAGAGAAGATGGAGACACTGAGGAAGGAGAGAAGGAAAAGAGTGATGATGAAGTTGAGGGAGAAGAAGGTAATGGTGGAATTGAGGATGGATTTTTGAAGCTAAAAGAGCTGGAGGAATTTATGGAGGAGGATGAGGTAAGAGAATATGGTTTACAGAAGAAGAAAGATGGTAAGAAGGAAAAGAAACCGAGGAAGACAGAAGAAGAATCTGATGACGACGAAGATGATGAGCTTGAGGAGTTTGACCTCCATGGTGAGGAGGATGAAGATTCTAGCAAACTGGACAATGCGAGATACGAAGACTTTTTTGGTGCTAAAAAGAAGAATCATGTGAGAAGAAATAAATTGACAAATGGATCAGAGTCTGAACTCTCGGATTCAGGTGATGAAGAGGAAGAAAATGAAGCATATACAGAACCGAAGTCAGAAAATCTCTCAACTCATCAAAAGAAACTTAAGAAGCTTCAGTCTGAAATAGAGATGATGGAGAAAGCAAACTTGGAGCCAAAAACGTGGACCATGCAGGGAGAGGTAACCGCCGCGAAAAGACCTAAGAATAGTGCCCTGGAAGTTGATCTAGATTTTGAGCACAATGTGAGACCACCACCTGTAATCACAGAAGAGGTTACAGCAACATTGGAAGAGATGATTCAGAAAAGAATCCTTGAGGGTCGTTTTGACGAAGTTCAGAAAGCACCTAAACGACCTACTAAAGCACCTAGAGAAATCAAAGAGTTGGATGAGAATAAAAGCAAGAAAGGTCTTGGTGAACTATACGAGGAAGAGTATGTTGAAAAGACTAATTTGGCTACAGCCCCACCGTCATTCACCGATGAAGCAAAGACAGAGGCGAGCATTCTTTTCAAGAAACTTTGCTCCAAGTTGGATGCTCTCTCTCATTACCACTACGCTCCAAAACCTGTTATAGAGGATATGTCTATATCAACGAATGTCCCTGCTCTAGCAATGGAAGAGGTTGCTCCTGTTGCTGTCTCCGATGCGGCAATGCTTGCTCCAGAGGAAGTCTTTGCAGGAAAAGGTGAAATCAAAGAAGCAGCAGAACTTACACAATCTGACAGGAAGAGGAGGAGGGCCAGCAAGAAAAGGAAATATAAAGCCATGGTTGCCAAAAGAGACGCAAAGAAATCAGGAAATACTACAGCTCCAAATGCCAATGAAGGCCAATGA

Protein sequence

MEAGKVVLPNIEAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPFDHLLVDGFDAEQIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMGVESGEESDDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEEENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVDLDFEHNVRPPPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENKSKKGLGELYEEEYVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIEDMSISTNVPALAMEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRKYKAMVAKRDAKKSGNTTAPNANEGQ*
BLAST of Csa1G002100 vs. Swiss-Prot
Match: RTL1_MOUSE (Retrotransposon-like protein 1 OS=Mus musculus GN=Rtl1 PE=2 SV=1)

HSP 1 Score: 110.5 bits (275), Expect = 5.8e-23
Identity = 64/200 (32.00%), Postives = 119/200 (59.50%), Query Frame = 1

Query: 84   LLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMGVES-GEESDDFEEDMKELDEEEE 143
            LL S+R +L+ F+++ E         ED ++  +E G E  GEE +D EE+  E +E+ E
Sbjct: 1273 LLMSIRANLRYFDRSSETEDK-----EDDEEEEEEDGEEEEGEEEEDGEEEEGEEEEDGE 1332

Query: 144  EDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIEDGFLKLKELEEFMEED 203
            E++EE+E++EEE+ ++ EDG+ EEGE+E+  +E EGEE   G E+   +  E EE  EE+
Sbjct: 1333 EEEEEEEDDEEEEGEEEEDGEEEEGEEEEDGEEEEGEEEEDGEEEEGEEEGEEEEEGEEE 1392

Query: 204  EVREYGLQKKKDGKKEKKPRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDNARYEDFFG 263
            E  E   +++++ ++E++  + EEE +++E++E EE +   +E+E+  ++  +   +   
Sbjct: 1393 EEEEEDEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEEEEDEEEEDEEVP-SMVRELLA 1452

Query: 264  AKKKNHVRRNKLTNGSESEL 283
            A   +H+    L + S +++
Sbjct: 1453 AIPMDHILNGLLAHFSVAQI 1466


HSP 2 Score: 88.2 bits (217), Expect = 3.1e-16
Identity = 60/181 (33.15%), Postives = 97/181 (53.59%), Query Frame = 1

Query: 133  DMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIEDGFLKL 192
            +++  D   E +D+ED+EEEEE     EDG+ EEGE+E+  +E EGEE     EDG    
Sbjct: 1280 NLRYFDRSSETEDKEDDEEEEE-----EDGEEEEGEEEEDGEEEEGEEE----EDG---- 1339

Query: 193  KELEEFMEEDEVREYGLQKKKDGKKEKKPRK---TEEESDDDEDDELEEFDLHGEEDEDS 252
             E EE  EED+  E G ++++DG++E+   +    EEE +++ED E EE +  GEE+E+ 
Sbjct: 1340 -EEEEEEEEDDEEEEG-EEEEDGEEEEGEEEEDGEEEEGEEEEDGEEEEGEEEGEEEEEG 1399

Query: 253  SKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEEENEAYTEPKSENLSTHQKK 311
             + +    ED    +++      +     E E  +  +EE+E E   E + E + +  ++
Sbjct: 1400 EE-EEEEEEDEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEEEEDEEEEDEEVPSMVRE 1444


HSP 3 Score: 73.2 bits (178), Expect = 1.0e-11
Identity = 62/218 (28.44%), Postives = 100/218 (45.87%), Query Frame = 1

Query: 133  DMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIEDGFLKL 192
            +++  D   E +D+ED+EEEEE          E+GE+E+ ++E +GEE  G  E+     
Sbjct: 1280 NLRYFDRSSETEDKEDDEEEEE----------EDGEEEEGEEEEDGEEEEGEEEED---- 1339

Query: 193  KELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEESDDDEDDELEEFDLHGEEDEDSSKL 252
             E EE  EED+  E G ++++DG+        EEE +++ED E EE    GEE+ED    
Sbjct: 1340 GEEEEEEEEDDEEEEG-EEEEDGE--------EEEGEEEEDGEEEE----GEEEEDGE-- 1399

Query: 253  DNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEEENEAYTEPKSENLSTHQKKLKK 312
                                    G E E  +  +EEEE+E   E + E     +++ ++
Sbjct: 1400 -----------------EEEGEEEGEEEEEGEEEEEEEEDEEEEEEEEEEEEEEEEEEEE 1451

Query: 313  LQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVD 351
             + E E  E+   E        EV +  R   +A+ +D
Sbjct: 1460 EEEEEEEEEEDEEEEDEEEEDEEVPSMVRELLAAIPMD 1451


HSP 4 Score: 46.2 bits (108), Expect = 1.3e-03
Identity = 28/115 (24.35%), Postives = 54/115 (46.96%), Query Frame = 1

Query: 95  FEKNPEAISNLKVSLEDKKK------------VIQEMGVESGEESDDFEEDMKELDEEEE 154
           ++ NPE   N    L D               +  + G E   E+ D+  D + ++  E 
Sbjct: 325 YDSNPELSDNSNQELSDNSNQESSDSSNQSSDISNQEGSEPLSEASDYSMD-ETINSSET 384

Query: 155 EDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIEDGFLKLKELEE 198
           + D++D +  +++ ++ E+G  EEG+ + S +EV    GN  +   FL++++L+E
Sbjct: 385 QSDQDDTDLGDDEEEEEEEGGEEEGQPKNSPEEVVATMGN--VISLFLRMQDLKE 436

BLAST of Csa1G002100 vs. Swiss-Prot
Match: YCF2_OENAR (Protein Ycf2 OS=Oenothera argillicola GN=ycf2-A PE=3 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 1.5e-15
Identity = 91/340 (26.76%), Postives = 152/340 (44.71%), Query Frame = 1

Query: 110  EDKKKVIQEMGVESGEESDDFEEDMKELDEEEEEDDEED---EEEEEEDCDDREDGDTEE 169
            E+ +   +E+     EE +  EE+++  ++EE E  EE+    EEE E  +D E   TE+
Sbjct: 1866 EEVEGTEEEVEGTEDEEGEGTEEEVEGTEDEEGEGTEEEVEGTEEEVEGTEDEEGEGTED 1925

Query: 170  GEKEKSDDEVEG---EEGNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRK 229
             E E +++EVEG   EEG G  E+    ++  EE +E  E    G +++ +G ++++   
Sbjct: 1926 EEVEGTEEEVEGTEDEEGEGTEEE----VEGTEEEVEGTEEEVEGTEEEVEGTEDEEVEG 1985

Query: 230  TEEESDDDEDDE---LEEFDLHGEEDE-------DSSKLDNARYEDFFGAKKKNHVRRNK 289
            TEEE +  ED+E    E  ++ G EDE       DSS+ DN R       K +N +   +
Sbjct: 1986 TEEEVEGTEDEEGEGTEYEEVEGTEDEEVEGTEKDSSQFDNDRVTLLLRPKPRNPLDIQR 2045

Query: 290  LT---NGSESELSDSGDEEEENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTW 349
            L       ESEL +  D++E+           +   QK L+ L SE+         P+ W
Sbjct: 2046 LIYQHQKYESELEEDDDDDED-----------VFAPQKMLEDLFSELVW------SPRIW 2105

Query: 350  TMQGEVTAAKRPKNSALEVDLDFEHNVRPPPVITEEVTATLEEMIQKRIL------EGRF 409
                       P +  L+ + +      P   I EE     EE ++  +       EG  
Sbjct: 2106 ----------HPWDFILDCEAEI-----PAEEIPEEEDPLPEEALETEVAVWGEEEEGEA 2165

Query: 410  DEVQKAPKRPTKAPREIKELDENKSKKGLGELYEEEYVEK 425
            D+ ++  +   +   E+ E ++ + K+   EL+EEE  E+
Sbjct: 2166 DD-EEDERLEAQQEDELLEEEDEELKEEEDELHEEEEEEE 2168


HSP 2 Score: 52.0 bits (123), Expect = 2.4e-05
Identity = 25/57 (43.86%), Postives = 40/57 (70.18%), Query Frame = 1

Query: 121  VESGEESDDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVE 178
            +E+ +E +  EE+ +EL EEE+E  EE+EEEEEE+ ++ E  + EE E+E+ +DE++
Sbjct: 2137 LEAQQEDELLEEEDEELKEEEDELHEEEEEEEEEEEEEDELHEEEEEEEEEEEDELQ 2193


HSP 3 Score: 45.8 bits (107), Expect = 1.8e-03
Identity = 34/125 (27.20%), Postives = 61/125 (48.80%), Query Frame = 1

Query: 136  ELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDE-VEGEEGNGGIEDGFLKLKE 195
            E+  EE  ++E+   EE  + +    G+ EEGE +  +DE +E ++ +  +E+   +LKE
Sbjct: 2096 EIPAEEIPEEEDPLPEEALETEVAVWGEEEEGEADDEEDERLEAQQEDELLEEEDEELKE 2155

Query: 196  LEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDN 255
                 EEDE+ E                    E +++E++E EE +LH EE+E+  + ++
Sbjct: 2156 -----EEDELHE--------------------EEEEEEEEEEEEDELHEEEEEEEEEEED 2195

Query: 256  ARYED 260
               E+
Sbjct: 2216 ELQEN 2195


HSP 4 Score: 45.1 bits (105), Expect = 3.0e-03
Identity = 44/184 (23.91%), Postives = 80/184 (43.48%), Query Frame = 1

Query: 158  DREDGDTEEGEKEKSDDEVEGEEGNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGKK 217
            D ++  T  G  E   D V G      I  G L+L+         E    G +++ +G +
Sbjct: 1808 DEQNLITSYGLVENDSDLVHGLSD---IVHGLLELEGALVGSSPTEEEVEGTEEEVEGTE 1867

Query: 218  EKKPRKTEEESDDDEDDELE--EFDLHGEEDEDSSKLD---NARYEDFFGAK-KKNHVRR 277
            +++   TEEE +  ED+E E  E ++ G EDE+    +       E+  G + ++     
Sbjct: 1868 DEEVEGTEEEVEGTEDEEGEGTEEEVEGTEDEEGEGTEEEVEGTEEEVEGTEDEEGEGTE 1927

Query: 278  NKLTNGSESELSDSGDEE-EENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTW 335
            ++   G+E E+  + DEE E  E   E   E +   +++++  + E+E  E   +E    
Sbjct: 1928 DEEVEGTEEEVEGTEDEEGEGTEEEVEGTEEEVEGTEEEVEGTEEEVEGTEDEEVEGTEE 1987


HSP 5 Score: 33.5 bits (75), Expect = 9.0e+00
Identity = 18/62 (29.03%), Postives = 31/62 (50.00%), Query Frame = 1

Query: 110  EDKKKVIQEMGVESGEESDDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEK 169
            E++ ++ +E   E  EE ++ E   +E +EEEEE+DE  E + E    + +     +G  
Sbjct: 2155 EEEDELHEEEEEEEEEEEEEDELHEEEEEEEEEEEDELQENDSEFFRSETQQPQARDGFS 2214

Query: 170  EK 172
            E+
Sbjct: 2215 EE 2216


HSP 6 Score: 32.7 bits (73), Expect = 1.5e+01
Identity = 36/170 (21.18%), Postives = 68/170 (40.00%), Query Frame = 1

Query: 130  FEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIEDGF 189
            ++    E + EE++DD+ED    ++  +D               D +   E     E+  
Sbjct: 2044 YQHQKYESELEEDDDDDEDVFAPQKMLEDLFSELVWSPRIWHPWDFILDCEAEIPAEEIP 2103

Query: 190  LKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEESDD-----DEDDELEEFDLHGE 249
             +   L E   E EV  +G +++ +   E+  R   ++ D+     DE+ + EE +LH E
Sbjct: 2104 EEEDPLPEEALETEVAVWGEEEEGEADDEEDERLEAQQEDELLEEEDEELKEEEDELHEE 2163

Query: 250  EDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEEENEA 295
            E+E+                ++     ++L    E E  +  DE +EN++
Sbjct: 2164 EEEE----------------EEEEEEEDELHEEEEEEEEEEEDELQENDS 2197

BLAST of Csa1G002100 vs. Swiss-Prot
Match: ABCF4_DICDI (ABC transporter F family member 4 OS=Dictyostelium discoideum GN=abcF4 PE=3 SV=1)

HSP 1 Score: 83.6 bits (205), Expect = 7.6e-15
Identity = 84/330 (25.45%), Postives = 146/330 (44.24%), Query Frame = 1

Query: 122 ESGEESDDFEEDMKELDE----------EEEEDDEEDEEEEEEDCDDREDGDTEEGEKEK 181
           ES +E DD ++ +K+  +          EEEE++EE+EE E+         D ++G K K
Sbjct: 266 ESEDEEDDVQQPVKKGGKKDKKKGSKHVEEEEEEEEEEEIEQPVKKGSNKKDQKKGGKGK 325

Query: 182 SDDEVEGEEGNGGIEDGFLK----------LKELEEFMEEDEVREYGLQKKKDGKKEKK- 241
             +E E EE    IE    K           K  ++   EDE  E     KK GKK+KK 
Sbjct: 326 HVEEEEEEEEEEEIEQPVKKGSNKKDQKKGGKGKQQQESEDEEEEIQQPVKKGGKKDKKK 385

Query: 242 --PRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGS 301
                 EEE +++E++E+E+    G + +  S L+++  E    +KK           G 
Sbjct: 386 GSKHVEEEEEEEEEEEEIEQPVKKGGKKDKKSSLEDSMSELSIKSKK-----------GG 445

Query: 302 ESELSDSGDEEEENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTA 361
           + +  +  +E+E+ E   +PKS++    +KK K ++ E E  E+   +PK+ + + +   
Sbjct: 446 KGKHVEEEEEQEQEEEEEKPKSKSNKKDKKKGKHVEEEEEEEEEEEEKPKSKSNKKD--- 505

Query: 362 AKRPKNSALEVDLDFEHNVRPPPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAP 421
            K+      E + + E          EE   TL E+   +       +V+K  K+  K  
Sbjct: 506 KKKGSKHIEEEEEEEEEEEEEEKEEEEEKKMTLAEIRAAK-------KVKKVDKKEKKKE 565

Query: 422 REIKELDENKSKK-GLGELYEEEYVEKTNL 428
           +E K+ DE +     L +  ++E ++  N+
Sbjct: 566 KEKKKRDEQEEDAFELAKKKQQEEIDYDNI 574


HSP 2 Score: 80.1 bits (196), Expect = 8.4e-14
Identity = 84/354 (23.73%), Postives = 159/354 (44.92%), Query Frame = 1

Query: 75  QQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMGVESGEESDDFEEDM 134
           ++ D+Q QP+    ++D K+  K+ E         E++++   E  V+ G    D ++  
Sbjct: 270 EEDDVQ-QPVKKGGKKDKKKGSKHVEEEE------EEEEEEEIEQPVKKGSNKKDQKKGG 329

Query: 135 KELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIEDGFLKLKE 194
           K    EEEE++EE+EE E+         D ++G K K   E E EE            +E
Sbjct: 330 KGKHVEEEEEEEEEEEIEQPVKKGSNKKDQKKGGKGKQQQESEDEE------------EE 389

Query: 195 LEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDN 254
           +++      V++ G + KK G K  +    EEE +++E++E+E+    G + +  S L++
Sbjct: 390 IQQ-----PVKKGGKKDKKKGSKHVE----EEEEEEEEEEEIEQPVKKGGKKDKKSSLED 449

Query: 255 ARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEEENEAYTEPKSENLSTHQKKLKKLQ 314
           +  E    +KK           G + +  +  +E+E+ E   +PKS++    +KK K ++
Sbjct: 450 SMSELSIKSKK-----------GGKGKHVEEEEEQEQEEEEEKPKSKSNKKDKKKGKHVE 509

Query: 315 SEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVDLDFEHNVRPPPVITEEVTATLEEM 374
            E E  E+   +PK+ + + +    K+      E + + E          EE   TL E+
Sbjct: 510 EEEEEEEEEEEKPKSKSNKKD---KKKGSKHIEEEEEEEEEEEEEEKEEEEEKKMTLAEI 569

Query: 375 IQKRILEGRFDEVQKAPKRPTKAPREIKELDENKSKK-GLGELYEEEYVEKTNL 428
              +       +V+K  K+  K  +E K+ DE +     L +  ++E ++  N+
Sbjct: 570 RAAK-------KVKKVDKKEKKKEKEKKKRDEQEEDAFELAKKKQQEEIDYDNI 574


HSP 3 Score: 77.8 bits (190), Expect = 4.2e-13
Identity = 80/344 (23.26%), Postives = 148/344 (43.02%), Query Frame = 1

Query: 125 EESDDFEEDM-------------KELDEEEEEDDEEDE-----EEEEEDCDDREDGDTEE 184
           ++SDD +E++             K+  ++++ DDEEDE     ++  +    ++ G  +E
Sbjct: 143 QDSDDEQEEIPQPVKKGGKPAPQKKGGKQQDSDDEEDEIPQPVKKGGKPAPQKKGGKQQE 202

Query: 185 GEKEKSDDEVEGEEGNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRK--- 244
            E E  +DEV+     GG  D    +K +EE  EE+E  E   Q  K G K  KP+K   
Sbjct: 203 SEDEDEEDEVQQPVKKGGKNDKKKGVKHVEEEEEEEEEEEIE-QPVKKGGKAPKPKKGGK 262

Query: 245 -TEEESDDDEDDELEEFDLHGEEDED------SSKLDNARYEDFFGAKKKNHVRRNKLTN 304
            +++ES+D+EDD  +     G++D+         + +    E+     KK   ++++   
Sbjct: 263 GSKQESEDEEDDVQQPVKKGGKKDKKKGSKHVEEEEEEEEEEEIEQPVKKGSNKKDQKKG 322

Query: 305 GSESELSDSGDEEEENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEV 364
           G    + +  +EEEE E     K  +    QKK  K + + E  ++     +     G+ 
Sbjct: 323 GKGKHVEEEEEEEEEEEIEQPVKKGSNKKDQKKGGKGKQQQESEDEEEEIQQPVKKGGKK 382

Query: 365 TAAKRPKNSALEVDLDFEHNVRPPPVI---TEEVTATLEEMIQKRILE----GRFDEVQK 424
              K  K+   E + + E      PV     ++  ++LE+ + +  ++    G+   V++
Sbjct: 383 DKKKGSKHVEEEEEEEEEEEEIEQPVKKGGKKDKKSSLEDSMSELSIKSKKGGKGKHVEE 442

Query: 425 APKRPTKAPREIKELDENKSKKGLGELYEEEYVEKTNLATAPPS 434
             ++  +   E  +   NK  K  G+  EEE  E+      P S
Sbjct: 443 EEEQEQEEEEEKPKSKSNKKDKKKGKHVEEEEEEEEEEEEKPKS 485


HSP 4 Score: 73.6 bits (179), Expect = 7.8e-12
Identity = 83/356 (23.31%), Postives = 150/356 (42.13%), Query Frame = 1

Query: 110 EDKKKVIQEMGVESGEESDDFEEDMKELDEEEEEDDEEDEEEEEEDCDD----REDGDTE 169
           ED++  +Q+   + G+  +D ++ +K ++EEEEE++EE+ E+  +        ++ G   
Sbjct: 206 EDEEDEVQQPVKKGGK--NDKKKGVKHVEEEEEEEEEEEIEQPVKKGGKAPKPKKGGKGS 265

Query: 170 EGEKEKSDDEVEGEEGNGGIEDGFLKLKELEEFMEEDEVREYGL-----QKKKDGKKEKK 229
           + E E  +D+V+     GG +D     K +EE  EE+E  E          KKD KK  K
Sbjct: 266 KQESEDEEDDVQQPVKKGGKKDKKKGSKHVEEEEEEEEEEEIEQPVKKGSNKKDQKKGGK 325

Query: 230 PRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSES 289
            +  EEE +++E++E+E+    G   +D  K    + +      ++  +++     G + 
Sbjct: 326 GKHVEEEEEEEEEEEIEQPVKKGSNKKDQKKGGKGKQQQ-ESEDEEEEIQQPVKKGGKKD 385

Query: 290 ELSDSG--DEEEENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTA 349
           +   S   +EEEE E   E   + +    KK KK   E  M E +            + +
Sbjct: 386 KKKGSKHVEEEEEEEEEEEEIEQPVKKGGKKDKKSSLEDSMSELS------------IKS 445

Query: 350 AKRPKNSALEVDLDFEHNVRPPPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAP 409
            K  K   +E + + E          EE      +  +K   +G+  E ++  +   +  
Sbjct: 446 KKGGKGKHVEEEEEQEQE--------EEEEKPKSKSNKKDKKKGKHVEEEEEEEEEEEEK 505

Query: 410 REIKELDENKSKKGLGELYEEEYVEKTNLATAPPSFTDEAKTEASILFKKLCSKLD 455
            + K   ++K KKG   + EEE  E+           ++  T A I   K   K+D
Sbjct: 506 PKSKSNKKDK-KKGSKHIEEEEEEEEEEEEEEKEEEEEKKMTLAEIRAAKKVKKVD 537


HSP 5 Score: 52.8 bits (125), Expect = 1.4e-05
Identity = 82/356 (23.03%), Postives = 148/356 (41.57%), Query Frame = 1

Query: 96  EKNPEAISNLKVSLEDKKKVIQEMGVESGEESDDFEEDMKELDEEEEEDDEEDEEEEEED 155
           E+ P++ SN K    DKKK       E  EE ++ +   K   +++++  +  EEEEEE+
Sbjct: 451 EEKPKSKSNKK----DKKKGKHVEEEEEEEEEEEEKPKSKSNKKDKKKGSKHIEEEEEEE 510

Query: 156 CDDREDGDTEEGEKEKSDDEVEGEEGNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDG 215
            ++ E+   EE EK+ +  E+   +          K+K+++               KK+ 
Sbjct: 511 EEEEEEEKEEEEEKKMTLAEIRAAK----------KVKKVD---------------KKEK 570

Query: 216 KKEKKPRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLT 275
           KKEK+ +K +E+ +D        F+L  ++ ++    DN   +D  G     +V   K +
Sbjct: 571 KKEKEKKKRDEQEED-------AFELAKKKQQEEIDYDNIDIDDVPGKDAPTYVHL-KSS 630

Query: 276 NGSESELSDSGDEEEENEAYTEP-------KSENLSTHQK---------KLKKLQSEIEM 335
            G  S++ +  D + +N   + P        S  L+  QK             L  +I M
Sbjct: 631 EGLRSKIGN--DIKFDNLILSVPGRILLNNASLTLAYGQKYGFVGRNGIGKSTLVKKIAM 690

Query: 336 MEKANLEP--KTWTMQGEVTA-AKRPKNSALEVDLDFEHNVRPPPVITEEVTATLEEMIQ 395
            ++  + P  +   ++ EVT     P +  L  D + +  +    V+TE         ++
Sbjct: 691 RDEITIAPHLRVLYVEQEVTGDDTTPLDCVLAADEERKWLLDEEKVLTE---------LE 750

Query: 396 KRILEGRFDEVQKAPKRPTKAPREIKELDENKSK-------KGLGELYEEEYVEKT 426
           K     +FD  QK           +KE+D +K+         GLG  +EE  V+K+
Sbjct: 751 KVNPSWQFDPRQKRNYSLRDIYDRLKEIDADKASIRAANILIGLGFTFEEISVKKS 758

BLAST of Csa1G002100 vs. Swiss-Prot
Match: AN32B_MOUSE (Acidic leucine-rich nuclear phosphoprotein 32 family member B OS=Mus musculus GN=Anp32b PE=1 SV=1)

HSP 1 Score: 80.5 bits (197), Expect = 6.4e-14
Identity = 46/125 (36.80%), Postives = 80/125 (64.00%), Query Frame = 1

Query: 110 EDKKKVIQEMGVESGEESDDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEK 169
           ED++    ++ V+S EE+ D + ++  +D+EEE+++ EDEEEEE++  D E+ + E+ E 
Sbjct: 151 EDQEAPDSDVEVDSVEEAPDSDGEVDGVDKEEEDEEGEDEEEEEDE--DGEEEEDEDEED 210

Query: 170 EKSDDEVEGEEGN---GGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEE 229
           E  D++VEGE+      G E+ F    E++E  EEDE  +   ++++ GK EK+ R+T++
Sbjct: 211 EDEDEDVEGEDDEDEVSGEEEEFGHDGEVDE-DEEDEDEDEDEEEEESGKGEKRKRETDD 270

Query: 230 ESDDD 232
           E +DD
Sbjct: 271 EGEDD 272


HSP 2 Score: 55.5 bits (132), Expect = 2.2e-06
Identity = 60/208 (28.85%), Postives = 78/208 (37.50%), Query Frame = 1

Query: 111 DKKKVIQEMGVESGEESDDFEEDMKELDEEEEED--DEEDEEEEEEDC--DDREDGDTEE 170
           D  K +   G E    SD  E   + L +    D  D ED+E  + D   D  E+    +
Sbjct: 113 DCLKSLDLFGCEVTNRSDYRETVFRLLPQLSYLDGYDREDQEAPDSDVEVDSVEEAPDSD 172

Query: 171 GEKEKSDDEVEGEEGNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEE 230
           GE +  D E E EEG              +E  EEDE          DG         EE
Sbjct: 173 GEVDGVDKEEEDEEGE-------------DEEEEEDE----------DG---------EE 232

Query: 231 ESDDDEDDELEEFDLHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSG 290
           E D+DE+DE E+ D+ GE+DED                           +G E E    G
Sbjct: 233 EEDEDEEDEDEDEDVEGEDDEDE-------------------------VSGEEEEFGHDG 263

Query: 291 --DEEEENEAYTEPKSENLSTHQKKLKK 313
             DE+EE+E   E + E  S   +K K+
Sbjct: 293 EVDEDEEDEDEDEDEEEEESGKGEKRKR 263

BLAST of Csa1G002100 vs. Swiss-Prot
Match: AN32B_RAT (Acidic leucine-rich nuclear phosphoprotein 32 family member B OS=Rattus norvegicus GN=Anp32b PE=1 SV=1)

HSP 1 Score: 80.5 bits (197), Expect = 6.4e-14
Identity = 46/125 (36.80%), Postives = 80/125 (64.00%), Query Frame = 1

Query: 110 EDKKKVIQEMGVESGEESDDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEK 169
           ED++    ++ V+S EE+ D + ++  +D+EEE+++ EDEEEEE++  D E+ + E+ E 
Sbjct: 151 EDQEAPDSDVEVDSVEEAPDSDGEVDGVDKEEEDEEGEDEEEEEDE--DGEEEEDEDEED 210

Query: 170 EKSDDEVEGEEGN---GGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEE 229
           E  D++VEGE+      G E+ F    E++E  EEDE  +   ++++ GK EK+ R+T++
Sbjct: 211 EDEDEDVEGEDDEDEVSGEEEEFGHDGEVDE-DEEDEDEDEDEEEEESGKGEKRKRETDD 270

Query: 230 ESDDD 232
           E +DD
Sbjct: 271 EGEDD 272


HSP 2 Score: 55.5 bits (132), Expect = 2.2e-06
Identity = 60/208 (28.85%), Postives = 78/208 (37.50%), Query Frame = 1

Query: 111 DKKKVIQEMGVESGEESDDFEEDMKELDEEEEED--DEEDEEEEEEDC--DDREDGDTEE 170
           D  K +   G E    SD  E   + L +    D  D ED+E  + D   D  E+    +
Sbjct: 113 DCLKSLDLFGCEVTNRSDYRETVFRLLPQLSYLDGYDREDQEAPDSDVEVDSVEEAPDSD 172

Query: 171 GEKEKSDDEVEGEEGNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEE 230
           GE +  D E E EEG              +E  EEDE          DG         EE
Sbjct: 173 GEVDGVDKEEEDEEGE-------------DEEEEEDE----------DG---------EE 232

Query: 231 ESDDDEDDELEEFDLHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSG 290
           E D+DE+DE E+ D+ GE+DED                           +G E E    G
Sbjct: 233 EEDEDEEDEDEDEDVEGEDDEDE-------------------------VSGEEEEFGHDG 263

Query: 291 --DEEEENEAYTEPKSENLSTHQKKLKK 313
             DE+EE+E   E + E  S   +K K+
Sbjct: 293 EVDEDEEDEDEDEDEEEEESGKGEKRKR 263

BLAST of Csa1G002100 vs. TrEMBL
Match: W9S508_9ROSA (U3 small nucleolar ribonucleoprotein protein MPP10 OS=Morus notabilis GN=L484_002912 PE=3 SV=1)

HSP 1 Score: 651.0 bits (1678), Expect = 1.3e-183
Identity = 351/570 (61.58%), Postives = 434/570 (76.14%), Query Frame = 1

Query: 12  EAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPFDHLLVDGFDAE 71
           +AG + LH LK+TDPP WLAPS +LSQ AR ASQ LFS L+P+ PK+PF+ LLV+GFDAE
Sbjct: 6   DAGSEALHRLKTTDPPQWLAPSAALSQTARAASQHLFSSLRPYAPKTPFEQLLVEGFDAE 65

Query: 72  QIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMGVESGEE----- 131
           QIWQQ+DLQSQPLL+S+RR+LKRFEK+PE I  L+V++E K++   E G+E   E     
Sbjct: 66  QIWQQLDLQSQPLLSSIRRELKRFEKDPEEIRKLRVAMEGKER---EKGLEDEVEKMEVE 125

Query: 132 SDDFEEDMKELDEEEEEDDE-EDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGI 191
           SDDF ED+++L+EEEE+  + E +E+E+ED D+ ED   EEGE        EG EG GGI
Sbjct: 126 SDDFGEDLEDLEEEEEQQQQKEGKEDEDEDEDEDED---EEGE--------EGNEG-GGI 185

Query: 192 EDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEK----------KPRKTEEESDDDEDD- 251
           ED FLK+K+LE+F++EDE REYG  KKKD KK+K          +  + EEE+ DDED+ 
Sbjct: 186 EDRFLKMKDLEKFLKEDEEREYG-SKKKDKKKKKVDDGDQEENTEKEEEEEEAGDDEDEN 245

Query: 252 -----------ELEEFDLHGEEDEDSSKLDNARYEDFFGAKKKN-HVRRNKLTNGSE-SE 311
                      E+ EF L  +EDED+ +L NARYEDFFG KKK    +++KLT   E S 
Sbjct: 246 GDGEDEDEDSEEMGEFGLDDDEDEDTDELGNARYEDFFGGKKKKPSKKKSKLTGDLEGSG 305

Query: 312 LSDSGDEEEENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKR 371
           + D G+++E        K + LSTH+K+L K + +IE MEK+NLEPKTWTMQGEVTAAKR
Sbjct: 306 MDDDGEDDE--------KQQTLSTHEKELAKRKLKIEQMEKSNLEPKTWTMQGEVTAAKR 365

Query: 372 PKNSALEVDLDFEHNVRPPPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREI 431
           PKNSALEVDLDFEHN+RPPPVITEEVTA+LE++I+KRILEG FD+V+K P  P+KAPRE+
Sbjct: 366 PKNSALEVDLDFEHNMRPPPVITEEVTASLEDIIKKRILEGHFDDVEKPPALPSKAPREV 425

Query: 432 KELDENKSKKGLGELYEEEYVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHY 491
           KELDENKSKKGL E+YEEEY +KTNLA+ P SF +E K EAS+LFKKLC KLDALSH+H+
Sbjct: 426 KELDENKSKKGLAEIYEEEYAQKTNLASMPLSFAEEEKKEASMLFKKLCLKLDALSHFHF 485

Query: 492 APKPVIEDMSISTNVPALAMEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRR 551
            PKPVIEDMSI  NVPALAMEE+AP+AVSDAAMLAPEEVFAGKG+IKE +ELTQ++RKRR
Sbjct: 486 TPKPVIEDMSIQANVPALAMEEIAPLAVSDAAMLAPEEVFAGKGDIKEESELTQAERKRR 545

BLAST of Csa1G002100 vs. TrEMBL
Match: V4SYS7_9ROSI (U3 small nucleolar ribonucleoprotein protein MPP10 OS=Citrus clementina GN=CICLE_v10019513mg PE=3 SV=1)

HSP 1 Score: 649.4 bits (1674), Expect = 3.9e-183
Identity = 338/560 (60.36%), Postives = 435/560 (77.68%), Query Frame = 1

Query: 18  LHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPFDHLLVDGFDAEQIWQQI 77
           LH LKST+PP+WLAP   LS+ AR AS+ +FS L+P+ PKSP D LL++GFDAEQIWQQI
Sbjct: 14  LHRLKSTEPPVWLAPRAELSETARKASKIIFSYLRPYAPKSPLDQLLIEGFDAEQIWQQI 73

Query: 78  DLQSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMG-VESGE------ESDDF 137
           DLQSQPLL+S++R++K+FEK PE I  ++  LE +KKV++ +G V  GE      ESDDF
Sbjct: 74  DLQSQPLLSSLKREVKKFEKKPEEIGKIREVLEGEKKVVESVGKVLEGERRVLAVESDDF 133

Query: 138 EEDMKE-LDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEG-NGGIEDG 197
           +ED+ + LD+++++DD++DEEEEEE     E+ +  EGE+EK     +G+ G  GGIED 
Sbjct: 134 DEDLDDGLDDDDDDDDDDDEEEEEE-----EEVEGSEGEEEK-----KGKGGPEGGIEDE 193

Query: 198 FLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKT--------EEESDDDEDDELEEFD 257
           FLK+ EL+E++EEDE REYGL+K  +  ++K  R+         E+E +D+ED++ EE  
Sbjct: 194 FLKINELQEYLEEDEAREYGLKKDSNDSRKKGGRRVLDNEEDEDEDEDEDEEDEDEEELG 253

Query: 258 LHG------EEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEEENEA 317
           + G      EEDE   KL+NA YEDFFG+K+K   ++N  +     E S   DEE+E+EA
Sbjct: 254 VFGDSDDNEEEDEHRQKLENAGYEDFFGSKRKKAPKKNLKSTEELEEDSGLDDEEDEDEA 313

Query: 318 YTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVDLDFE 377
               K++NLSTH+K+ ++L++EIE MEKANL+PKTWTMQGEVTAA+RPKNSALEVDLDF+
Sbjct: 314 VETKKNDNLSTHEKQSEQLRAEIEKMEKANLDPKTWTMQGEVTAAQRPKNSALEVDLDFQ 373

Query: 378 HNVRPPPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENKSKKGLG 437
           HNVRP PVITEE TA+LEEMI+KRI+EG+FD+V+KA   P+KAPRE+KELDENKSKKGL 
Sbjct: 374 HNVRPAPVITEEFTASLEEMIKKRIIEGQFDDVEKAASLPSKAPRELKELDENKSKKGLA 433

Query: 438 ELYEEEYVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIEDMSIST 497
           E+YEEEYV+KTN A AP +F+DE K EAS+LFKKLC KLDALSH+H+ PKPVIEDMSI  
Sbjct: 434 EVYEEEYVQKTNPAAAPLTFSDEQKKEASMLFKKLCLKLDALSHFHFTPKPVIEDMSIQA 493

Query: 498 NVPALAMEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRKYKAMVA 554
           NVPALAMEE+APVAVSDAAMLAPEEVFAG+G++KE AELT+++RKRRRASKKRK+KA   
Sbjct: 494 NVPALAMEEIAPVAVSDAAMLAPEEVFAGRGDVKEEAELTKAERKRRRASKKRKFKAEAT 553

BLAST of Csa1G002100 vs. TrEMBL
Match: A0A061FTN6_THECC (U3 small nucleolar ribonucleoprotein protein MPP10 OS=Theobroma cacao GN=TCM_012096 PE=3 SV=1)

HSP 1 Score: 638.6 bits (1646), Expect = 6.8e-180
Identity = 338/565 (59.82%), Postives = 432/565 (76.46%), Query Frame = 1

Query: 10  NIEAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPFDHLLVDGFD 69
           +IEA ++ L  +KS +PP+WL P   LSQ  R AS+ LFS LKP +PKSPFD LL++GFD
Sbjct: 4   SIEAAVESLREIKSKEPPMWLVPKQELSQAVRAASKHLFSSLKPHSPKSPFDQLLIEGFD 63

Query: 70  AEQIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLED--KKKVIQEMGVESGEES 129
           AEQIWQQIDLQSQPLL ++RR++K+FEKNPE IS LK ++E   KKKV++E G +  ++ 
Sbjct: 64  AEQIWQQIDLQSQPLLYTLRREVKKFEKNPEEISKLKEAIEGGKKKKVVEEKGTDDIDDD 123

Query: 130 DDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIED 189
           DD ++D+ ++D++++ DD+E+EEE+      +E G  EE E+E  + E+EGEE  GGIED
Sbjct: 124 DDDDDDL-DMDDDDDYDDDEEEEEK------KERGREEESEREGEEMELEGEE-KGGIED 183

Query: 190 GFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKT------------EEESDDDEDDE 249
            FLK+KEL+E++EEDE +EYGL+K K  K E K ++             EEE +DD++DE
Sbjct: 184 KFLKIKELQEYLEEDEAKEYGLKKNKKTKTETKKKEEDTEEEEEGEDEEEEEEEDDDNDE 243

Query: 250 LEE-------FDLHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDE 309
            EE       FD   E+DEDS  L+NARYEDFFG KK N   + K  +    E  DSG +
Sbjct: 244 AEEEEDELGLFDGDDEDDEDS--LENARYEDFFGTKK-NKGPKEKAKSRDRFE-GDSGSD 303

Query: 310 EEENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALE 369
           +E++    + + + LSTH+++LKKLQS+IE MEKANL+PK WTM+GEVTAA+R KNSALE
Sbjct: 304 DEQD---FDKRKDGLSTHEEELKKLQSKIEQMEKANLDPKVWTMRGEVTAAQRQKNSALE 363

Query: 370 VDLDFEHNVRPPPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENK 429
           VDLDFEHNVRP PVITEEVTA+LE++I+ RI EG FD+VQK+    +KAPRE KELD+NK
Sbjct: 364 VDLDFEHNVRPAPVITEEVTASLEDLIKTRISEGLFDDVQKSRSLSSKAPRETKELDDNK 423

Query: 430 SKKGLGELYEEEYVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIE 489
           SKKGL E+YEEE+V+KT+ A AP SF+DE K EAS+LFKKLC KLDALSH+H+ PKPV+E
Sbjct: 424 SKKGLAEVYEEEFVQKTDPAAAPLSFSDELKKEASMLFKKLCLKLDALSHFHFTPKPVVE 483

Query: 490 DMSISTNVPALAMEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRK 549
           DMSI TNVPALAMEE+AP+AVSDAAMLAPEEVFAGKG+IKE AELT+S+RKRRRA+KKRK
Sbjct: 484 DMSIQTNVPALAMEEIAPMAVSDAAMLAPEEVFAGKGDIKEEAELTRSERKRRRANKKRK 543

Query: 550 YKAMVAKRDAKKSGNTTAPNANEGQ 554
           +KA  AKR AKK   TT  ++NEG+
Sbjct: 544 FKAEAAKRLAKKPRETTQVDSNEGK 553

BLAST of Csa1G002100 vs. TrEMBL
Match: A0A061FU83_THECC (U3 small nucleolar ribonucleoprotein protein MPP10 OS=Theobroma cacao GN=TCM_012096 PE=3 SV=1)

HSP 1 Score: 638.3 bits (1645), Expect = 8.9e-180
Identity = 338/564 (59.93%), Postives = 431/564 (76.42%), Query Frame = 1

Query: 10  NIEAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPFDHLLVDGFD 69
           +IEA ++ L  +KS +PP+WL P   LSQ  R AS+ LFS LKP +PKSPFD LL++GFD
Sbjct: 4   SIEAAVESLREIKSKEPPMWLVPKQELSQAVRAASKHLFSSLKPHSPKSPFDQLLIEGFD 63

Query: 70  AEQIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLED--KKKVIQEMGVESGEES 129
           AEQIWQQIDLQSQPLL ++RR++K+FEKNPE IS LK ++E   KKKV++E G +  ++ 
Sbjct: 64  AEQIWQQIDLQSQPLLYTLRREVKKFEKNPEEISKLKEAIEGGKKKKVVEEKGTDDIDDD 123

Query: 130 DDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIED 189
           DD ++D+ ++D++++ DD+E+EEE+      +E G  EE E+E  + E+EGEE  GGIED
Sbjct: 124 DDDDDDL-DMDDDDDYDDDEEEEEK------KERGREEESEREGEEMELEGEE-KGGIED 183

Query: 190 GFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKT------------EEESDDDEDDE 249
            FLK+KEL+E++EEDE +EYGL+K K  K E K ++             EEE +DD++DE
Sbjct: 184 KFLKIKELQEYLEEDEAKEYGLKKNKKTKTETKKKEEDTEEEEEGEDEEEEEEEDDDNDE 243

Query: 250 LEE-------FDLHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDE 309
            EE       FD   E+DEDS  L+NARYEDFFG KK N   + K  +    E  DSG +
Sbjct: 244 AEEEEDELGLFDGDDEDDEDS--LENARYEDFFGTKK-NKGPKEKAKSRDRFE-GDSGSD 303

Query: 310 EEENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALE 369
           +E++    + + + LSTH+++LKKLQS+IE MEKANL+PK WTM+GEVTAA+R KNSALE
Sbjct: 304 DEQD---FDKRKDGLSTHEEELKKLQSKIEQMEKANLDPKVWTMRGEVTAAQRQKNSALE 363

Query: 370 VDLDFEHNVRPPPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENK 429
           VDLDFEHNVRP PVITEEVTA+LE++I+ RI EG FD+VQK+    +KAPRE KELD+NK
Sbjct: 364 VDLDFEHNVRPAPVITEEVTASLEDLIKTRISEGLFDDVQKSRSLSSKAPRETKELDDNK 423

Query: 430 SKKGLGELYEEEYVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIE 489
           SKKGL E+YEEE+V+KT+ A AP SF+DE K EAS+LFKKLC KLDALSH+H+ PKPV+E
Sbjct: 424 SKKGLAEVYEEEFVQKTDPAAAPLSFSDELKKEASMLFKKLCLKLDALSHFHFTPKPVVE 483

Query: 490 DMSISTNVPALAMEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRK 549
           DMSI TNVPALAMEE+AP+AVSDAAMLAPEEVFAGKG+IKE AELT+S+RKRRRA+KKRK
Sbjct: 484 DMSIQTNVPALAMEEIAPMAVSDAAMLAPEEVFAGKGDIKEEAELTRSERKRRRANKKRK 543

Query: 550 YKAMVAKRDAKKSGNTTAPNANEG 553
           +KA  AKR AKK   TT  ++NEG
Sbjct: 544 FKAEAAKRLAKKPRETTQVDSNEG 552

BLAST of Csa1G002100 vs. TrEMBL
Match: A0A061FUL2_THECC (U3 small nucleolar ribonucleoprotein protein MPP10 OS=Theobroma cacao GN=TCM_012096 PE=3 SV=1)

HSP 1 Score: 638.3 bits (1645), Expect = 8.9e-180
Identity = 338/564 (59.93%), Postives = 431/564 (76.42%), Query Frame = 1

Query: 10  NIEAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPFDHLLVDGFD 69
           +IEA ++ L  +KS +PP+WL P   LSQ  R AS+ LFS LKP +PKSPFD LL++GFD
Sbjct: 4   SIEAAVESLREIKSKEPPMWLVPKQELSQAVRAASKHLFSSLKPHSPKSPFDQLLIEGFD 63

Query: 70  AEQIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLED--KKKVIQEMGVESGEES 129
           AEQIWQQIDLQSQPLL ++RR++K+FEKNPE IS LK ++E   KKKV++E G +  ++ 
Sbjct: 64  AEQIWQQIDLQSQPLLYTLRREVKKFEKNPEEISKLKEAIEGGKKKKVVEEKGTDDIDDD 123

Query: 130 DDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIED 189
           DD ++D+ ++D++++ DD+E+EEE+      +E G  EE E+E  + E+EGEE  GGIED
Sbjct: 124 DDDDDDL-DMDDDDDYDDDEEEEEK------KERGREEESEREGEEMELEGEE-KGGIED 183

Query: 190 GFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKT------------EEESDDDEDDE 249
            FLK+KEL+E++EEDE +EYGL+K K  K E K ++             EEE +DD++DE
Sbjct: 184 KFLKIKELQEYLEEDEAKEYGLKKNKKTKTETKKKEEDTEEEEEGEDEEEEEEEDDDNDE 243

Query: 250 LEE-------FDLHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDE 309
            EE       FD   E+DEDS  L+NARYEDFFG KK N   + K  +    E  DSG +
Sbjct: 244 AEEEEDELGLFDGDDEDDEDS--LENARYEDFFGTKK-NKGPKEKAKSRDRFE-GDSGSD 303

Query: 310 EEENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALE 369
           +E++    + + + LSTH+++LKKLQS+IE MEKANL+PK WTM+GEVTAA+R KNSALE
Sbjct: 304 DEQD---FDKRKDGLSTHEEELKKLQSKIEQMEKANLDPKVWTMRGEVTAAQRQKNSALE 363

Query: 370 VDLDFEHNVRPPPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENK 429
           VDLDFEHNVRP PVITEEVTA+LE++I+ RI EG FD+VQK+    +KAPRE KELD+NK
Sbjct: 364 VDLDFEHNVRPAPVITEEVTASLEDLIKTRISEGLFDDVQKSRSLSSKAPRETKELDDNK 423

Query: 430 SKKGLGELYEEEYVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIE 489
           SKKGL E+YEEE+V+KT+ A AP SF+DE K EAS+LFKKLC KLDALSH+H+ PKPV+E
Sbjct: 424 SKKGLAEVYEEEFVQKTDPAAAPLSFSDELKKEASMLFKKLCLKLDALSHFHFTPKPVVE 483

Query: 490 DMSISTNVPALAMEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRK 549
           DMSI TNVPALAMEE+AP+AVSDAAMLAPEEVFAGKG+IKE AELT+S+RKRRRA+KKRK
Sbjct: 484 DMSIQTNVPALAMEEIAPMAVSDAAMLAPEEVFAGKGDIKEEAELTRSERKRRRANKKRK 543

Query: 550 YKAMVAKRDAKKSGNTTAPNANEG 553
           +KA  AKR AKK   TT  ++NEG
Sbjct: 544 FKAEAAKRLAKKPRETTQVDSNEG 552

BLAST of Csa1G002100 vs. TAIR10
Match: AT5G66540.1 (AT5G66540.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 577.8 bits (1488), Expect = 7.2e-165
Identity = 316/555 (56.94%), Postives = 398/555 (71.71%), Query Frame = 1

Query: 12  EAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPFDHLLVDGFDAE 71
           ++G + L  LK+T+PP++LAPS S+S+ AR ASQ LF  LKP NPK PFD L  DGFDAE
Sbjct: 6   DSGFEALEKLKATEPPVFLAPS-SISEDARSASQYLFMKLKPHNPKCPFDQLSSDGFDAE 65

Query: 72  QIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNL-----KVSLEDKKKVIQEMGVESGEE 131
           QIWQQID+QSQPLL S+R+++KRF KNPE I  L     KVS ED    I EM ++ G +
Sbjct: 66  QIWQQIDMQSQPLLTSLRQEVKRFAKNPEEIRKLGKLALKVSHEDD---IDEMDMD-GFD 125

Query: 132 SDDFEEDMKEL--------DEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEG 191
           SDD +++ KE+        DEEEEE+DEE+EEEEEE+ ++ +DGD E             
Sbjct: 126 SDDVDDEDKEIESNDSEGEDEEEEEEDEEEEEEEEEEEEEEKDGDNE------------- 185

Query: 192 EEGNGGIEDGFLKLKELEEFMEEDEVREYGLQ-KKKDGKKEKKPRKT---EEESDDDEDD 251
                GIED F K+KELEEF+EE E  EYG+  K K G  ++K +     E+E DDD+++
Sbjct: 186 -----GIEDKFFKIKELEEFLEEGEAEEYGIDHKNKKGVAQRKKQNLSDDEDEEDDDDEE 245

Query: 252 ELEEFDLH-GEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEEENE 311
           E  EFD   G +DE++ KL  ARY+DFFG KK+  ++   L+   E+E+ + G+E+    
Sbjct: 246 EDVEFDAFAGGDDEETDKLGKARYDDFFGGKKETKMKLKDLSEDEEAEIENKGNEK---- 305

Query: 312 AYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVDLDF 371
                    LSTH++   KLQS+IE MEKANL+PK WTMQGE+TAAKRP NSALEVDLDF
Sbjct: 306 ---------LSTHERARLKLQSKIEQMEKANLDPKHWTMQGEITAAKRPMNSALEVDLDF 365

Query: 372 EHNVRPPPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENKSKKGL 431
           EHN RP PVITEEVTA+LE++I+ RI+E RFD+VQ+AP+ PTK  RE KELDE+KSKKGL
Sbjct: 366 EHNARPAPVITEEVTASLEDLIKSRIIEARFDDVQRAPRLPTKGKREAKELDESKSKKGL 425

Query: 432 GELYEEEYVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIEDMSIS 491
            E+YE EY +K N A AP + +DE K EAS+LFKKLC KLDALSH+H+ PKPVIE+MSI 
Sbjct: 426 AEVYEAEYFQKANPAFAPTTHSDELKKEASMLFKKLCLKLDALSHFHFTPKPVIEEMSI- 485

Query: 492 TNVPALAMEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRKYKAMV 549
            NV A+AMEEVAPVAVSDAAMLAPEE+F+GKG+IK+ +ELTQ DRKRRRA+KKRK+KA  
Sbjct: 486 PNVSAIAMEEVAPVAVSDAAMLAPEEIFSGKGDIKDESELTQEDRKRRRANKKRKFKAES 523

BLAST of Csa1G002100 vs. TAIR10
Match: AT1G56660.1 (AT1G56660.1 unknown protein)

HSP 1 Score: 75.5 bits (184), Expect = 1.2e-13
Identity = 88/364 (24.18%), Postives = 160/364 (43.96%), Query Frame = 1

Query: 96  EKNPEAISNLKVSLEDK-KKVIQEMGVESGEESDDFEEDMKELDEEEEEDDEEDEEE--- 155
           EK  E +   K   + K KK   E G E   +  D E+  +++ +E+EE +EED ++   
Sbjct: 120 EKKHEELEEEKEGKKKKNKKEKDESGPEEKNKKADKEKKHEDVSQEKEELEEEDGKKNKK 179

Query: 156 --------EEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIEDGFLKLKELEEFMEEDE 215
                   EE+    +++   +E  K   D +V+G++  G  E G L+ ++ E+  E DE
Sbjct: 180 KEKDESGTEEKKKKPKKEKKQKEESKSNEDKKVKGKKEKG--EKGDLEKEDEEKKKEHDE 239

Query: 216 VREYGLQKKKDGKKEKKPRKTE---EESDDDEDDELEEFDLHGEEDEDSSKLDNARYEDF 275
             +    K+KD KK KK  K E   EE     D E +E D   E+++   K    + E  
Sbjct: 240 TDQE--MKEKDSKKNKKKEKDESCAEEKKKKPDKEKKEKDESTEKEDKKLKGKKGKGEK- 299

Query: 276 FGAKKKNHVRRNKLTNGSESELSD-SGDEEEENEAYTEPKSENLST------HQKKLKKL 335
              +K++  ++ K  + +E E+ D + D +E  +   + K++   T       ++   K 
Sbjct: 300 --PEKEDEGKKTKEHDATEQEMDDEAADHKEGKKKKNKDKAKKKETVIDEVCEKETKDKD 359

Query: 336 QSEIEMMEKANLEPKTWTMQGEVTAAK-RPKNSALEVDLDFEHNVRPPPVITEEVTATLE 395
             E E  +K N + +  + +GE    + + K + LE ++         P   ++     E
Sbjct: 360 DDEGETKQKKNKKKEKKSEKGEKDVKEDKKKENPLETEVMSRDIKLEEPEAEKKEEDDTE 419

Query: 396 EMIQKRILEGRFDE--------VQKAPKRPTKAPR----EIKELDENKSKKGLGELYEEE 425
           E  + ++  G  +E         +K  K+ TK P+    E ++ D++K  K  G   +EE
Sbjct: 420 EKKKSKVEGGESEEGKKKKKKDKKKNKKKDTKEPKMTEDEEEKKDDSKDVKIEGSKAKEE 476


HSP 2 Score: 64.7 bits (156), Expect = 2.1e-10
Identity = 69/280 (24.64%), Postives = 125/280 (44.64%), Query Frame = 1

Query: 93  KRFEKNPEAISNLKVSLEDKKKVIQEMGVESGEESDDFEEDMKELDEEEEEDDEEDEEEE 152
           K+ EK+ E+ +  K    DK+K  ++   E  ++    ++   E  E+E+E  +  E + 
Sbjct: 251 KKKEKD-ESCAEEKKKKPDKEKKEKDESTEKEDKKLKGKKGKGEKPEKEDEGKKTKEHDA 310

Query: 153 EEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIEDGFLKLKELEEFMEEDEVREYGLQKK 212
            E   D E  D +EG+K+K+ D+ + +E    + D   + +  ++  +E E ++     K
Sbjct: 311 TEQEMDDEAADHKEGKKKKNKDKAKKKE---TVIDEVCEKETKDKDDDEGETKQ-----K 370

Query: 213 KDGKKEKKPRKTEEESDDDE-------------DDELEEFDLHGEEDEDS-----SKLDN 272
           K+ KKEKK  K E++  +D+             D +LEE +   +E++D+     SK++ 
Sbjct: 371 KNKKKEKKSEKGEKDVKEDKKKENPLETEVMSRDIKLEEPEAEKKEEDDTEEKKKSKVEG 430

Query: 273 ARYEDFFGAKKKN-------HVRRNKLTNGSESELSDSGDEEEENEAYTEPKSE------ 332
              E+    KKK+         +  K+T   E +  DS D + E     E K +      
Sbjct: 431 GESEEGKKKKKKDKKKNKKKDTKEPKMTEDEEEKKDDSKDVKIEGSKAKEEKKDKDVKKK 490

Query: 333 ----NLSTHQKKLKKLQSEIE--MMEKANLEPKTWTMQGE 336
               ++   + KL K+  +I   M EKA +E +    +GE
Sbjct: 491 KGGNDIGKLKTKLAKIDEKIGALMEEKAEIENQIKDAEGE 521


HSP 3 Score: 62.8 bits (151), Expect = 7.8e-10
Identity = 77/329 (23.40%), Postives = 142/329 (43.16%), Query Frame = 1

Query: 96  EKNPEAISNLKVSLEDKKKVIQEMGVESGEES--DDFEEDM--------KELDEE--EEE 155
           +K+ E   +L+V   D K    E   + G+E   ++ EE+         KE DE   EE+
Sbjct: 90  KKHEEGHGDLEVKESDVKVEEHEKEHKKGKEKKHEELEEEKEGKKKKNKKEKDESGPEEK 149

Query: 156 DDEEDEEEEEEDCD-DREDGDTEEGEKEKSDDEVEGEEGNGGIEDGFLKLKELEEFMEED 215
           + + D+E++ ED   ++E+ + E+G+K K  ++ E      G E+   K K+ ++  EE 
Sbjct: 150 NKKADKEKKHEDVSQEKEELEEEDGKKNKKKEKDES-----GTEEKKKKPKKEKKQKEES 209

Query: 216 EVREYGLQKKKDGKKEKKPRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDNARYEDFFG 275
           +  E    KK  GKKEK  +   E+ D+++  E      H E D++  + D+        
Sbjct: 210 KSNE---DKKVKGKKEKGEKGDLEKEDEEKKKE------HDETDQEMKEKDS-------- 269

Query: 276 AKKKNHVRRNKLTNGSESELSDSGDEEEENEAYTEPKSENLSTHQKKLKKLQSEIEMMEK 335
              K + ++ K  + +E +      E++E +  TE + + L   + K +K + E E  + 
Sbjct: 270 ---KKNKKKEKDESCAEEKKKKPDKEKKEKDESTEKEDKKLKGKKGKGEKPEKEDEGKKT 329

Query: 336 ANLEPKTWTMQGEVTAAKRPKNSALEVDLDFEHNVRPPPVITEEVTATLEEMIQKRILEG 395
              +     M  E    K  K          +   +    + +EV    +E   K   EG
Sbjct: 330 KEHDATEQEMDDEAADHKEGKKK------KNKDKAKKKETVIDEVCE--KETKDKDDDEG 384

Query: 396 RFDEVQKAPKRPTKAPREIKELDENKSKK 412
              + +K  K+  K+ +  K++ E+K K+
Sbjct: 390 ETKQ-KKNKKKEKKSEKGEKDVKEDKKKE 384

BLAST of Csa1G002100 vs. TAIR10
Match: AT1G20920.1 (AT1G20920.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein)

HSP 1 Score: 74.3 bits (181), Expect = 2.6e-13
Identity = 81/344 (23.55%), Postives = 138/344 (40.12%), Query Frame = 1

Query: 69  DAEQIWQQIDLQSQPLLASVRRDLKRF-EKNPEAISNLKVSLEDKKKVIQEMGVESGEES 128
           D + + ++ DL+        RRD  R  E+  +  S  +   + +KK ++    E   + 
Sbjct: 12  DLDVVEEEADLKKS------RRDRDRSNERKKDKGSEKRREKDRRKKRVKSSDSEDDYDR 71

Query: 129 DDFEEDMKELDEEEEE-----------------DDEEDEEEEEEDCDDREDGDTEEGEKE 188
           DD EE  K  ++E E                   D ED+ EEE++ D R   + E G +E
Sbjct: 72  DDDEEREKRKEKERERRRRDKDRVKRRSERRKSSDSEDDVEEEDERDKRRVNEKERGHRE 131

Query: 189 KSDDEVEGEEGNGGIEDGFLKLKELE----------EFMEEDEVREYGLQKKKDGKKEKK 248
              D  +  + +   E+   K +E E          E  E++ V+E   ++++DG+++++
Sbjct: 132 HERDRGKDRKRDREREERKDKEREREKDRERREREREEREKERVKERERREREDGERDRR 191

Query: 249 PRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSES 308
            R+ E  S  + + E    ++  EE +D  K D  R     G ++K   R   +   S  
Sbjct: 192 EREKERGSRRNRERERSR-EVGNEESDDDVKRDLKRRRK-EGGERKEKEREKSVGRSSRH 251

Query: 309 ELSDSGDEEEENEAYTEPKS--ENLSTHQKK-----------------LKKLQSEIEMME 360
           E S      E+N    E K+  E L   QKK                 LK+ + E E   
Sbjct: 252 EDSPKRKSVEDNGEKKEKKTREEELEDEQKKLDEEVEKRRRRVQEWQELKRKKEEAESES 311


HSP 2 Score: 71.2 bits (173), Expect = 2.2e-12
Identity = 79/353 (22.38%), Postives = 143/353 (40.51%), Query Frame = 1

Query: 69  DAEQIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMGVESGEESD 128
           D + + ++ DL+        RRD  R  +  +   + K   +D++K      V+S +  D
Sbjct: 12  DLDVVEEEADLKKS------RRDRDRSNERKKDKGSEKRREKDRRK----KRVKSSDSED 71

Query: 129 DFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIEDG 188
           D++ D    D+EE E  +E E E      DR    +E  +   S+D+VE E+     E  
Sbjct: 72  DYDRD----DDEEREKRKEKERERRRRDKDRVKRRSERRKSSDSEDDVEEED-----ERD 131

Query: 189 FLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEESDDDEDDELEEFDLHGEEDED 248
             ++ E E    E E R+ G  +K+D ++E++  K  E   D E  E E  +    E E 
Sbjct: 132 KRRVNEKERGHREHE-RDRGKDRKRDREREERKDKEREREKDRERREREREE---REKER 191

Query: 249 SSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEEENEAYTEPKSENLSTHQK 308
             + +    ED    +++    R    N       + G+EE +++   + K       ++
Sbjct: 192 VKERERREREDGERDRREREKERGSRRNRERERSREVGNEESDDDVKRDLKRRRKEGGER 251

Query: 309 KLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVDLDFEHNVRPPPVITEEVT 368
           K K+ +  +    +    PK  +++      ++ +    E +L+ E              
Sbjct: 252 KEKEREKSVGRSSRHEDSPKRKSVEDN---GEKKEKKTREEELEDEQK------------ 311

Query: 369 ATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIK-ELDENKSKKGLGELYEEE 421
             L+E ++KR    R  E Q+  ++  +A  E K + D N+ K G     E E
Sbjct: 312 -KLDEEVEKR--RRRVQEWQELKRKKEEAESESKGDADGNEPKAGKAWTLEGE 323

BLAST of Csa1G002100 vs. TAIR10
Match: AT2G22795.1 (AT2G22795.1 unknown protein)

HSP 1 Score: 73.9 bits (180), Expect = 3.4e-13
Identity = 67/297 (22.56%), Postives = 129/297 (43.43%), Query Frame = 1

Query: 121 VESGEESDDFEEDMKELDE---EEEEDDEEDEEEEEEDCDDRE---DGDTEEGEK-EKSD 180
           V S EES   E + K+ +E   +EE  D E E +E+E+   +E   D +TE  EK E S 
Sbjct: 422 VSSQEESKGKESETKDKEESSSQEESKDRETETKEKEESSSQEETMDKETEAKEKVESSS 481

Query: 181 DEVEGEEGNGGIEDGFL---KLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEESDD 240
            E   ++    IE  FL   K KE E   +E+   +   ++K+   K+ +   ++EE+ D
Sbjct: 482 QEKNEDKETEKIESSFLEETKEKEDETKEKEESSSQEKTEEKETETKDNEESSSQEETKD 541

Query: 241 DEDDELEEFDLHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEE 300
            E++++E+ +   +E+   ++ +    E+    ++       K+      E  +S  +EE
Sbjct: 542 KENEKIEKEEASSQEESKENETETKEKEESSSQEETKEKENEKI------EKEESAPQEE 601

Query: 301 ENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVD 360
             E   E   +  S  Q++ K+ ++E +  E+++       +  E    ++ + +  + D
Sbjct: 602 TKEKENEKIEKEESASQEETKEKETETKEKEESSSNESQENVNTESEKKEQVEENEKKTD 661

Query: 361 LDFEHNVRPPPVITEE---VTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKEL 405
            D   + +   V   E      T E+    +  E    + Q      T  P+E+K++
Sbjct: 662 EDTSESSKENSVSDTEQKQSEETSEKEESNKNGETEVTQEQSDSSSDTNLPQEVKDV 712


HSP 2 Score: 73.2 bits (178), Expect = 5.8e-13
Identity = 60/280 (21.43%), Postives = 129/280 (46.07%), Query Frame = 1

Query: 90  RDLKRFEKNPEAISNLKVSLEDKKKVIQEMGVESGEESDDFEEDMKE---LDEEEEEDDE 149
           +D +   K  E  S+ + +++ + +  +++   S E+++D E +  E   L+E +E++DE
Sbjct: 448 KDRETETKEKEESSSQEETMDKETEAKEKVESSSQEKNEDKETEKIESSFLEETKEKEDE 507

Query: 150 ----------EDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIE--DGFLKLKE 209
                     E  EE+E +  D E+  ++E  K+K ++++E EE +   E  +   + KE
Sbjct: 508 TKEKEESSSQEKTEEKETETKDNEESSSQEETKDKENEKIEKEEASSQEESKENETETKE 567

Query: 210 LEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDN 269
            EE   ++E +E     K++ K EK+    +EE+ + E++++E+ +   +E+    + + 
Sbjct: 568 KEESSSQEETKE-----KENEKIEKEESAPQEETKEKENEKIEKEESASQEETKEKETET 627

Query: 270 ARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEEENEAYTEPKSENLSTHQKKLKKLQ 329
              E+       N  + N  T   + E  +  +++ + +     K  ++S  ++K  +  
Sbjct: 628 KEKEE----SSSNESQENVNTESEKKEQVEENEKKTDEDTSESSKENSVSDTEQKQSEET 687

Query: 330 SEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVDLDFE 355
           SE E   K N E +    Q + ++         +V  D E
Sbjct: 688 SEKEESNK-NGETEVTQEQSDSSSDTNLPQEVKDVRTDLE 717


HSP 3 Score: 72.4 bits (176), Expect = 9.9e-13
Identity = 92/328 (28.05%), Postives = 131/328 (39.94%), Query Frame = 1

Query: 110 EDKKKVIQEMGVESGE-ESDDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGE 169
           +DK+  I E G E+ E ES+    +     E EE+ D    EE E +      G TEE E
Sbjct: 97  DDKENEIVEGGEENKEKESEGIVSNEDSNSEIEEKKDSGGVEESEVEEKRDNGGGTEENE 156

Query: 170 KEKSDD-EVEGEEGNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEE 229
           K  +++ EVE  + NGG E+        +   EE EV E    +K +G  E+  +   EE
Sbjct: 157 KSGTEESEVEERKDNGGTEE------NEKSGTEESEVEE----RKDNGGTEENEKSGTEE 216

Query: 230 SDDDEDDE---LEEFDLHG-EEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELS 289
           S+ +E  E    EE +  G EE E   K DN   E+             + +   ESE+ 
Sbjct: 217 SEVEERKENGGTEENEKSGSEESEVEEKKDNGGTEE-----------SREKSGTEESEVE 276

Query: 290 DSGDEEEENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEP------KTWTMQGEVT 349
           +  D     E+  E K EN    + +  K   E ++ EKAN+E       K      EV 
Sbjct: 277 EKKDNGSSEESEVEEKKENRGIDESEESK---EKDIDEKANIEEARENNYKGDDASSEVV 336

Query: 350 AAKRPKNSALEVDLDFEHNVRPPPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKA 409
                K S  E     E       + TEEV    E+ + K +L    D  + +    +  
Sbjct: 337 HESEEKTSESENSEKVEDK---SGIKTEEV----EDSVIKSVLPNTTDNGESSSDEKSTG 393

Query: 410 PREIKELDENKSKKGLGELYEE-EYVEK 425
                E D  +  K  GE  E+ E +EK
Sbjct: 397 SSSGHESDSLEGIKSEGESMEKNELLEK 393


HSP 4 Score: 65.1 bits (157), Expect = 1.6e-10
Identity = 85/363 (23.42%), Postives = 153/363 (42.15%), Query Frame = 1

Query: 110 EDKKKVIQEMGVESGEESDDFEEDMKELDEEEE----------EDDEEDEEEEEEDCDDR 169
           E++K   +E  VE  +++   EE+ K   EE E          E++E+   EE E  + +
Sbjct: 176 ENEKSGTEESEVEERKDNGGTEENEKSGTEESEVEERKENGGTEENEKSGSEESEVEEKK 235

Query: 170 EDGDTEEGEKEKS---DDEVEGEEGNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGK 229
           ++G TEE  +EKS   + EVE ++ NG  E+      E+EE  E   + E    K+KD  
Sbjct: 236 DNGGTEE-SREKSGTEESEVEEKKDNGSSEE-----SEVEEKKENRGIDESEESKEKD-- 295

Query: 230 KEKKPRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDNARYEDFFGAKK---KNHVRRNK 289
            ++K    E   ++ + D+     +H  E++ S   ++ + ED  G K    ++ V ++ 
Sbjct: 296 IDEKANIEEARENNYKGDDASSEVVHESEEKTSESENSEKVEDKSGIKTEEVEDSVIKSV 355

Query: 290 LTNGSESELSDSGDEEEENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWT-M 349
           L N +++  S S ++   + +  E  S         L+ ++SE E MEK  L  K +   
Sbjct: 356 LPNTTDNGESSSDEKSTGSSSGHESDS---------LEGIKSEGESMEKNELLEKEFNDS 415

Query: 350 QGEVTAAKRPKNS-------ALEVDLDFEHNVRPPPVITEEVTATLE-------EMIQKR 409
            GE +   +   S         EV    E   +      +E +++ E       E  +K 
Sbjct: 416 NGESSVTGKSTGSGDGGSQETSEVSSQEESKGKESETKDKEESSSQEESKDRETETKEKE 475

Query: 410 ILEGRFDEVQKAPKRPTKAPREIKELDENKSKKGLGELYEEEYVEKTNLATAPPSFTDEA 442
               + + + K  +   K     +E +E+K  + +   + EE  EK +        + + 
Sbjct: 476 ESSSQEETMDKETEAKEKVESSSQEKNEDKETEKIESSFLEETKEKEDETKEKEESSSQE 521


HSP 5 Score: 63.5 bits (153), Expect = 4.6e-10
Identity = 99/473 (20.93%), Postives = 184/473 (38.90%), Query Frame = 1

Query: 96  EKNPEAISNLKVSLEDKKKVIQEMGVESGEESDDFEE----------------DMKELDE 155
           E+N E  S   VS ED    I+E     G E  + EE                +  E++E
Sbjct: 108 EENKEKESEGIVSNEDSNSEIEEKKDSGGVEESEVEEKRDNGGGTEENEKSGTEESEVEE 167

Query: 156 EEE----EDDEEDEEEEEEDCDDREDGDTEEGEKEKSDD-EVEGEEGNGGIEDGFLKLKE 215
            ++    E++E+   EE E  + +++G TEE EK  +++ EVE  + NGG E+       
Sbjct: 168 RKDNGGTEENEKSGTEESEVEERKDNGGTEENEKSGTEESEVEERKENGGTEE------N 227

Query: 216 LEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDN 275
            +   EE EV E   +K   G +E + +   EES+ +E  +    +   EE E   K +N
Sbjct: 228 EKSGSEESEVEE---KKDNGGTEESREKSGTEESEVEEKKD----NGSSEESEVEEKKEN 287

Query: 276 ARYEDFFGAKKKNHVRRNKLTNGSESEL--SDSGDEEEENEAYTEPKSENLSTHQKKLKK 335
              ++   +K+K+   +  +    E+     D+  E          +SEN    + K   
Sbjct: 288 RGIDESEESKEKDIDEKANIEEARENNYKGDDASSEVVHESEEKTSESENSEKVEDKSGI 347

Query: 336 LQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVDLDFEHNVRPPPVITEEVTATLE 395
              E+E     ++ P T T  GE ++ ++   S+   + D    ++     +E  +    
Sbjct: 348 KTEEVEDSVIKSVLPNT-TDNGESSSDEKSTGSSSGHESDSLEGIK-----SEGESMEKN 407

Query: 396 EMIQKRILE--GRFDEVQKAPKRPTKAPREIKELDENKSKKGLGELYEEEYVEKTNLATA 455
           E+++K   +  G      K+        +E  E+   +  KG     E E  +K   ++ 
Sbjct: 408 ELLEKEFNDSNGESSVTGKSTGSGDGGSQETSEVSSQEESKG----KESETKDKEESSSQ 467

Query: 456 PPSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIEDMSISTNVPALAMEEVAPVAVS 515
             S   E +T+     K+  S  +         K  +E  S   N      E+     + 
Sbjct: 468 EESKDRETETKE----KEESSSQEETMDKETEAKEKVESSSQEKN------EDKETEKIE 527

Query: 516 DAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRKYKAMVAKRDAKKSGN 544
            + +   EE    + E KE  E +  ++   + ++ +  +   ++ + K   N
Sbjct: 528 SSFL---EETKEKEDETKEKEESSSQEKTEEKETETKDNEESSSQEETKDKEN 544

BLAST of Csa1G002100 vs. TAIR10
Match: AT3G14670.1 (AT3G14670.1 unknown protein)

HSP 1 Score: 72.4 bits (176), Expect = 9.9e-13
Identity = 52/179 (29.05%), Postives = 85/179 (47.49%), Query Frame = 1

Query: 96  EKNPEAISNLKVSLEDKKKVIQEMGVESGEESDDFEEDMKELDEEEEEDDEEDEEEEEED 155
           ++NP  I    V  E    +I  + VE GE+SD+ EE+  E DE+EE ++EE EEEE+  
Sbjct: 44  KQNPVVIEGRGVEEEQIPTIITTV-VEEGEKSDNNEEENSEKDEKEESEEEESEEEEK-- 103

Query: 156 CDDREDGDTEEGEKEKSDDEVEGEEGNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDG 215
                    EE EKE+ + E EG    GG  D   +    E   +E+   E  + K+ D 
Sbjct: 104 ---------EEEEKEEEEKEEEGNVAGGGSSDDSSRTLGKESSSDENMDDETAVGKQVDI 163

Query: 216 KKEKKPRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKL 275
               K  +  +E+D D  ++  + +  G++++D  + D    +     K+K+  RR ++
Sbjct: 164 PAAMKINEMGQENDGDPKEKDGDLEKDGDQEKDPKEKDPKEKD----PKEKDPKRRTRM 206


HSP 2 Score: 46.6 bits (109), Expect = 5.8e-05
Identity = 41/158 (25.95%), Postives = 75/158 (47.47%), Query Frame = 1

Query: 80  QSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMGVESGEESDDFEEDMKELDE 139
           Q   ++ +V  + ++ + N E  S      +D+K+  +E      EES++     +E +E
Sbjct: 59  QIPTIITTVVEEGEKSDNNEEENSE-----KDEKEESEE------EESEE-----EEKEE 118

Query: 140 EEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIED--GFLKLKELEE 199
           EE+E++E++EE         +D     G++  SD+ ++ E   G   D    +K+ E+ +
Sbjct: 119 EEKEEEEKEEEGNVAGGGSSDDSSRTLGKESSSDENMDDETAVGKQVDIPAAMKINEMGQ 178

Query: 200 FMEEDEVREYGLQKKKDGKKEKKPR-KTEEESDDDEDD 235
             + D   + G   +KDG +EK P+ K  +E D  E D
Sbjct: 179 ENDGDPKEKDG-DLEKDGDQEKDPKEKDPKEKDPKEKD 199


HSP 3 Score: 32.7 bits (73), Expect = 8.7e-01
Identity = 23/74 (31.08%), Postives = 33/74 (44.59%), Query Frame = 1

Query: 218 EKKPRKTEEESDDDEDDELEEFDLHGEEDEDSSKLDNARYED--FFGAKKKNHVRRNKLT 277
           EK     EE S+ DE +E EE +   EE E+  K +  + E+    G    +   R   T
Sbjct: 72  EKSDNNEEENSEKDEKEESEEEESEEEEKEEEEKEEEEKEEEGNVAGGGSSDDSSR---T 131

Query: 278 NGSESELSDSGDEE 290
            G ES   ++ D+E
Sbjct: 132 LGKESSSDENMDDE 142

BLAST of Csa1G002100 vs. NCBI nr
Match: gi|778655345|ref|XP_011651882.1| (PREDICTED: U3 small nucleolar ribonucleoprotein protein MPP10 [Cucumis sativus])

HSP 1 Score: 1095.5 bits (2832), Expect = 0.0e+00
Identity = 553/553 (100.00%), Postives = 553/553 (100.00%), Query Frame = 1

Query: 1   MEAGKVVLPNIEAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPF 60
           MEAGKVVLPNIEAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPF
Sbjct: 1   MEAGKVVLPNIEAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPF 60

Query: 61  DHLLVDGFDAEQIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMG 120
           DHLLVDGFDAEQIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMG
Sbjct: 61  DHLLVDGFDAEQIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMG 120

Query: 121 VESGEESDDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEE 180
           VESGEESDDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEE
Sbjct: 121 VESGEESDDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEE 180

Query: 181 GNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEESDDDEDDELEEFD 240
           GNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEESDDDEDDELEEFD
Sbjct: 181 GNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEESDDDEDDELEEFD 240

Query: 241 LHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEEENEAYTEPKS 300
           LHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEEENEAYTEPKS
Sbjct: 241 LHGEEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEEENEAYTEPKS 300

Query: 301 ENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVDLDFEHNVRPP 360
           ENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVDLDFEHNVRPP
Sbjct: 301 ENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVDLDFEHNVRPP 360

Query: 361 PVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENKSKKGLGELYEEE 420
           PVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENKSKKGLGELYEEE
Sbjct: 361 PVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENKSKKGLGELYEEE 420

Query: 421 YVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIEDMSISTNVPALA 480
           YVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIEDMSISTNVPALA
Sbjct: 421 YVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIEDMSISTNVPALA 480

Query: 481 MEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRKYKAMVAKRDAKK 540
           MEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRKYKAMVAKRDAKK
Sbjct: 481 MEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRKYKAMVAKRDAKK 540

Query: 541 SGNTTAPNANEGQ 554
           SGNTTAPNANEGQ
Sbjct: 541 SGNTTAPNANEGQ 553

BLAST of Csa1G002100 vs. NCBI nr
Match: gi|659128932|ref|XP_008464444.1| (PREDICTED: U3 small nucleolar ribonucleoprotein protein MPP10 [Cucumis melo])

HSP 1 Score: 1040.4 bits (2689), Expect = 1.1e-300
Identity = 530/554 (95.67%), Postives = 538/554 (97.11%), Query Frame = 1

Query: 1   MEAGKVVLPNIEAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPF 60
           MEA KVVLPN EAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPF
Sbjct: 1   MEAEKVVLPNTEAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPF 60

Query: 61  DHLLVDGFDAEQIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMG 120
           DHLLVDGFDAEQIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEM 
Sbjct: 61  DHLLVDGFDAEQIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMA 120

Query: 121 VESGEESDDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEE 180
           +ESGEESDDFEEDMKELDEEE   D+E+EEEEEEDCDD+EDGDTEEGEKEKSDDEVEGEE
Sbjct: 121 IESGEESDDFEEDMKELDEEE---DDEEEEEEEEDCDDKEDGDTEEGEKEKSDDEVEGEE 180

Query: 181 GNGGIEDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKTEEESDDDEDDELEEFD 240
           GNGGIEDGFLKLKELEEFMEEDEVREYGLQ KKDGKKEKK RKTEEES+DDEDDEL EFD
Sbjct: 181 GNGGIEDGFLKLKELEEFMEEDEVREYGLQNKKDGKKEKKQRKTEEESEDDEDDELGEFD 240

Query: 241 LHGEEDEDSSKLDNARYEDFFGAKKKNHVRR-NKLTNGSESELSDSGDEEEENEAYTEPK 300
           LHGEEDEDSSKLD ARYEDFFGAKKKNH+RR +KLTNGSESELSDSGDEEEENE  TEPK
Sbjct: 241 LHGEEDEDSSKLDKARYEDFFGAKKKNHLRRKSKLTNGSESELSDSGDEEEENETRTEPK 300

Query: 301 SENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVDLDFEHNVRP 360
           SENLSTHQK+LKKLQSEIEMMEKANLEPKTWTMQGEVTA KRPKNSALEVDLDFEHNVRP
Sbjct: 301 SENLSTHQKRLKKLQSEIEMMEKANLEPKTWTMQGEVTAMKRPKNSALEVDLDFEHNVRP 360

Query: 361 PPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENKSKKGLGELYEE 420
           PPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENKSKKGLGELYEE
Sbjct: 361 PPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENKSKKGLGELYEE 420

Query: 421 EYVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIEDMSISTNVPAL 480
           EYVEKTNLATAP SFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIEDMSISTNVPAL
Sbjct: 421 EYVEKTNLATAPSSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIEDMSISTNVPAL 480

Query: 481 AMEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRKYKAMVAKRDAK 540
           AMEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRKYKAMVAKRDAK
Sbjct: 481 AMEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRKYKAMVAKRDAK 540

Query: 541 KSGNTTAPNANEGQ 554
           KSGNTT PNANEGQ
Sbjct: 541 KSGNTTVPNANEGQ 551

BLAST of Csa1G002100 vs. NCBI nr
Match: gi|661879347|emb|CDP16910.1| (unnamed protein product [Coffea canephora])

HSP 1 Score: 654.1 bits (1686), Expect = 2.2e-184
Identity = 339/563 (60.21%), Postives = 435/563 (77.26%), Query Frame = 1

Query: 9   PNIEAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPFDHLLVDGF 68
           P   AG + L  LKSTDPPL+L PS  LS+ ARLAS+ LFS LKP+ PKSPF HLLV+GF
Sbjct: 12  PETAAGREALRKLKSTDPPLYLCPSEDLSKAARLASEYLFSSLKPYTPKSPFSHLLVNGF 71

Query: 69  DAEQIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNL-KVSLEDKKKVIQEMG-VESGEE 128
           DAEQIWQQIDLQ+QPL++S+RR +K+FEKNPE I NL  +  +DKK    ++G + +G+E
Sbjct: 72  DAEQIWQQIDLQTQPLISSLRRQVKKFEKNPEEIRNLFNLGEKDKKNDGNDVGTISNGDE 131

Query: 129 SDDFEEDMKELDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGIE 188
             + EE   E ++EE E D++D+EEEEE+ ++ E+G+  EG++E        + G GG+E
Sbjct: 132 KMENEEGFDEFEDEEMEGDDDDDEEEEEEEEEEEEGNAHEGKEE--------DGGGGGVE 191

Query: 189 DGFLKLKELEEFMEEDEVREYGLQK-KKDGKKEKKPRKT----EEESDDDEDDELEEFDL 248
           D FLK+KELEE++EEDE REYGL+K KK GK++++        +E+ DDDED+E E+ +L
Sbjct: 192 DKFLKIKELEEYLEEDEAREYGLKKEKKQGKRDQEDDNDKEGDDEDEDDDEDEEEEDDEL 251

Query: 249 H------GEEDEDSSKLDNARYEDFFGAK-KKNHVRRNKLTNGSES-----ELSDSGDEE 308
                  GE+DE   +L+NARYEDFFG K K    R++KL +GS++     ELS+   ++
Sbjct: 252 GVMGVDVGEDDESGEELENARYEDFFGGKGKMGQKRKSKLLHGSDNLDMDGELSEESSDD 311

Query: 309 EENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEV 368
           E+N++    K + LSTH+K+L+KL+S IE MEKANLEPKTWTMQGEVTAAKRPKNSALEV
Sbjct: 312 EKNQSQ---KKQKLSTHEKELEKLRSTIEQMEKANLEPKTWTMQGEVTAAKRPKNSALEV 371

Query: 369 DLDFEHNVRPPPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENKS 428
           DLDFEHNVRP PVITEEVTA+LEE+IQKRILEGRFD+VQK P  P++APRE+KELDENKS
Sbjct: 372 DLDFEHNVRPAPVITEEVTASLEELIQKRILEGRFDDVQKPPTLPSRAPREVKELDENKS 431

Query: 429 KKGLGELYEEEYVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIED 488
           KKGL E+YEEEYV+K+ L +   S +D+ K EA +LFK+LC KLDALSH+H+ PKPVIED
Sbjct: 432 KKGLAEIYEEEYVQKSGLVSTAVSISDQQKKEAGMLFKELCLKLDALSHFHFTPKPVIED 491

Query: 489 MSISTNVPALAMEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRKY 548
           +SI  NVPALAMEE+AP+AVSDAAMLAPEEVF GKG+IKE +ELTQ++RKRRRA KKRK+
Sbjct: 492 LSIQANVPALAMEEIAPLAVSDAAMLAPEEVFTGKGDIKEDSELTQAERKRRRAKKKRKF 551

Query: 549 KAMVAKRDAKKSGNTTAPNANEG 553
           KA+ AKR A K+ +++  N  +G
Sbjct: 552 KAVAAKRMANKAQHSSLQNRVDG 563

BLAST of Csa1G002100 vs. NCBI nr
Match: gi|703152292|ref|XP_010110362.1| (hypothetical protein L484_002912 [Morus notabilis])

HSP 1 Score: 651.0 bits (1678), Expect = 1.9e-183
Identity = 351/570 (61.58%), Postives = 434/570 (76.14%), Query Frame = 1

Query: 12  EAGLKPLHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPFDHLLVDGFDAE 71
           +AG + LH LK+TDPP WLAPS +LSQ AR ASQ LFS L+P+ PK+PF+ LLV+GFDAE
Sbjct: 6   DAGSEALHRLKTTDPPQWLAPSAALSQTARAASQHLFSSLRPYAPKTPFEQLLVEGFDAE 65

Query: 72  QIWQQIDLQSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMGVESGEE----- 131
           QIWQQ+DLQSQPLL+S+RR+LKRFEK+PE I  L+V++E K++   E G+E   E     
Sbjct: 66  QIWQQLDLQSQPLLSSIRRELKRFEKDPEEIRKLRVAMEGKER---EKGLEDEVEKMEVE 125

Query: 132 SDDFEEDMKELDEEEEEDDE-EDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEGNGGI 191
           SDDF ED+++L+EEEE+  + E +E+E+ED D+ ED   EEGE        EG EG GGI
Sbjct: 126 SDDFGEDLEDLEEEEEQQQQKEGKEDEDEDEDEDED---EEGE--------EGNEG-GGI 185

Query: 192 EDGFLKLKELEEFMEEDEVREYGLQKKKDGKKEK----------KPRKTEEESDDDEDD- 251
           ED FLK+K+LE+F++EDE REYG  KKKD KK+K          +  + EEE+ DDED+ 
Sbjct: 186 EDRFLKMKDLEKFLKEDEEREYG-SKKKDKKKKKVDDGDQEENTEKEEEEEEAGDDEDEN 245

Query: 252 -----------ELEEFDLHGEEDEDSSKLDNARYEDFFGAKKKN-HVRRNKLTNGSE-SE 311
                      E+ EF L  +EDED+ +L NARYEDFFG KKK    +++KLT   E S 
Sbjct: 246 GDGEDEDEDSEEMGEFGLDDDEDEDTDELGNARYEDFFGGKKKKPSKKKSKLTGDLEGSG 305

Query: 312 LSDSGDEEEENEAYTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKR 371
           + D G+++E        K + LSTH+K+L K + +IE MEK+NLEPKTWTMQGEVTAAKR
Sbjct: 306 MDDDGEDDE--------KQQTLSTHEKELAKRKLKIEQMEKSNLEPKTWTMQGEVTAAKR 365

Query: 372 PKNSALEVDLDFEHNVRPPPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREI 431
           PKNSALEVDLDFEHN+RPPPVITEEVTA+LE++I+KRILEG FD+V+K P  P+KAPRE+
Sbjct: 366 PKNSALEVDLDFEHNMRPPPVITEEVTASLEDIIKKRILEGHFDDVEKPPALPSKAPREV 425

Query: 432 KELDENKSKKGLGELYEEEYVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHY 491
           KELDENKSKKGL E+YEEEY +KTNLA+ P SF +E K EAS+LFKKLC KLDALSH+H+
Sbjct: 426 KELDENKSKKGLAEIYEEEYAQKTNLASMPLSFAEEEKKEASMLFKKLCLKLDALSHFHF 485

Query: 492 APKPVIEDMSISTNVPALAMEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRR 551
            PKPVIEDMSI  NVPALAMEE+AP+AVSDAAMLAPEEVFAGKG+IKE +ELTQ++RKRR
Sbjct: 486 TPKPVIEDMSIQANVPALAMEEIAPLAVSDAAMLAPEEVFAGKGDIKEESELTQAERKRR 545

BLAST of Csa1G002100 vs. NCBI nr
Match: gi|567894710|ref|XP_006439843.1| (hypothetical protein CICLE_v10019513mg [Citrus clementina])

HSP 1 Score: 649.4 bits (1674), Expect = 5.5e-183
Identity = 338/560 (60.36%), Postives = 435/560 (77.68%), Query Frame = 1

Query: 18  LHSLKSTDPPLWLAPSPSLSQVARLASQSLFSMLKPFNPKSPFDHLLVDGFDAEQIWQQI 77
           LH LKST+PP+WLAP   LS+ AR AS+ +FS L+P+ PKSP D LL++GFDAEQIWQQI
Sbjct: 14  LHRLKSTEPPVWLAPRAELSETARKASKIIFSYLRPYAPKSPLDQLLIEGFDAEQIWQQI 73

Query: 78  DLQSQPLLASVRRDLKRFEKNPEAISNLKVSLEDKKKVIQEMG-VESGE------ESDDF 137
           DLQSQPLL+S++R++K+FEK PE I  ++  LE +KKV++ +G V  GE      ESDDF
Sbjct: 74  DLQSQPLLSSLKREVKKFEKKPEEIGKIREVLEGEKKVVESVGKVLEGERRVLAVESDDF 133

Query: 138 EEDMKE-LDEEEEEDDEEDEEEEEEDCDDREDGDTEEGEKEKSDDEVEGEEG-NGGIEDG 197
           +ED+ + LD+++++DD++DEEEEEE     E+ +  EGE+EK     +G+ G  GGIED 
Sbjct: 134 DEDLDDGLDDDDDDDDDDDEEEEEE-----EEVEGSEGEEEK-----KGKGGPEGGIEDE 193

Query: 198 FLKLKELEEFMEEDEVREYGLQKKKDGKKEKKPRKT--------EEESDDDEDDELEEFD 257
           FLK+ EL+E++EEDE REYGL+K  +  ++K  R+         E+E +D+ED++ EE  
Sbjct: 194 FLKINELQEYLEEDEAREYGLKKDSNDSRKKGGRRVLDNEEDEDEDEDEDEEDEDEEELG 253

Query: 258 LHG------EEDEDSSKLDNARYEDFFGAKKKNHVRRNKLTNGSESELSDSGDEEEENEA 317
           + G      EEDE   KL+NA YEDFFG+K+K   ++N  +     E S   DEE+E+EA
Sbjct: 254 VFGDSDDNEEEDEHRQKLENAGYEDFFGSKRKKAPKKNLKSTEELEEDSGLDDEEDEDEA 313

Query: 318 YTEPKSENLSTHQKKLKKLQSEIEMMEKANLEPKTWTMQGEVTAAKRPKNSALEVDLDFE 377
               K++NLSTH+K+ ++L++EIE MEKANL+PKTWTMQGEVTAA+RPKNSALEVDLDF+
Sbjct: 314 VETKKNDNLSTHEKQSEQLRAEIEKMEKANLDPKTWTMQGEVTAAQRPKNSALEVDLDFQ 373

Query: 378 HNVRPPPVITEEVTATLEEMIQKRILEGRFDEVQKAPKRPTKAPREIKELDENKSKKGLG 437
           HNVRP PVITEE TA+LEEMI+KRI+EG+FD+V+KA   P+KAPRE+KELDENKSKKGL 
Sbjct: 374 HNVRPAPVITEEFTASLEEMIKKRIIEGQFDDVEKAASLPSKAPRELKELDENKSKKGLA 433

Query: 438 ELYEEEYVEKTNLATAPPSFTDEAKTEASILFKKLCSKLDALSHYHYAPKPVIEDMSIST 497
           E+YEEEYV+KTN A AP +F+DE K EAS+LFKKLC KLDALSH+H+ PKPVIEDMSI  
Sbjct: 434 EVYEEEYVQKTNPAAAPLTFSDEQKKEASMLFKKLCLKLDALSHFHFTPKPVIEDMSIQA 493

Query: 498 NVPALAMEEVAPVAVSDAAMLAPEEVFAGKGEIKEAAELTQSDRKRRRASKKRKYKAMVA 554
           NVPALAMEE+APVAVSDAAMLAPEEVFAG+G++KE AELT+++RKRRRASKKRK+KA   
Sbjct: 494 NVPALAMEEIAPVAVSDAAMLAPEEVFAGRGDVKEEAELTKAERKRRRASKKRKFKAEAT 553

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
RTL1_MOUSE5.8e-2332.00Retrotransposon-like protein 1 OS=Mus musculus GN=Rtl1 PE=2 SV=1[more]
YCF2_OENAR1.5e-1526.76Protein Ycf2 OS=Oenothera argillicola GN=ycf2-A PE=3 SV=1[more]
ABCF4_DICDI7.6e-1525.45ABC transporter F family member 4 OS=Dictyostelium discoideum GN=abcF4 PE=3 SV=1[more]
AN32B_MOUSE6.4e-1436.80Acidic leucine-rich nuclear phosphoprotein 32 family member B OS=Mus musculus GN... [more]
AN32B_RAT6.4e-1436.80Acidic leucine-rich nuclear phosphoprotein 32 family member B OS=Rattus norvegic... [more]
Match NameE-valueIdentityDescription
W9S508_9ROSA1.3e-18361.58U3 small nucleolar ribonucleoprotein protein MPP10 OS=Morus notabilis GN=L484_00... [more]
V4SYS7_9ROSI3.9e-18360.36U3 small nucleolar ribonucleoprotein protein MPP10 OS=Citrus clementina GN=CICLE... [more]
A0A061FTN6_THECC6.8e-18059.82U3 small nucleolar ribonucleoprotein protein MPP10 OS=Theobroma cacao GN=TCM_012... [more]
A0A061FU83_THECC8.9e-18059.93U3 small nucleolar ribonucleoprotein protein MPP10 OS=Theobroma cacao GN=TCM_012... [more]
A0A061FUL2_THECC8.9e-18059.93U3 small nucleolar ribonucleoprotein protein MPP10 OS=Theobroma cacao GN=TCM_012... [more]
Match NameE-valueIdentityDescription
AT5G66540.17.2e-16556.94 FUNCTIONS IN: molecular_function unknown[more]
AT1G56660.11.2e-1324.18 unknown protein[more]
AT1G20920.12.6e-1323.55 P-loop containing nucleoside triphosphate hydrolases superfamily pro... [more]
AT2G22795.13.4e-1322.56 unknown protein[more]
AT3G14670.19.9e-1329.05 unknown protein[more]
Match NameE-valueIdentityDescription
gi|778655345|ref|XP_011651882.1|0.0e+00100.00PREDICTED: U3 small nucleolar ribonucleoprotein protein MPP10 [Cucumis sativus][more]
gi|659128932|ref|XP_008464444.1|1.1e-30095.67PREDICTED: U3 small nucleolar ribonucleoprotein protein MPP10 [Cucumis melo][more]
gi|661879347|emb|CDP16910.1|2.2e-18460.21unnamed protein product [Coffea canephora][more]
gi|703152292|ref|XP_010110362.1|1.9e-18361.58hypothetical protein L484_002912 [Morus notabilis][more]
gi|567894710|ref|XP_006439843.1|5.5e-18360.36hypothetical protein CICLE_v10019513mg [Citrus clementina][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR012173Mpp10
Vocabulary: Cellular Component
TermDefinition
GO:0005634nucleus
GO:0005732small nucleolar ribonucleoprotein complex
GO:0034457Mpp10 complex
Vocabulary: Biological Process
TermDefinition
GO:0006364rRNA processing
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006606 protein import into nucleus
biological_process GO:0006364 rRNA processing
biological_process GO:0030490 maturation of SSU-rRNA
cellular_component GO:0005829 cytosol
cellular_component GO:0034457 Mpp10 complex
cellular_component GO:0005732 small nucleolar ribonucleoprotein complex
cellular_component GO:0019013 viral nucleocapsid
cellular_component GO:0005634 nucleus
cellular_component GO:0032040 small-subunit processome
molecular_function GO:0003674 molecular_function
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
CU093076cucumber EST collection version 3.0transcribed_cluster
CU133847cucumber EST collection version 3.0transcribed_cluster
CU137539cucumber EST collection version 3.0transcribed_cluster
CU161862cucumber EST collection version 3.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa1G002100.1Csa1G002100.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
CU093076CU093076transcribed_cluster
CU137539CU137539transcribed_cluster
CU133847CU133847transcribed_cluster
CU161862CU161862transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR012173U3 small nucleolar ribonucleoprotein complex, subunit Mpp10PIRPIRSF017300snoRNP_Mpp10coord: 28..551
score: 7.9E
IPR012173U3 small nucleolar ribonucleoprotein complex, subunit Mpp10PANTHERPTHR17039U3 SMALL NUCLEOLAR RIBONUCLEOPROTEIN PROTEIN MPP10coord: 10..540
score: 1.4E
IPR012173U3 small nucleolar ribonucleoprotein complex, subunit Mpp10PFAMPF04006Mpp10coord: 56..547
score: 2.3E
NoneNo IPR availableunknownCoilCoilcoord: 303..323
score: -coord: 127..155
scor

The following gene(s) are paralogous to this gene:

None