Dustin Sallings's blog

April 16, 2012

Incremental Mapreduce for Analytics with R

[This post also appears on Dustin's github blog.]

I've been wanting to describe some of my work with using R to help me understand data I'm collecting in Couchbase Server because I find it quite interesting, useful and easy. However, it's been difficult for me to figure out a good starting point because I don't know who the audience would be. That is, finding the right set of assumptions to get going has been quite hard.

Read more »

June 7, 2011

New Operations in Membase

[This post also appears on Dustin's github blog.]

We built a couple of new protocol operations for people building applications. The general goal of adding an operation is to keep it orthogonal to other commands while enhancing the functionality in a way that lets you do things that couldn’t be done before, or at least were common and difficult to do efficiently.

Here is a description of the new commands and an idea of how they might be used.

Read more »

October 28, 2010

How to Test Everything

[This post also appears on Dustin's github blog].

I recently had a Membase user point out a sequence of operations that led to an undesirable state. I’ve got a lot of really good engine tests I’ve written, but not this case:

Read more »

August 7, 2010

Memcached security

Memcached security is a hot topic since the sensepost guys released go-derper at blackhat.

The presentation was pretty good and informative, but it seems like the hype around it has left a bunch of people confused. Although much of this was covered in the presentation, it needs to be restated as much as possible.

Read more »

June 28, 2010

Scaling memcached with vbuckets

For years, people have used memcached to scale large sites. Originally, there was a simple modulo selection hash algorithm that was used. It still is used quite a bit actually and it’s quite easy to understand (although, it’s shown regularly that some people don’t truly understand it when applied to their full system). The algorithm is basically this:

Read more »

April 9, 2010

What We're Doing in Memcached

We’ve been steadily hacking on memcached. We think it’s going very well, but we do want to make sure everybody who cares has the opportunity to see what’s going on behind the proverbial curtain. The basic theme is to build a platform that allows a company to solve its scaling problems without preventing you from solving your own. Extensibility The biggest thing we’ve been working on is getting the storage engine interface really solid. Trond has been thinking about this for two years and did an excellent presentation on an application of it at last year’s MySQL User Conference.

Read more »

March 16, 2010

How I Learned to Stop Worrying and Love Dynamically Loadable Modular Engines

Memcached Keeps You Wanting More

Memcached is a pretty simple system with pretty simple semantics. Many users have wished for just a little more functionality than provided out of the box which has led to several forks and related projects.

To accommodate what are really just minimal differences, lots of projects have spun up as either forks of memcached, or entirely new projects

Read more »