Category: systems

MAR 18 2012

Elasticsearch Space-savers

After [setting up](/real-time-search-with-mongodb-and-elasticsearch/) ElasticSearch, you'll be faced with the task of optimizing your index configuration for speed and for size. There are millions of documents in our index, and, for performance, it’s important that all of that be kept in memory. As a result, index size is pretty important. We...
MAR 15 2012

Real-time Search with MongoDB and Elasticsearch

Something I worked on a couple of weeks ago at Stripe was overhauling the entire search infrastructure. If you’ve ever used the search feature in manage, it may have appeared sluggish or may have even timed out on you - this is mostly due to the fact that our search...
DEC 02 2011

Paxos/Multi-paxos Algorithm

A [couple of posts ago](/eventual-consistency/), I talked about the distributed systems programming class I was taking here in my last semester at Berkeley. Our projects are really cool - we've done everything from a quorum KVS to a distributed lock manager, and for our final project, my group chose to...
OCT 17 2011

Eventual Consistency

I mentioned in my previous post that I was taking a [Programming in the Cloud](http://db.cs.berkeley.edu/w/cs-194-17-programming-the-cloud/) class on distributed systems and programming this semester, and one of the perks of this class is that we occasionally have speakers from the industry come in to talk about various topics. Last week, [Doug...