Sunday, October 10, 2010

Time series, disk swapping, shazam patents and other stories

Here's a collection of useful Cassandra and HBase articles I've come across in the past few months:

Time series storage in the two big NoSQL systems - Cassandra and HBase:
  - http://outerthought.org/lily/417-ot.html
  - http://markmail.org/search/?q=cassandra+counter+increment
  - https://www.cloudkick.com/blog/2010/mar/02/4_months_with_cassandra/
  - http://github.com/zohmg/zohmg

Apache Hive vs Pig:
  - http://www.jroller.com/otis/entry/hadoop_land_hive_vs_pig
  - http://www.larsgeorge.com/2009/10/hive-vs-pig.html

Cassandra GC and swapping:
  - http://blog.mikiobraun.de/2010/08/cassandra-gc-tuning.html
  - http://www.riptano.com/blog/whats-new-cassandra-065

Geohashing sounded like an ingenious concept. Here's something built on Cassandra:

  - http://www.slideshare.net/mmalone/scaling-gis-data-in-nonrelational-data-stores

Well, yeah, that's a lot of NoSQL articles to read. Here's a hilarious video against NoSQL to balance things out. (Warning: it contains foul language. For a more civilized roast, see this)
  - http://www.royans.net/arch/mongodb-is-webscale-humor/

Patent trouble. Here's the story of a smart guy who wrote a music recognizer over a weekend and got into some trouble with patent lawyers.

After a long time, I found a nice JUnit presentation that made me reconsider my decision to switch to TestNG from JUnit.

Some Linux fun - swapping OS pages and opening 500K sockets (also see above for what Cassandra did to prevent swapping):
  - http://blog.urbanairship.com/blog/2010/09/29/linux-kernel-tuning-for-c500k/
  - http://jcole.us/blog/archives/2010/09/28/mysql-swap-insanity-and-the-numa-architecture/

Until next time!