Friday, September 21, 2012

Another week of Big Data love

Posted on 11:19 AM by Unknown

My goodness, I've been busy this week! What have I been doing? Well, at least part of it has involved reading about the latest Big Data happenings...

  • Interesting article from the eBay team about how they are using ZooKeeper: Grid Computing with Fault-Tolerant Actors and ZooKeeper
    Before diving in further, this is a good place to give a shout-out to the folks who designed, developed and maintain ZooKeeper. I think it’s one of the most useful open-source contributions for distributed computing of the past decade. Most of the fault tolerance in Nebula can be reduced to simple patterns that make use of the ZooKeeper model (e.g., ephemeral, session-bound znodes), its serialized atomic updates, and the associated watch mechanism.
  • Epic essay on consistency algorithms in NoSQL databases: Distributed Algorithms in NoSQL Databases
    In the rest of this article we study a number of distributed activities like replication or failure detection that could happen in a database. These activities, highlighted in bold below, are grouped into three major sections:
    • Data Consistency. Historically, NoSQL paid a lot of attention to tradeoffs between consistency, fault-tolerance and performance to serve geographically distributed systems, low-latency or highly available applications. Fundamentally, these tradeoffs spin around data consistency, so this section is devoted to data replication and data repair.
    • Data Placement. A database should accommodate itself to different data distributions, cluster topologies and hardware configurations. In this section we discuss how to distribute or rebalance data in such a way that failures are handled rapidly, persistence guarantees are maintained, queries are efficient, and system resources like RAM or disk space are used evenly throughout the cluster.
    • System Coordination. Coordination techniques like leader election are used in many databases to implement fault-tolerance and strong data consistency. However, even decentralized databases typically track their global state, detect failures and topology changes. This section describes several important techniques that are used to keep the system in a coherent state.
  • Discussion about the GitHub outage, and specifically about the issues surrounding declaring a node dead and deciding when to failover:
    • GitHub availability this week
    • Is automated failover the root of all evil?
    • Shoot The Automated Failure In The Head
    • The Math of Automated Failover
    • Handling Database Failover at Craigslist
  • Big Data? Cloud? Distributed Systems? Next month's OSDI conference is there! OSDI '12 Program
  • A tiny bit dated, but this reading list from a graduate course in Cloud Computing taught by Professor Stoica of Berkeley last fall has a wonderful set of core references and reading material: Cloud computing: Systems, Networking, and Frameworks
    In this course, we describe the critical technology trends that are enabling cloud computing, the architecture and the design of existing deployments, the services and the applications they offer, and the challenges that need to be addressed to help cloud computing to reach its full potential.
  • Peter Bailis published a nice essay about the detailed tradeoffs between latency and efficiency in replicated data stores (essentially, when is it worth wasting work for faster response time): Doing redundant work to speed up distributed queries
    In distributed systems, there’s a subtle and somewhat underappreciated strategy for reducing tail latencies: doing redundant work. If you send the same request to multiple servers, (all else equal) you’re going to get an answer back faster than waiting for a single server. Waiting for, say, one of three servers to reply is often faster than waiting for one of one to reply.
  • Here's a nice Kirk McKusick article about the current state of the art in filesystems as they evolve to adapt to the changing disk sector size (the underlying hardware is moving from 512-byte sectors to 4096-byte sectors): Disks from the Perspective of a File System
    File systems need to be aware of the change to the underlying media and ensure that they adapt by always writing in multiples of the larger sector size. Historically, file systems were organized to store files smaller than 512 bytes in a single sector. With the change in disk technology, most file systems have avoided the slowdown of 512-byte writes by making 4,096 bytes the smallest allocation size.
  • And this week's coffee-house debate over whether MongoDB is the greatest or the worst technology ever to be visited upon the earth:
    • Why I Migrated Away From MongoDB
    • And the Hacker News discussion that ensued
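
To get a feel for the redundant-work idea in the Bailis essay above, here is a minimal simulation sketch. It is not taken from his article: the latency model (most replies fast, a small fraction very slow), the function names, and all the numbers are assumptions made up for illustration. It compares the tail latency of waiting for one server against taking the fastest reply from several independent replicas.

```python
import random


def draw_latency(rng):
    """Draw one hypothetical server response time in milliseconds.

    Assumed model (not from the essay): 95% of replies are fast,
    5% hit a slow path such as a GC pause or a busy disk.
    """
    if rng.random() < 0.05:
        return 200.0 + rng.expovariate(1 / 50.0)  # slow path
    return rng.expovariate(1 / 10.0)  # fast path, mean 10 ms


def simulate(replicas, trials=20000, seed=1):
    """Send each request to `replicas` servers and keep the fastest reply."""
    rng = random.Random(seed)
    return [
        min(draw_latency(rng) for _ in range(replicas))
        for _ in range(trials)
    ]


def percentile(samples, p):
    """Return the p-th percentile of samples (nearest-rank style)."""
    ordered = sorted(samples)
    index = min(len(ordered) - 1, int(p / 100 * len(ordered)))
    return ordered[index]


if __name__ == "__main__":
    for replicas in (1, 2, 3):
        p99 = percentile(simulate(replicas), 99)
        print(f"replicas={replicas}  p99={p99:.1f} ms")
```

Under this assumed model, a single server is slow one time in twenty, so the 99th percentile lands on the slow path. With three replicas, all three have to be slow at once for the request to be slow, which is far less likely. The cost is the other half of the tradeoff: every request does up to three times the work.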
