Feb
21st
Thu
21st
Yahoo deploys large scale Hadoop cluster
Yahoo’s Eric Baldeschwieler reports that Yahoo is now running a 10k+ core Hadoop cluster that holds over 5 petabytes of data.
Very cool. It appears Yahoo has now achieved most of the Hadoop design requirements they laid out back in July 2006.
Please see also my July 2006 post, “Yahoo building Google FS clone?”, and my April 2007 post, “Yahoo Pig and Google Sawzall”.