- The Google File System
- MapReduce: Simplified Data Processing on Large Clusters
The open source Hadoop project implements these functions so you can build a Google like cluster in your data center. In case you do not have 100s of servers at your disposal ask Amazon for help. The Amazon EC2 and S3 utility computing services can host your cluster at a reasonable price. Check out this tutorial by Tom White who illustrates how to use Hadoop and Amazon Web Services together using a large collection of web access logs:
Yahoo has recently announced their support for Hadoop.
"Looking ahead and thinking about how the economics of large scale computing continue to improve, it's not hard to imagine a time when Hadoop and Hadoop-powered infrastructure is as common as the LAMP (Linux, Apache, MySQL, Perl/PHP/Python) stack that helped to powered the previous growth of the Web."
2 comments:
Interesting thoughts. How about Open Sourcing Amazon EC2 and S3 as well?
That would be a game changer!
See my blog: http://smoothspan.wordpress.com/2007/08/16/how-does-virtualization-impact-hosting-providers-a-secret-blueprint-for-web-hosting-world-domination/
Thanks for your comment Bob. Your blog is very interesting!
Post a Comment