<?xml version="1.0" encoding="UTF-8"?> <rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:wfw="http://wellformedweb.org/CommentAPI/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
> <channel><title>beer planet &#187; hadoop</title> <atom:link href="http://beerpla.net/tag/hadoop/feed/" rel="self" type="application/rss+xml" /><link>http://beerpla.net</link> <description>where things have nothing to do with beer - tutorials, tips, how-tos, thoughts, hacks, and other techy nonsense</description> <lastBuildDate>Fri, 06 Jan 2012 08:50:59 +0000</lastBuildDate> <language>en</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.1</generator> <atom:link rel='hub' href='http://beerpla.net/?pushpress=hub'/> <item><title>Hadoop Primer &#8211; Yet Another Hadoop Introduction</title><link>http://beerpla.net/2008/10/20/hadoop-primer-yet-another-hadoop-introduction/</link> <comments>http://beerpla.net/2008/10/20/hadoop-primer-yet-another-hadoop-introduction/#comments</comments> <pubDate>Tue, 21 Oct 2008 06:48:38 +0000</pubDate> <dc:creator>Artem Russakovskii</dc:creator> <category><![CDATA[Databases]]></category> <category><![CDATA[Programming]]></category> <category><![CDATA[hadoop]]></category> <category><![CDATA[install]]></category> <category><![CDATA[MySQL]]></category> <category><![CDATA[scaling]]></category> <category><![CDATA[start]]></category> <guid
isPermaLink="false">http://beerpla.net/2008/10/20/hadoop-primer-yet-another-hadoop-introduction/</guid> <description><![CDATA[<p><a
href="http://beerpla.net/wp-content/uploads/HadoopPrimerYetAnotherHadoopIntroduction_14EC8/image.png" class="lightview" rel="gallery['469']" title="image"><img
title="image" style="display: inline; margin: 0px 10px 10px 0px" height="71" alt="image" src="http://beerpla.net/wp-content/uploads/HadoopPrimerYetAnotherHadoopIntroduction_14EC8/image_thumb.png" width="300" align="left" /></a> I just came upon a <a
href="http://wikis.sun.com/download/attachments/38208497/Hadoop-Primer.pdf">pretty good Hadoop introduction paper</a> posted on Sun’s wiki. <b><a
href="http://hadoop.apache.org/core/">Apache Hadoop</a></b> is a free Java software framework that supports data-intensive distributed applications. It enables applications to work with thousands of nodes and petabytes of data. Hadoop was inspired by Google&#039;s <a
href="http://en.wikipedia.org/wiki/MapReduce">MapReduce</a> and <a
href="http://en.wikipedia.org/wiki/GoogleFS">Google File System</a> (GFS) (<a
href="http://en.wikipedia.org/wiki/Hadoop">Wikipedia</a>). I wouldn’t call it an alternative to MySQL &#8211; they’re in completely different weight categories. I like to think of Hadoop as a complement &#8211; I think it’s closer to memcached in its functions than to MySQL. Perhaps a hybrid of both, but a unique beast nonetheless. If you’re serious about scaling, you owe it to yourself to start exploring Hadoop yesterday....<div
class=clear></div> <a
href="http://beerpla.net/2008/10/20/hadoop-primer-yet-another-hadoop-introduction/" class="read_more"><div
class=excerpt-end>Read the rest of this article &#187;</div></a></p>]]></description> <content:encoded><![CDATA[<p><a
href="http://beerpla.net/wp-content/uploads/HadoopPrimerYetAnotherHadoopIntroduction_14EC8/image.png" class="lightview" rel="gallery['469']" title="image"><img
title="image" style="display: inline; margin: 0px 10px 10px 0px" height="71" alt="image" src="http://beerpla.net/wp-content/uploads/HadoopPrimerYetAnotherHadoopIntroduction_14EC8/image_thumb.png" width="300" align="left" /></a> I just came upon a <a
href="http://wikis.sun.com/download/attachments/38208497/Hadoop-Primer.pdf">pretty good Hadoop introduction paper</a> posted on Sun’s wiki. <b><a
href="http://hadoop.apache.org/core/">Apache Hadoop</a></b> is a free Java software framework that supports data-intensive distributed applications. It enables applications to work with thousands of nodes and petabytes of data. Hadoop was inspired by Google&#039;s <a
href="http://en.wikipedia.org/wiki/MapReduce">MapReduce</a> and <a
href="http://en.wikipedia.org/wiki/GoogleFS">Google File System</a> (GFS) (<a
href="http://en.wikipedia.org/wiki/Hadoop">Wikipedia</a>). I wouldn’t call it an alternative to MySQL &#8211; they’re in completely different weight categories. I like to think of Hadoop as a complement &#8211; I think it’s closer to memcached in its functions than to MySQL. Perhaps a hybrid of both, but a unique beast nonetheless. If you’re serious about scaling, you owe it to yourself to start exploring Hadoop yesterday.</p><p>A few reasons for sharing the primer:</p><ul><li>it is short and concise</li><li>it has examples</li><li>and most importantly, it finally pushed me to install Hadoop on a 4-machine cluster and start playing around with it</li></ul><p>So, take a look at the <a
href="http://wikis.sun.com/download/attachments/38208497/Hadoop-Primer.pdf">primer PDF</a>, <a
href="http://www.apache.org/dyn/closer.cgi/hadoop/core/">download</a> Hadoop, and <a
href="http://hadoop.apache.org/core/docs/current/quickstart.html">quickstart</a> it. Here’s a more detailed <a
href="http://wiki.apache.org/hadoop/GettingStartedWithHadoop">setup</a> page.</p><p><a
href="http://wiki.apache.org/hadoop/PoweredBy">The big guys</a> are using it, why aren’t you?</p> Similar Posts:<ul><li><a
href="http://beerpla.net/2008/04/15/mysql-conference-liveblogging-the-future-of-mysql-tuesday-1155am-2/" rel="bookmark" title="April 15, 2008">MySQL Conference Liveblogging: The Future Of MySQL (Tuesday 11:55AM)</a></li><li><a
href="http://beerpla.net/2010/01/11/web-dev-browser-breakdown-statscharts-from-plaxo-com-for-december-2009-and-thoughts/" rel="bookmark" title="January 11, 2010">[Web Dev] Browser Breakdown Stats+Charts From Plaxo.com For December 2009 And Thoughts</a></li><li><a
href="http://beerpla.net/2009/09/03/comparison-between-solr-and-sphinx-search-servers-solr-vs-sphinx-fight/" rel="bookmark" title="September 3, 2009">Comparison Between Solr And Sphinx Search Servers (Solr Vs Sphinx &#8211; Fight!)</a></li><li><a
href="http://beerpla.net/2008/09/05/mysql-slave-lag-delay-explained-and-7-ways-to-battle-it/" rel="bookmark" title="September 5, 2008">MySQL Slave Lag (Delay) Explained And 7 Ways To Battle It</a></li><li><a
href="http://beerpla.net/2009/05/11/mysql-deletingupdating-rows-common-to-2-tables-speed-and-slave-lag-considerations/" rel="bookmark" title="May 11, 2009">[MySQL] Deleting/Updating Rows Common To 2 Tables &#8211; Speed And Slave Lag Considerations</a></li></ul>]]></content:encoded> <wfw:commentRss>http://beerpla.net/2008/10/20/hadoop-primer-yet-another-hadoop-introduction/feed/</wfw:commentRss> <slash:comments>1</slash:comments> </item> </channel> </rss>
