{"id":12412,"date":"2014-04-07T11:13:34","date_gmt":"2014-04-07T10:13:34","guid":{"rendered":"http:\/\/timoelliott.com\/blog\/?p=6574"},"modified":"2014-04-07T11:13:34","modified_gmt":"2014-04-07T10:13:34","slug":"no-hadoop-isnt-going-to-replace-your-data-warehouse","status":"publish","type":"post","link":"https:\/\/timoelliott.com\/blog\/2014\/04\/no-hadoop-isnt-going-to-replace-your-data-warehouse.html","title":{"rendered":"No, Hadoop Isn&#8217;t Going To Replace Your Data Warehouse"},"content":{"rendered":"<p>The data says that Hadoop isn&#8217;t going to replace your enterprise data warehouse.<\/p>\n<h3>Hadoop Adds Rather Than Subtracts<\/h3>\n<p>According to Gartner\u2019s latest <a href=\"http:\/\/www.gartner.com\/newsroom\/id\/2649419\" target=\"_blank\">surveys<\/a>, the number of CIOs who think that Hadoop will <em>replace<\/em> their existing analytics infrastructure has plummeted over the last few years, and is now down to just 3%.<\/p>\n<p><a href=\"https:\/\/i0.wp.com\/timoelliott.com\/blog\/wp-content\/uploads\/2014\/04\/gartnerdwhadoop.jpg?ssl=1\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" style=\"float: none; margin: 0px auto; display: block; border-width: 0px;\" title=\"gartner dw hadoop\" alt=\"gartner dw hadoop\" src=\"https:\/\/i0.wp.com\/timoelliott.com\/blog\/wp-content\/uploads\/2014\/04\/gartnerdwhadoop_thumb.jpg?resize=608%2C382&#038;ssl=1\" width=\"608\" height=\"382\" border=\"0\" \/><\/a><\/p>\n<p>Are these CIOs deluded? No, the fact that the numbers continue to drop is a good indication that they understand the technology &#8212; and its limitations.<\/p>\n<p>Note this DOESN\u2019T mean that enterprises don&#8217;t see the value of Hadoop, or aren&#8217;t going to massively increase their investment in it. Quite the opposite. 
At this week\u2019s <a href=\"http:\/\/hadoopsummit.org\/amsterdam\/\" target=\"_blank\">Hadoop Summit<\/a> in Amsterdam, I heard from large enterprises such as Deutsche Telekom, Centrica, EDF, HSBC, and ING Bank that are making <strong>big strategic bets on Hadoop as a core data framework for the future<\/strong>.<\/p>\n<p>But like the survey respondents, they believe that Hadoop ultimately \u201cadds more than subtracts.\u201d For example, <a href=\"https:\/\/www.linkedin.com\/in\/alasdairanderson\" target=\"_blank\">Alasdair Anderson<\/a> of HSBC gave a great, concrete example of the cost and flexibility advantages of Hadoop in his presentation \u201cHadoop-economics, the Invisible Hand of Big Data.\u201d But he also included a \u201cHealth Warning\u201d to use Hadoop where it makes sense:<\/p>\n<blockquote><p>\u201cThere\u2019s no relationship between the EDW and Hadoop right now \u2014 they are going to be complementary. It\u2019s NOT about rip and replace: we\u2019re not going to get rid of RDBMS or MPP, but instead use the right tool for right job \u2014 and that will very much be driven by price.\u201d<\/p><\/blockquote>\n<p><a href=\"http:\/\/www.kimballgroup.com\/\" target=\"_blank\">Ralph Kimball<\/a>, one of the key data warehousing pioneers, echoed these sentiments in a <a href=\"http:\/\/cloudera.com\/content\/cloudera\/en\/resources\/library\/recordedwebinar\/building-a-hadoop-data-warehouse-video.html\" target=\"_blank\">recent Cloudera webinar<\/a>. In a must-see presentation designed to explain Hadoop to data warehouse experts, he positively gushed over the new opportunities, but here\u2019s what he had to say in the Q&amp;A section:<\/p>\n<blockquote><p>\u201cHere\u2019s a question that made me laugh a little bit, but it\u2019s a serious question: \u2018Well does this mean that relational databases are going to die?\u2019. 
I think that there was a sense, three or four years ago, that maybe this was all a giant zero sum game between Hadoop and relational databases, and that has simply gone away. Everyone has now realized that there\u2019s a huge legacy value in relational databases for the purposes they are used for. Not only transaction processing, but for all the very focused, index-oriented queries on that kind of data, and that will continue in a very robust way forever. Hadoop, therefore, will present this alternative kind of environment for different types of analysis for different kinds of data, and the two of them will coexist. And they will call each other. There may be points at which the business user isn\u2019t actually quite sure which one of them they are touching at any point of time.\u201d<\/p><\/blockquote>\n<h3>Embrace Technology Change Outside of Hadoop, Too<\/h3>\n<p>So Hadoop isn\u2019t going to \u201creplace\u201d EDW. But how about that notion that EDWs are legacy, to be maintained but not used for new analytic applications? Will EDWs become \u201cthe mainframes of 21st century\u201d as one Hadoop Summit attendee put it?<\/p>\n<p>No. Ignoring the many advantages of Hadoop would be dumb. But it would be just as dumb to ignore the other revolutionary technology breakthroughs in the DW space. In particular, new in-memory processing opportunities have created a brand-new category that Gartner calls \u201c<a href=\"https:\/\/www.gartner.com\/doc\/2657815\/hybrid-transactionanalytical-processing-foster-opportunities\" target=\"_blank\">hybrid transactional\/analytic platforms<\/a>\u201d (HTAP):<\/p>\n<blockquote><p>\u201cHybrid transaction\/analytical processing will empower application leaders to innovate via greater situation awareness and improved business agility. 
This will entail an upheaval in the established architectures, technologies and skills driven by use of in-memory computing technologies as enablers.\u201d<\/p><\/blockquote>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" style=\"float: none; margin: 0px auto; display: block; border-width: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/i0.wp.com\/timoelliott.com\/blog\/wp-content\/uploads\/2014\/04\/image_thumb.png?resize=610%2C470&#038;ssl=1\" width=\"610\" height=\"470\" border=\"0\" \/><\/p>\n<p>HTAP isn\u2019t just a way of speeding up existing applications and analytics. The simplicity of the architecture (data is stored just once) and the flexibility of the platform mean it has a <a href=\"http:\/\/diginomica.com\/2014\/02\/03\/sap-hana-bring-real-savings-yes-numbers-impressive\/\" target=\"_blank\">lower total cost of ownership than traditional disk-based platforms<\/a>. This means it\u2019s also rapidly becoming the <a href=\"https:\/\/www.suiteonhana.com\/cross-industry\" target=\"_blank\">default platform<\/a> for new business application deployments, and the heart of new &#8220;real-time&#8221; applications.<\/p>\n<p><strong>This means there is now a three-way tug for new analytic uses: directly on the operational system with HTAP, Hadoop, or traditional DW (and\/or flexible data mashing with data discovery tools).<\/strong><\/p>\n<h3>But isn&#8217;t Hadoop going to do all that?<\/h3>\n<p>No. This was a gratingly prevalent view at Hadoop Summit, but it\u2019s firmly in the \u201cwishful thinking\u201d camp.<\/p>\n<p>Yes, there are projects to make in-memory and ACID compliance part of the Hadoop Framework.\u00a0Storm and Flume mean you can start using Hadoop with streaming data.\u00a0YARN may turn into a \u201cgeneric app provisioning system via Linux containers.\u201d<\/p>\n<p>Does this mean that you\u2019ll be able to do more with Hadoop in the future? Yes. Is it going to be easier to make applications? Yes. 
Is forty years of business process and data warehousing technology and expertise going to be obsolete any time soon? No!<\/p>\n<p>One small example, from <a href=\"https:\/\/twitter.com\/alanfgates\" target=\"_blank\">Alan Gates<\/a>, one of the co-founders of Hortonworks, presenting the new ACID features in the next version of Hive, answering a question about whether this would enable OLTP support:<\/p>\n<blockquote><p>\u201cWe\u2019re not aiming at that, it\u2019s not what HIVE is good at, we don\u2019t think it makes any sense, and it would be a total fail if you did that.\u201d<\/p><\/blockquote>\n<p>There are other projects for adding transactions to Hadoop, but integration with the corporate world will require support for standards like SQL, and the irony is that it will be a big challenge for Hadoop to support even the 22-year-old <a href=\"http:\/\/en.wikipedia.org\/wiki\/SQL-92\" target=\"_blank\">SQL-92<\/a>. As analyst <a href=\"http:\/\/twitter.com\/merv\" target=\"_blank\">Merv Adrian<\/a> put it in a recent Gartner conference presentation: \u201cWhat is remarkable is that Hadoop does SQL. Just don\u2019t expect it to do it well\u201d. (More in Gartner\u2019s report: \u201c<a href=\"https:\/\/www.gartner.com\/doc\/2668815\/choosing-sql-access-strategy-hadoop\" target=\"_blank\">choosing your SQL access strategy for Hadoop<\/a>\u201d)<\/p>\n<h3>The Architecture of the Future<\/h3>\n<p>The right answer for the future is the boring, pragmatic one that big data organizations like Hortonworks emphasize in their &#8220;<a href=\"http:\/\/hortonworks.com\/wp-content\/uploads\/2013\/09\/Reference.Architecture.SAP_Hortonworks.v1.1.pdf\" target=\"_blank\">modern data architecture<\/a>\u201d diagram. <em><strong>To make the most of your data, you will need all these proven and new technologies working seamlessly together. 
<\/strong><\/em><\/p>\n<p><a href=\"https:\/\/i0.wp.com\/timoelliott.com\/blog\/wp-content\/uploads\/2014\/04\/hadoopmodernarchitecture.png?ssl=1\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" style=\"float: none; margin-left: auto; display: block; margin-right: auto; border: 0px;\" title=\"hadoop modern architecture\" alt=\"hadoop modern architecture\" src=\"https:\/\/i0.wp.com\/timoelliott.com\/blog\/wp-content\/uploads\/2014\/04\/hadoopmodernarchitecture_thumb.png?resize=608%2C495&#038;ssl=1\" width=\"608\" height=\"495\" border=\"0\" \/><\/a><\/p>\n<h3>It\u2019s About The Applications of The Future<\/h3>\n<p>The real opportunity of new technologies like Hadoop and In-Memory processing is to enable new, more flexible, analytics-focused, actionable applications. And that takes much more than just a platform. Organizations want best-practice business applications capable of analyzing big data and putting it in the hands of people on the front line of your organization that need it, via cloud and mobile devices. And they want a vibrant ecosystem capable of helping the organization make the best use of those applications.<\/p>\n<p>For more information about what that might look like, take a look at <a href=\"http:\/\/www.sap.com\/solution\/big-data.html\" target=\"_blank\">SAP\u2019s big data solutions<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The data says that Hadoop isn\u2019t going to replace your enterprise data warehouse. 
Here&#8217;s why.<\/p>\n","protected":false},"author":2,"featured_media":12862,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[14,1],"tags":[],"class_list":["post-12412","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-thoughts","category-uncategorized"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"https:\/\/i0.wp.com\/timoelliott.com\/blog\/wp-content\/uploads\/2014\/04\/gartnerdwhadoop_thumb-2.jpg?fit=608%2C382&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p3X9RF-3ec","_links":{"self":[{"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/posts\/12412","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/comments?post=12412"}],"version-history":[{"count":0,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/posts\/12412\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/media\/12862"}],"wp:attachment":[{"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/media?parent=12412"}],"wp:term":[{"taxo
nomy":"category","embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/categories?post=12412"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/tags?post=12412"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}