{"id":12376,"date":"2013-12-18T12:54:27","date_gmt":"2013-12-18T11:54:27","guid":{"rendered":"http:\/\/timoelliott.com\/blog\/?p=5989"},"modified":"2013-12-18T12:54:27","modified_gmt":"2013-12-18T11:54:27","slug":"the-battle-of-the-database-elephants","status":"publish","type":"post","link":"https:\/\/timoelliott.com\/blog\/2013\/12\/the-battle-of-the-database-elephants.html","title":{"rendered":"The Battle of The Database Elephants"},"content":{"rendered":"<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border-width: 0px;\" title=\"database-elephants-banner\" alt=\"database-elephants-banner\" src=\"https:\/\/i0.wp.com\/timoelliott.com\/blog\/wp-content\/uploads\/2013\/12\/database-elephants-banner.jpg?resize=690%2C310&#038;ssl=1\" width=\"690\" height=\"310\" border=\"0\" \/><\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" style=\"background-image: none; margin: 0px 0px 5px 5px; padding-left: 0px; padding-right: 0px; display: inline; float: right; padding-top: 0px; border: 0px;\" title=\"stonebraker_x220\" alt=\"stonebraker_x220\" src=\"https:\/\/i0.wp.com\/timoelliott.com\/blog\/wp-content\/uploads\/2013\/12\/stonebraker_x220.jpg?resize=91%2C136&#038;ssl=1\" width=\"91\" height=\"136\" align=\"right\" border=\"0\" \/><a href=\"http:\/\/about.gigaom.com\/\">GigaOM Research<\/a> recently published an <a href=\"http:\/\/gigaom.com\/2013\/12\/12\/database-whiz-michael-stonebraker-on-the-impending-battle-of-the-database-elephants\/\" target=\"_blank\">interview<\/a> with database rock star <a href=\"http:\/\/en.wikipedia.org\/wiki\/Michael_Stonebraker\" target=\"_blank\">Michael Stonebraker<\/a> on \u201cthe impending battle of the database elephants,\u201d covering his thoughts on the disruption in the database market.<\/p>\n<p>This blog includes the excerpts I thought were most interesting:<\/p>\n<h3>SAP enters the 
database market<\/h3>\n<blockquote><p>&#8220;In the OLTP market, recent advances have completely convinced me that main memory database systems &#8230;<strong> are going to completely take over<\/strong>&#8221;<\/p>\n<p>&#8220;The database market is really alive, vibrant, with lots of new ideas, and I think the legacy vendors face the &#8220;innovator&#8217;s dilemma&#8221; in spades&#8221;<\/p>\n<p>&#8220;SAP is in the database business and SAP customers are Oracle&#8217;s biggest customer right now, and<strong> among the elephants there\u2019s going to be a duke it out between Oracle and SAP<\/strong> and I\u2019m delighted to look on from the side.\u201d<\/p><\/blockquote>\n<h3>Legacy databases are obsolete<\/h3>\n<blockquote><p>&#8220;I think data warehouses are an SQL market. It\u2019s just there\u2019s the new way to do it and the old way to do it, and the legacy vendors have the old way to do it. In OLTP, I think it\u2019s a SQL market also, and <strong>the legacy vendors have the old way and there\u2019s a new, much better way<\/strong>.<\/p>\n<p>In round numbers, the database market is a third OLTP, a third data warehouses and a third everything else, and I think \u201ceverything else\u201d is primarily a non-SQL market. I think in data warehouses and in OLTP, it will remain a SQL market, it\u2019s just the implementations have to change from what they are now to better ideas.<\/p>\n<p>The codebases that the elephants, the legacy vendors are selling right now are 25 years old. And<strong> it&#8217;s time for them to be retired and sent to the home for obsolete software!<\/strong>&#8221;<\/p><\/blockquote>\n<h3>Modern ideas behind HANA<\/h3>\n<blockquote><p>&#8220;My expectation is that<strong> SAP will make a compelling case for their SAP customers switching off of Oracle and onto HANA<\/strong>. That case has not been made yet, it\u2019s way too early. 
The real thing to watch is how SAP customers are going to react to persuasion from SAP to switch database systems.&#8221;<\/p>\n<p>&#8220;<strong>I\u2019ve looked at the ideas [behind HANA], and I think the ideas are good<\/strong>. They are modern ideas. It&#8217;s too soon to tell whether the implementation will hold up to the ideas. My suspicion is that it deserves to be taken seriously, and that it will have a very large elephant pushing it very hard&#8221;<\/p><\/blockquote>\n<h3>Gap between NoSQL and SQL narrowing<\/h3>\n<blockquote><p>\u201cMy favorite way to categorize the NoSQL guys is that they started off as \u201cNoSQL,\u201d meaning \u201cSQL is bad.\u201d After a while, that turned into NoSQL\u00a0 meaning \u201cnot only SQL\u201d \u2013 SQL was fine, and they wanted to co-exist with SQL systems. <strong>My prediction is that NoSQL will come to mean &#8220;not yet SQL.&#8221; <\/strong><\/p>\n<p>The two things the NoSQL guys say is number one, \u201cdon&#8217;t use SQL, instead use low-level record-at-a-time language.\u201d Cassandra and Mongo have both announced what looks like \u2013 unless you squint\u2014a high level language that is basically SQL. I think the NoSQL guys will move to putting higher-level languages on their products, and thereby make the difference between NoSQL and SQL get much smaller.<\/p>\n<p>I also think that the second thing is that they don&#8217;t like <a href=\"http:\/\/en.wikipedia.org\/wiki\/ACID\" target=\"_blank\">ACID<\/a>.\u00a0 The biggest proponent of NoSQL non-ACID has been historically a guy named <a href=\"http:\/\/research.google.com\/people\/jeff\/\" target=\"_blank\">Jeff Dean<\/a> at Google, who is responsible for most or all of their database offerings. And he and the team recently wrote a system called <a href=\"http:\/\/en.wikipedia.org\/wiki\/Spanner_(database)\" target=\"_blank\">Spanner<\/a>. Spanner is a pure ACID system. 
So Google is moving to ACID and I think <strong>the NoSQL market will move away from <\/strong><a href=\"http:\/\/en.wikipedia.org\/wiki\/Eventual_consistency\" target=\"_blank\"><strong>eventual consistency<\/strong><\/a><strong> and toward ACID<\/strong>, and so I think the distinction between the two camps will decrease in the future.<\/p>\n<p>\u201cThere\u2019s been 40 years of DBMS research, starting way back in the 70s. This was a\u00a0 huge debate in the 70s in the relational database research world, and if you go back and look at the history in the 70s, all the discussion today of ACID vs non-ACID all got wrangled out back then. The NoSQL engineers didn&#8217;t&#8230; You know, &#8220;<strong>If you don&#8217;t pay attention to history, you&#8217;re going to have to repeat it<\/strong>,&#8221; which I think is what\u2019s happening.<\/p><\/blockquote>\n<h3>Did Oracle take good care of <a href=\"http:\/\/www.oracle.com\/technetwork\/database\/database-technologies\/timesten\/overview\/index.html\" target=\"_blank\">TimesTen<\/a>?<\/h3>\n<blockquote><p><a href=\"https:\/\/i0.wp.com\/timoelliott.com\/blog\/wp-content\/uploads\/2013\/12\/only-10-percent-useful-work.jpg?ssl=1\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" style=\"background-image: none; margin: 0px 0px 5px 5px; padding-left: 0px; padding-right: 0px; display: inline; float: right; padding-top: 0px; border: 0px;\" title=\"only-10-percent-useful-work\" alt=\"only-10-percent-useful-work\" src=\"https:\/\/i0.wp.com\/timoelliott.com\/blog\/wp-content\/uploads\/2013\/12\/only-10-percent-useful-work_thumb.jpg?resize=165%2C161&#038;ssl=1\" width=\"165\" height=\"161\" align=\"right\" border=\"0\" \/><\/a>&#8220;That&#8217;s a technical question&#8230; I and some others wrote a paper called &#8220;<a href=\"http:\/\/nms.csail.mit.edu\/~stavros\/pubs\/OLTP_sigmod08.pdf\" target=\"_blank\">OLTP through the looking-glass, and what we found there<\/a>.&#8221;<\/p>\n<p>We took an open source 
legacy DBMS called <a href=\"http:\/\/research.cs.wisc.edu\/shore\/\" target=\"_blank\">Shore<\/a> that is from the University of Wisconsin. And we said \u201csuppose all the data fits in main memory?\u201d If you have a terabyte of data or less, or maybe even these days two or three or five terabytes, it\u2019s perfectly reasonable to put that in main memory.<\/p>\n<p>So we ran the industry standard benchmark, which is <a href=\"http:\/\/www.tpc.org\/tpcc\/\" target=\"_blank\">TPC-C<\/a>, on data with a buffer pool big enough to hold all the data. And then we said \u201cwhere do all the cycles go?\u201d<\/p>\n<p>The answers were a little bit shocking: <strong>less than 10% goes into useful work \u2013 meaning actually solving the SQL command that comes in.<\/strong> The other 90+% went to four different places [\u2026.]<\/p>\n<p>TimesTen was architected in the 90s [\u2026] it\u2019s got three of the four big pieces of overhead, and so its ability to go blindingly fast is really compromised. 
And so I think the question is not &#8220;has any particular system been well taken care of?&#8221; or not, it&#8217;s more that any system written more than six or eight years ago didn&#8217;t realize where all the overhead is going, and wasn&#8217;t architected in a way that goes blindingly fast.&#8221;<\/p><\/blockquote>\n<h3>The full interview<\/h3>\n<p>Listen to the full \u201c<a href=\"https:\/\/soundcloud.com\/gigaom-structure\/database-whiz-michael\/s-Uum1j\" target=\"_blank\">Structure Show<\/a>\u201d (The Stonebraker portion starts at 18:20)<\/p>\n<p><iframe loading=\"lazy\" src=\"https:\/\/w.soundcloud.com\/player\/?url=https%3A\/\/api.soundcloud.com\/tracks\/124398181&amp;color=00a8ff\" height=\"166\" width=\"100%\" frameborder=\"no\" scrolling=\"no\"><\/iframe><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Database rock star Michael Stonebraker says &#8220;main memory database systems are going to completely take over&#8230; among the elephants there\u2019s going to be a duke it out between Oracle and 
SAP&#8221;<\/p>\n","protected":false},"author":2,"featured_media":5985,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[14],"tags":[100,173,229,351,353,533,560,749,750,773,781,793,911,916,931,1002,1010,1023,1129],"class_list":["post-12376","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-thoughts","tag-analytics","tag-big-data","tag-cassandra","tag-data-warehousing","tag-databases","tag-google","tag-hana","tag-mongo","tag-mongodb","tag-nosql","tag-oltp","tag-oracle","tag-sap","tag-sap-hana","tag-saphana","tag-spanner","tag-sql","tag-stonebraker","tag-voltdb"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"https:\/\/i0.wp.com\/timoelliott.com\/blog\/wp-content\/uploads\/2013\/12\/database-elephants-banner-1.jpg?fit=690%2C310&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p3X9RF-3dC","_links":{"self":[{"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/posts\/12376","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/comments?post=12376"}],"version-history":[{"count":0
,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/posts\/12376\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/media\/5985"}],"wp:attachment":[{"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/media?parent=12376"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/categories?post=12376"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/tags?post=12376"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}