{"id":12335,"date":"2013-07-05T10:58:09","date_gmt":"2013-07-05T09:58:09","guid":{"rendered":"http:\/\/timoelliott.com\/blog\/?p=5380"},"modified":"2013-07-05T10:58:09","modified_gmt":"2013-07-05T09:58:09","slug":"7-definitions-of-big-data-you-should-know-about","status":"publish","type":"post","link":"https:\/\/timoelliott.com\/blog\/2013\/07\/7-definitions-of-big-data-you-should-know-about.html","title":{"rendered":"7 Definitions of Big Data You Should Know About"},"content":{"rendered":"<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border: 0px;\" title=\"big-data-in-lego\" alt=\"big-data-in-lego\" src=\"https:\/\/i0.wp.com\/timoelliott.com\/blog\/wp-content\/uploads\/2013\/07\/big-data-in-lego.jpg?resize=690%2C310&#038;ssl=1\" width=\"690\" height=\"310\" border=\"0\" \/><\/p>\n<p>Faced with the ongoing confusion over the term \u2018Big Data,\u2019 here\u2019s a handy \u2013 and somewhat cynical \u2013 guide to some of the key definitions that you might see out there.<\/p>\n<p>The first thing to note is that \u2013 despite <a href=\"http:\/\/en.wikipedia.org\/wiki\/Big_data\" target=\"_blank\">what Wikipedia says<\/a> \u2013 everybody in the industry generally agrees that Big Data isn\u2019t just about having more data (since that\u2019s just inevitable, and boring).<\/p>\n<h3>(1) The Original Big Data<\/h3>\n<p>Big Data as the three Vs: Volume, Velocity, and Variety. This is the most venerable and well-known definition, first coined by <a href=\"http:\/\/blogs.gartner.com\/doug-laney\/files\/2012\/01\/ad949-3D-Data-Management-Controlling-Data-Volume-Velocity-and-Variety.pdf\" target=\"_blank\">Doug Laney of Gartner over twelve years ago<\/a>. Since then, many others have tried to <a href=\"http:\/\/en.wikipedia.org\/wiki\/Up_to_eleven\" target=\"_blank\">take it to 11<\/a> with additional Vs including Validity, Veracity, Value, and Visibility.<\/p>\n<h3>(2) Big Data as Technology<\/h3>\n<p>Why did a 12-year old term suddenly zoom into the spotlight? It wasn&#8217;t simply because we do indeed now have a lot more volume, velocity, and variety than a decade ago. Instead, it was fueled by new technology, and in particular the fast rise of open source technologies such as <a href=\"http:\/\/hadoop.apache.org\/\" target=\"_blank\">Hadoop<\/a> and other <a href=\"https:\/\/en.wikipedia.org\/wiki\/NoSQL\" target=\"_blank\">NoSQL<\/a> ways of storing and manipulating data.<\/p>\n<p>The users of these new tools needed a term that differentiated them from previous technologies, and\u2013somehow\u2013ended up settling on the woefully inadequate term Big Data. If you go to a big data conference, you can be assured that sessions featuring relational databases\u2013no matter how many Vs they boast\u2013will be in the minority.<\/p>\n<h3>(3) Big Data as Data Distinctions<\/h3>\n<p>The problem with big-data-as-technology is that (a) it&#8217;s vague enough that every vendor in the industry jumped in to claim it for themselves and (b) everybody &#8216;knew&#8217; that they were supposed to elevate the debate and talk about something more business-y and useful.<\/p>\n<p>Here are two good attempts to help organizations understand why Big Data now is different from mere big data in the past:<\/p>\n<ul>\n<li><a href=\"http:\/\/hortonworks.com\/blog\/7-key-drivers-for-the-big-data-market\/\" target=\"_blank\">Transactions, Interactions, and Observations.<\/a> This one is from <a href=\"https:\/\/twitter.com\/shaunconnolly\" target=\"_blank\">Shaun Connolly of Hortonworks.<\/a>\u00a0 Transactions make up the majority of what we have collected, stored and analyzed in the past. Interactions are data that comes from things like people clicking on web pages. Observations are data collected automatically.<\/li>\n<li><a href=\"http:\/\/9sight.com\/Big_Data_Zoo.pdf\" target=\"_blank\">Process-Mediated Data, Human-Sourced Information, and Machine-Generated Data<\/a>. This is brought to us by <a href=\"https:\/\/twitter.com\/BarryDevlin\" target=\"_blank\">Barry Devlin<\/a>, who co-wrote the first paper on data warehousing. It is basically the same as the above, but with clearer names.<\/li>\n<\/ul>\n<h3>(4) Big Data as Signals<\/h3>\n<p>This is another business-y approach that divides the world by intent and timing rather than the type of data, courtesy of SAP\u2019s <a href=\"https:\/\/twitter.com\/nstevenlucas\" target=\"_blank\">Steve Lucas<\/a>. The &#8216;old world&#8217; is about transactions, and by the time these transactions are recorded, it&#8217;s too late to do anything about them: companies are constantly &#8216;managing out of the rear-view mirror&#8217;. In the &#8216;new world,&#8217; companies can instead use <a href=\"http:\/\/www.saphana.com\/community\/blogs\/blog\/2012\/08\/21\/beyond-the-balance-sheet-run-your-business-on-new-signals-in-the-age-of-big-data\" target=\"_blank\">new &#8216;signal&#8217; data<\/a> to anticipate what&#8217;s going to happen, and intervene to improve the situation.<\/p>\n<p>Examples include tracking brand sentiment on social media (if your &#8216;likes&#8217; fall off a cliff, your sales will surely follow) and predictive maintenance (complex algorithms determine when you need to replace an aircraft part, before the plane gets expensively stuck on the runway).<\/p>\n<h3>(5) Big Data as Opportunity<\/h3>\n<p>This one is from <a href=\"https:\/\/451research.com\/biography?eid=333\">451 Research&#8217;s Matt Aslett<\/a> and broadly defines big data as &#8216;analyzing data that was <a href=\"http:\/\/insights.wired.com\/profiles\/blogs\/attention-big-data-enthusiasts-here-s-what-you-shouldn-t-ignore#axzz2YA8N0ne7\" target=\"_blank\">previously ignored<\/a> because of technology limitations.&#8217; (OK, so technically, Matt used the term \u2018Dark Data\u2019 rather than Big Data, but it\u2019s close enough). This is my personal favorite, since I believe it lines up best with how the term is actually used in most articles and discussions.<\/p>\n<h3>(6) Big Data as Metaphor<\/h3>\n<p>In his wonderful book <a href=\"http:\/\/thehumanfaceofbigdata.com\/\" target=\"_blank\">The Human Face of Big Data<\/a>, journalist <a href=\"http:\/\/en.wikipedia.org\/wiki\/Rick_Smolan\" target=\"_blank\">Rick Smolan<\/a> says big data is \u201cthe process of helping the planet grow a nervous system, one in which we are just another, human, type of sensor.\u201d Deep, huh? But by the time you\u2019ve read <a href=\"https:\/\/timoelliott.com\/blog\/2013\/05\/sapphire-now-the-human-face-of-big-data.html\" target=\"_blank\">some of stories in the book<\/a> or the mobile app, you\u2019ll be nodding your head in agreement.<\/p>\n<h3>(7) Big Data as New Term for Old Stuff<\/h3>\n<p>This is the laziest and most cynical use of the term, where projects that were possible using previous technology, and would have been called BI or analytics in the past have <a href=\"https:\/\/timoelliott.com\/blog\/2012\/12\/is-big-data-the-new-term-for-business-intelligence.html\" target=\"_blank\">suddenly been rebaptized<\/a> in a fairly blatant attempt to jump on the big data bandwagon.<\/p>\n<p>And finally, one bonus, fairly useless <a href=\"https:\/\/timoelliott.com\/blog\/2012\/12\/what-is-big-data.html\" target=\"_blank\">definition of big data<\/a>. Still not enough for you? Here&#8217;s <a href=\"http:\/\/www.opentracker.net\/article\/25-definitions-big-data\">30+ more and counting<\/a>!.<\/p>\n<p>The bottom line: whatever the disagreements over the definition, everybody agrees on one thing: big data is a big deal, and will lead to huge new opportunities in the coming years.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This is a handy and cynical guide to seven different definitions of Big Data. The one thing that everybody does agree on? It&#8217;s a big deal. <\/p>\n","protected":false},"author":2,"featured_media":5379,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[3,14],"tags":[27,61,100,149,160,173,204,324,351,370,560,576,878,911,916,931],"class_list":["post-12335","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-bi-20","category-thoughts","tag-bigdata","tag-61","tag-analytics","tag-barry-devlin","tag-bi","tag-big-data","tag-business-intelligence","tag-dashboards","tag-data-warehousing","tag-decision-support-systems","tag-hana","tag-hortonworks","tag-reporting","tag-sap","tag-sap-hana","tag-saphana"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"https:\/\/i0.wp.com\/timoelliott.com\/blog\/wp-content\/uploads\/2013\/07\/big-data-in-lego-1.jpg?fit=690%2C310&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p3X9RF-3cX","_links":{"self":[{"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/posts\/12335","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/comments?post=12335"}],"version-history":[{"count":0,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/posts\/12335\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/media\/5379"}],"wp:attachment":[{"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/media?parent=12335"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/categories?post=12335"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/timoelliott.com\/blog\/wp-json\/wp\/v2\/tags?post=12335"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}