SYS-CON MEDIA Authors: Elizabeth White, Yeshim Deniz, Roger Strukhoff, Jason Bloomberg, Pat Romanski

Blog Feed Post

My Top 10 Blogs of 2014

Bill Schmarzo

2014 was a good year for big data, and for some of my big data blogs.  This is my annual “My Top 10 Blogs” list.  Choosing 10 forces me to exclude some interesting topics, such as my Big Data MBA series from my teaching stint at the University of San Francisco.  Fortunately, my clients shared many real-world challenges so I had plenty of fuel for blogs.  Here’s to 2015 being an even bigger year for Big Data!

#10 (Tied) – Dear John Letter

A good friend of mine who took over a company that provides a statistics software package inspired this blog.  I have much admiration for my friend “John”, so I thought it only fair to share with him what challenges I thought he’d face when moving the software package forward, within a big data world.  Hopefully this is useful advice for any company looking to transition or build software for this brave new big data world.

Figure 1:  The Move to Actionable Analytics

Figure 1:  The Move to Actionable Analytics

#10 (Tied) – Waiting For Godot: Developing Competitive Differentiation

This blog really wasn’t about big data, but instead it challenged organizations to really understand where and how data, analytics and technology could be leveraged to provide business differentiation versus just business parity.  I really wanted the big data conversation to focus on business differentiation instead of what version of Hadoop the organization used.  Ugh!

Figure 2:  Examples of Software Products that Provide Business Differentiation

Figure 2:  Examples of Software Products that Provide Business Differentiation

#9 – Data Science – The More Actionable Data, the Better

Janet Yellen is the “Moneyball” queen of the Federal Reserve.  Janet Yellen caught my attention as an example of a leader who understands that one needs multiple perspectives (metrics) in order to thoroughly understand the state of the business.  Janet Yellen uses a dashboard of metrics to understand the job market and take decisive action.  Although more data is better than less data, it’s perhaps even more important to employ a range of metrics and scores that help you understand the situation and make better decisions.

Figure 3: Janet Yellen's Jobs Dashboard

Figure 3: Janet Yellen’s Jobs Dashboard

#8 – In the World of Big Data, Small Data Rules!!

This blog gave me a chance to revisit my childhood game of Strat-o-matic baseball.  This blog addresses the power of integrating “small” data – data captured via consumer comments, work orders, physician or mechanic notes, etc. – with the “big” data from your operational systems.  There are many opportunities for organizations to ask that one extra question – to capture the “small data” – that, when combined with the organization’s big data, can yield more accurate and actionable insights.  Organizations need a thoughtful process for ensuring the capture and central storage of this small data.

Figure 4: Boston Red Sox search for variables that are better predictors of performance

Figure 4: Boston Red Sox search for variables that are better predictors of performance

#7 – Save the Ugly Baby: the Pixar Success Story

This blog was the direct result of watching our data science team struggle with a particular client’s data.  The data science team was trying to explain to the client that the analytic process was fraught with failure, but that failures are progress because they teach you what doesn’t work.  I happened to be reading an article about innovation at Pixar, and the connection between the two processes became clear – be patient, keep testing, keep fine-tuning and eventually success will happen.

Figure 5:  Saving The Ugly Baby Process

Figure 5:  Saving The Ugly Baby Process

#6 – Best Practices for Analytics Profiles

This blog came very close to being #1 because it captures the analytic process that we use in every one of our big data and data science engagements.  Our data science team is always looking to build analytic profiles at the individual entity level, whether that’s customers, products, employees, students, slot machines, wind turbines, etc.  We like to designate these entities as the organization’s “strategic nouns,” those entities where superior knowledge and insights can help optimize key business processes and uncover new monetization opportunities.  And this approach works every time.  I love it!

Figure 6:  The Power of Analytic Profiles and Strategic "Nouns"

Figure 6:  The Power of Analytic Profiles and Strategic “Nouns”

#5 – Don’t Think Better; Think Different

This blog reflects that biggest problem that I see with organizations with respect to moving into the big data world: organizations are trying to apply old paradigms to new technologies and capabilities.  It just doesn’t work.  For example, I don’t know how many times I’ve seen organizations try to incrementally extend their BI and data warehouse capabilities by trying to integrate Hadoop and HDFS capabilities, and totally swing and miss.  Then they blame it on the technology and not their approach.  Ugh!

By the way, be sure to read the comment about “Don’t Think Business Functions; Think Business Initiatives” at the bottom of the blog.  Interesting observation shared with my by one of my Big Data MBA students.

Figure 7:  Evolution from Descriptive to Predictive to Prescriptive Analytics

Figure 7:  Evolution from Descriptive to Predictive to Prescriptive Analytics

#4 – Hadoop Data Modeling Lessons – by Vin Diesel

This blog took me nearly 12 months to write, and even now I’m not certain that I got it right. The Vin Diesel comment is a reference to his role as Xander Cage in the movie “XXX” where he admonishes the Prague police for not using their full firepower (heat seeking missile) in a standoff with a sniper. The way that a data scientist designs their data models is very different from how a data warehouse designer designs a data model.  In order to take advantage of the raw processing power of MPP and Hadoop architectures, the data scientists want long flat tables that eliminate joins.  And that’s what we’ve got – massive flat files against which the data scientists are running their analytics.  They’re sort of ugly, but very effective from an analytics perspective.

Figure 8:  Data Science - Flatten the Star!

Figure 8:  Data Science – Flatten the Star!

#3 – How I’ve Learned To Stop Worrying And Love The Data Lake

This is one of my favorite blogs and a topic that I discuss frequently. I especially like how the blog addresses the advantages of a data lake to the data warehousing and BI teams.  I’m a big fan of the data lake (even if I’m not a big fan of the name) and see more and more of our clients adopting the data lake not only to support their analytics environments, but more and more to free up expensive resources off of their data warehousing environments.  The data lake is really a winner, and 2015 will be the year of the data lake (and I’ll be speaking about the Big Data MBA and the Data Lake at the February Strata conference…hint, hint)!

Figure 9:  The Power of the Data Lake

Figure 9:  The Power of the Data Lake

#2 – Dynamic Duo of Analytic Power: Business Intelligence Analyst PLUS Data Scientist

This is one of my more recent blogs, and it has gotten lots of favorable feedback.  This blog took me a couple of months to get right.  I really struggled to understand the key differences between how the traditional Business Intelligence process works (which I knew well from my Business Objects days), and how the data science process works.  But after several engagements where I got a chance to work closely with our data science team, I think I finally nailed it!

Figure 10:  The Data Science Engagement Process

Figure 10:  The Data Science Engagement Process

#1 – “Big Ideas” Big Data Business Model Maturity Index

This one is clearly my favorite.  We turned the Big Data Business Model Maturity Index into a very cool, visually engaging animation.  The animation turns the Big Data Business Model Maturity Index into a living, and hopefully thought-provoking story with engaging visuals. It’s a very creative presentation, and it actually makes me look smart.  Heck, even my kids were impressed!!

 

Big Data Business Model Maturity Index

Big Data Business Model Maturity Index

So that’s it, my “Top 10 + 1” blogs.  2014 was a great year and I hope to have even more to share in 2015! Thank you for all your feedback and comments!

My Top 10 Blogs of 2014
Bill Schmarzo

Read the original blog entry...

More Stories By William Schmarzo

Bill Schmarzo, author of “Big Data: Understanding How Data Powers Big Business” and “Big Data MBA: Driving Business Strategies with Data Science”, is responsible for setting strategy and defining the Big Data service offerings for Hitachi Vantara as CTO, IoT and Analytics.

Previously, as a CTO within Dell EMC’s 2,000+ person consulting organization, he works with organizations to identify where and how to start their big data journeys. He’s written white papers, is an avid blogger and is a frequent speaker on the use of Big Data and data science to power an organization’s key business initiatives. He is a University of San Francisco School of Management (SOM) Executive Fellow where he teaches the “Big Data MBA” course. Bill also just completed a research paper on “Determining The Economic Value of Data”. Onalytica recently ranked Bill as #4 Big Data Influencer worldwide.

Bill has over three decades of experience in data warehousing, BI and analytics. Bill authored the Vision Workshop methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements. Bill serves on the City of San Jose’s Technology Innovation Board, and on the faculties of The Data Warehouse Institute and Strata.

Previously, Bill was vice president of Analytics at Yahoo where he was responsible for the development of Yahoo’s Advertiser and Website analytics products, including the delivery of “actionable insights” through a holistic user experience. Before that, Bill oversaw the Analytic Applications business unit at Business Objects, including the development, marketing and sales of their industry-defining analytic applications.

Bill holds a Masters Business Administration from University of Iowa and a Bachelor of Science degree in Mathematics, Computer Science and Business Administration from Coe College.

Latest Stories
Atmosera delivers modern cloud services that maximize the advantages of cloud-based infrastructures. Offering private, hybrid, and public cloud solutions, Atmosera works closely with customers to engineer, deploy, and operate cloud architectures with advanced services that deliver strategic business outcomes. Atmosera's expertise simplifies the process of cloud transformation and our 20+ years of experience managing complex IT environments provides our customers with the confidence and trust tha...
With the introduction of IoT and Smart Living in every aspect of our lives, one question has become relevant: What are the security implications? To answer this, first we have to look and explore the security models of the technologies that IoT is founded upon. In his session at @ThingsExpo, Nevi Kaja, a Research Engineer at Ford Motor Company, discussed some of the security challenges of the IoT infrastructure and related how these aspects impact Smart Living. The material was delivered interac...
CloudEXPO has been the M&A capital for Cloud companies for more than a decade with memorable acquisition news stories which came out of CloudEXPO expo floor. DevOpsSUMMIT New York faculty member Greg Bledsoe shared his views on IBM's Red Hat acquisition live from NASDAQ floor. Acquisition news was announced during CloudEXPO New York which took place November 12-13, 2019 in New York City. Our Silicon Valley 2019 schedule will showcase 200 keynotes, sessions, general sessions, power panels, and...
Intel is an American multinational corporation and technology company headquartered in Santa Clara, California, in the Silicon Valley. It is the world's second largest and second highest valued semiconductor chip maker based on revenue after being overtaken by Samsung, and is the inventor of the x86 series of microprocessors, the processors found in most personal computers (PCs). Intel supplies processors for computer system manufacturers such as Apple, Lenovo, HP, and Dell. Intel also manufactu...
Darktrace is the world's leading AI company for cyber security. Created by mathematicians from the University of Cambridge, Darktrace's Enterprise Immune System is the first non-consumer application of machine learning to work at scale, across all network types, from physical, virtualized, and cloud, through to IoT and industrial control systems. Installed as a self-configuring cyber defense platform, Darktrace continuously learns what is ‘normal' for all devices and users, updating its understa...
As you know, enterprise IT conversation over the past year have often centered upon the open-source Kubernetes container orchestration system. In fact, Kubernetes has emerged as the key technology -- and even primary platform -- of cloud migrations for a wide variety of organizations. Kubernetes is critical to forward-looking enterprises that continue to push their IT infrastructures toward maximum functionality, scalability, and flexibility. As they do so, IT professionals are also embr...
The widespread success of cloud computing is driving the DevOps revolution in enterprise IT. Now as never before, development teams must communicate and collaborate in a dynamic, 24/7/365 environment. There is no time to wait for long development cycles that produce software that is obsolete at launch. DevOps may be disruptive, but it is essential. DevOpsSUMMIT at CloudEXPO expands the DevOps community, enable a wide sharing of knowledge, and educate delegates and technology providers alike.
Cloud-Native thinking and Serverless Computing are now the norm in financial services, manufacturing, telco, healthcare, transportation, energy, media, entertainment, retail and other consumer industries, as well as the public sector. The widespread success of cloud computing is driving the DevOps revolution in enterprise IT. Now as never before, development teams must communicate and collaborate in a dynamic, 24/7/365 environment. There is no time to wait for long development cycles that pro...
At CloudEXPO Silicon Valley, June 24-26, 2019, Digital Transformation (DX) is a major focus with expanded DevOpsSUMMIT and FinTechEXPO programs within the DXWorldEXPO agenda. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive over the long term. A total of 88% of Fortune 500 companies from a generation ago are now out of business. Only 12% still survive. Similar percentages are found throug...
OpsRamp is an enterprise IT operation platform provided by US-based OpsRamp, Inc. It provides SaaS services through support for increasingly complex cloud and hybrid computing environments from system operation to service management. The OpsRamp platform is a SaaS-based, multi-tenant solution that enables enterprise IT organizations and cloud service providers like JBS the flexibility and control they need to manage and monitor today's hybrid, multi-cloud infrastructure, applications, and wor...
Apptio fuels digital business transformation. Technology leaders use Apptio's machine learning to analyze and plan their technology spend so they can invest in products that increase the speed of business and deliver innovation. With Apptio, they translate raw costs, utilization, and billing data into business-centric views that help their organization optimize spending, plan strategically, and drive digital strategy that funds growth of the business. Technology leaders can gather instant recomm...
The Master of Science in Artificial Intelligence (MSAI) provides a comprehensive framework of theory and practice in the emerging field of AI. The program delivers the foundational knowledge needed to explore both key contextual areas and complex technical applications of AI systems. Curriculum incorporates elements of data science, robotics, and machine learning-enabling you to pursue a holistic and interdisciplinary course of study while preparing for a position in AI research, operations, ...
CloudEXPO has been the M&A capital for Cloud companies for more than a decade with memorable acquisition news stories which came out of CloudEXPO expo floor. DevOpsSUMMIT New York faculty member Greg Bledsoe shared his views on IBM's Red Hat acquisition live from NASDAQ floor. Acquisition news was announced during CloudEXPO New York which took place November 12-13, 2019 in New York City.
Industry after industry is under siege as companies embrace digital transformation (DX) to disrupt existing business models and disintermediate their competitor’s customer relationships. But what do we mean by “Digital Transformation”? The coupling of granular, real-time data (e.g., smartphones, connected devices, smart appliances, wearables, mobile commerce, video surveillance) with modern technologies (e.g., cloud native apps, big data architectures, hyper-converged technologies, artificial in...
Codete accelerates their clients growth through technological expertise and experience. Codite team works with organizations to meet the challenges that digitalization presents. Their clients include digital start-ups as well as established enterprises in the IT industry. To stay competitive in a highly innovative IT industry, strong R&D departments and bold spin-off initiatives is a must. Codete Data Science and Software Architects teams help corporate clients to stay up to date with the mod...