![]() |
Let your Ideas Suck
Let me reiterate a couple of key points in the preceding paragraph:
The story concept begins to take on more relevance when you contemplate that the Pixar movies start out like a newborn baby—ugly. Every one of Pixar’s stories started out ugly. A new idea or thought is hard to define initially, so it’s likely not very attractive upon conception. Consequently, the idea or concept requires lots of protection and nurturing in its formative days. Every new idea and new concept in any field that involves creativity and innovation needs protection in its “baby” stage. Pixar is set up to protect the director’s ugly baby and their success speaks for itself (see Figure 1). ![]() Figure 1: Box Office Mojo Let Your Analytic Models SuckMuch like the innovative approach taken by Pixar directors in developing their industry-leading movies, folks in the data science space need to understand that their analytics baby is going to start out “ugly” as well. The first iteration of the analytic model probably won’t look pretty and, in fact, probably won’t yield much in the form of any new or actionable insights. But it’s critically important to all involved parties—data scientists, business stakeholders and senior management alike—that the data science team protect the ugly baby and let it grow during the formative stages of the model development. For example, in a current vision workshop that we are running in the area of improving product quality and testing effectiveness, one of our data scientist grabbed industry production data from the data.gov website and wanted to explore whether there was any correlation between industry growth and the potential demand spikes for our customer’s products (see Figure 2). This analysis doesn’t really tell us anything on first glance. Pretty rudimentary analysis, but it’s a good starting point (yep, the baby is sort of ugly at this point). But then we start to ask additional questions, such as:
In another example, we’re trying to predict line stops, so we are scouring the data looking for any correlations and anomalies that might point to some predictors or lead indicators for line stops (see Figure 3). ![]() Figure 3: Analyzing Line Stop Data This analysis provides a great starting point, but the baby is ugly because there is nothing here that is yet actionable. The analysis begs the next level of questions or inquiry, such as:
Manage Expectations: The Baby Will Become BeautifulOrganizations that may not be familiar with the data science “exploration, develop, test, explore, refine, test, explore…” process should take some level of comfort in the fact that this is not new and that there exists a well-defined process, called the Cross Industry Standard Process (CRISP) for Data Mining. This process defines how data scientists are going to convert the ugly baby into something beautiful (see Figure 4). ![]() Figure 4: Cross Industry Standard Process (CRISP) for Data Mining It takes time, but it works as a natural process based on curiosity (get some results, as some more questions, get some data, and get some more results). Stay Curious My FriendLet me reiterate a couple of key points from the Pixar approach:
It’s critically important that the analytic model is allowed to start off ugly and that it goes through its natural failure and growth pattern. And there will be many, many failures along the discovery path. And that’s good, because acknowledging and dealing with initial failures helps you move that much closer to a successful end result.
“Failure is success in progress” “I have not failed. I’ve just found 10,000 ways that won’t work.” And to quote a third innovator, “Be patient and stay curious, my friend.” |
