I was trying to find some good domain name for our upcoming business science website, when something suddenly became clear to me. Many of us have been confused for a long time about what data science means, how it is different from statistics, machine learning, data mining, or operations research, and the rise of the data scientist light - a new species of coders who call themselves data scientist after a few hours of Python/R training, working on a small project at best, and spending $200 for their training. The data scientist light is not a real one, even though I believe that you can learn data science from scratch on the job, just as I did.
This introduction brings me to the ABCD's, and the arguments are further developed in my conclusion below. These four domains are certainly overlapping. But I believe that identifying them brings more clarity about roles differentiation and collaboration.
I finally decided to call myself business scientist, as my experience is more and more aligned with this domain (being an entrepreneur), though, like many of us here, I have significant knowledge and expertise in all four domains, especially in data science and analytics science. My motivation to call myself a business scientist is also partly to not be confused with a data scientist light. This erroneous statement is sometimes brought against us (real) data scientists, by a minority of vocal analytics scientists. I believe that we need to dispel this myth. Part of the reason, I believe, is because math-free solutions that in addition, trade accuracy for robustness (in order to fit in black-box systems or be usable by the layman) are not respected by some traditional statisticians, who erroneously believe that automation and/or removing statistical jargon and mathematical background, is not possible. Maybe because it could jeopardize their jobs?
In the end, I want to make data science accessible to everyone, not to an elite of initiated, change-adverse professionals. It requires a new, unified, simple, efficient, math-free or math-light (but not data science light) approach to analytics problems and solutions, as well as algorithmic ingeniosity. This is feasible, but more difficult than producing extremely complicated statistical models - which is what I was doing earlier in my career.
DSC Resources
Additional Reading
Follow us on Twitter: @DataScienceCtrl | @AnalyticBridge
Comment
The definitions appear somewhat ad hoc. I agree that Data Science is new, and we are still competing for good definitions for it. I've sketched my own outline for such a definition in a blog on this site:
http://www.datasciencecentral.com/profiles/blogs/control-the-uncle-...
But I'm more concerned that the definitions here just generally miss the mark. For instance, Computer Science has been around for more than a half century, and we know pretty much what it is. I know a bit of Computer Science having earned advanced degrees in the subject and studying and doing it for several decades. It is not really the stuff discussed in this post, which is largely systems integration. It's like saying fine arts is about mixing paint. Computer Science is the study of complexity: the complexity of problems, solutions, and systems. That idea isn't new. Harold Abelson and Gerald Sussman articulated it in the 1980s. And it's a good description of Computer Science because it allows us to reason about what things we should or shouldn't do with Computer Science. It's also consistent with scientific method.
Data Science, though newer, appears to be quite different from what is described in this post. Let us assume that the people who consider themselves data scientists--myself included--do not wish to abuse the term "science". And speaking for myself, Data Science is the extraction of meaning from observations. In order to do Data Science, we have to verify that the meanings extracted are consistent with the real world. This leads to data driven refinement processes that are analogous to the processes carried out by other scientific instruments.
The definitions in this post also leave out the extremely important field of Simulation Science, which mirrors Data Science, except from a model-driven vice a data-driven perspective. Where Data Science refines what we observe about the world, Simulation Science refines what we believe about the world. Data Science and Simulation Science are like the left hand and the right hand of the same being, and it's really difficult to do either without the other. One will often dominate, but they go hand-in-hand. And this actually gets to Irv Lustig's question about optimization, because penultimate step in both the Data Science and Simulation Science processes is based on optimization. Cutting to the chase, these refinement processes take stuff we find in the world (observations or beliefs) and refine them into actionable intelligence about the world, the next step is to turn actionable intelligence into actions, and that is a control problem which relies on optimization.
I'm curious why you titled this post "The ABCD's of Business Optimization". I.e., why use the word "optimization"?
© 2021 TechTarget, Inc.
Powered by
Badges | Report an Issue | Privacy Policy | Terms of Service
Most Popular Content on DSC
To not miss this type of content in the future, subscribe to our newsletter.
Other popular resources
Archives: 2008-2014 | 2015-2016 | 2017-2019 | Book 1 | Book 2 | More
Most popular articles
You need to be a member of Data Science Central to add comments!
Join Data Science Central