Based on requests from clients - vendors of data processing platforms and products - as well as trends in popular blogs,  job postings, and my own reading. Here are a few topics recently gaining strong traction (items beyond #13 were recently added)::

  1. The rise of data plumbing, to make big data run smoothly, safely, reliably, and fast through all "data pipes" (Internet, Intranet, in-memory, local servers, cloud, Hadoop clusters etc.), optimizing redundancy, load balance, data caching, data storage, data compression, signal extraction, data summarization and more. We bought the domain name DataPlumbing.com last week.
  2. The rise of the data plumber, system architect, and system analyst (a new breed of engineers and data scientists), a direct result of the rise of data plumbing
  3. Use of data science in unusual fields such as astrophysics, and the other way around (data science integrating techniques from these fields)
  4. The death of the fake data scientist
  5. The rise of the right-sized data (as oppose to big data). Other keywords related to this trend is "light analytics", big data diet", "data outsourcing", the re-birth of "small data". Not that big data is going away, it is indeed getting bigger every second, but many businesses are trying to leverage an increasingly smaller portion of it, rather than being lost in a (costly) ocean of unexploited data.
  6. Putting more intelligence (sometimes called AI or deep learning) into rudimentary big data applications (currently lacking any true statistical science) such as recommendation engines, crowdsourcing or collaborative filtering. Purpose: detecting and eliminating spam, fake profiles, fake traffic, propaganda, attacks, scams, bad recommendations and other abuses, as early as possible.
  7. Increased awareness of data security and protection, against computer or business hackers.
  8. The rise of mobile data exploitation. For instance processing billions of text messages to detect the spread of a disease or other global risks, to help design alarm systems or market the right product in real-time (via opt-in, user-customized text messages) to a walking customer in a shopping mall. Not sure that even the NSA is capable of doing it as of today. The issue is more about capturing and reacting to the right signal, rather than absorbing/digesting big data. Another trend is optimization of revenue from mobile apps, leveraging mobile app dashboards.
  9. The rise of the "automated statistician", in short, automated, scalable, robust analytic solutions fit for batch processing, real-time, machine-to-machine communications, and black-box analytics used by non-experts. More on this in our upcoming book, entitled data science 2.0.
  10. Predictive modeling without models. Operations research and mathematicians contributing to the science of predicting, bringing mathematical optimization and simulation as an alternative to delicate and mysterious statistical models.
  11. High performance computing (HPC) which could revolutionize the way algorithms are designed.
  12. Increased collaboration between government agencies worldwide to standardize data and share it, for intelligence purposes. Imagine the census bureau sharing data with the IRS. Or banks in US sharing data with security agencies in Switzerland.
  13. Forecasting space weather (best time / best location lo land on Mars), and natural events on Earth (volcanoes, Earthquakes, undersea weather patterns and implications to humans, when will Earth's magnetic field flip). 
  14. Use of data science for automated content generation (including content aggregation and classification); for automated correction of student essays; data science used in court to strengthen the level of evidence - or lack of - against a defendant;  for plagiarism detection; for car traffic optimization and to compute optimum routes; for identifying, selecting and keeping ideal employees; for automated IRS audits sent to taxpayers to avoid costly litigation and time wasting; for urban planning; for precision agriculture
  15. Measuring yield of big data or data science initiatives (that is, benefit after software and HR costs, over baseline)
  16. Digital health: diagnostic/treatment offered by a robot (artificial intelligence, decision trees) and/or remote doctors; digital law: same thing, with attorneys replaced by robots, at least for mundane cases or tasks. Even lawyers and doctors could have their jobs replaced by robots! This assumes that a lot of medical or legal data gets centralized, processed and made well structured for easy querying, updating and retrieval by (automated) deep learning systems.
  17. Analytic processes (even in batch mode) accessible from your browser anywhere on any device. Growth of analytics apps and APIs.

DSC Resources

Additional Reading

Follow us on Twitter: @DataScienceCtrl | @AnalyticBridge

Views: 43578

Tags: predictive modeling


You need to be a member of Data Science Central to add comments!

Join Data Science Central

Comment by Andy Vidan on June 30, 2020 at 7:31am

Just came across this article again....amazing how even as the field matured in many ways, the listed points are all still spot on. Most efforts we're seeing is around pushing data science (data-driven products) into production .... and practitioners are now consolidating around DataOps principles.

Composable.ai has been focused on building an end-to-end composable dataops platform to provide the required capabilities.

Comment by Chintan Donda on November 9, 2015 at 11:38pm

There must be something like DATA INSURANCE to cover the Data Security & Data Lifespan. 

Comment by Lars Fiedler on November 21, 2014 at 4:03pm

Dataflow is huge right now.  I'd throw in dataflowanalytics.com into the mix.

Comment by Big Data Queen on November 19, 2014 at 10:11am
Vincent, Big Data is surely a big deal. We definitely are seeing an increase in activity with companies responding to the impact big data has made on their business. For companies any size, getting meaningful insights from data analytics is an important priority. LexisNexis has open sourced its HPCC Systems big data platform which represents more than a decade of internal research and development in the big data analytics field. Designed by data scientists, their built-in libraries for Machine Learning and BI integration provide a complete integrated solution from data ingestion and data processing to data delivery. More at http://hpccsystems.com

© 2021   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service