publicdomainpictures.net Part I of this series described the data feudalism we currently have, why we have it, and how it might be possible for at least some of us to esc...
Picture Courtesy: Freepik The article explains the algorithm behind the recently introduced Python package named PyHard, based on the concept of Instance Space Analysis. ...
Data quality is critical in any web scraping or data integration project. Data-driven businesses rely on customer data, it helps their products, provides valuable insight...
In my prior blog “Reframing Data Management: Data Management 2.0″, I talked about the importance of transforming data management into a business strategy that...
Note: Many thanks to those who provided comments on my previous blog “Features Part 1: Are Features the new Data?”. And special thanks to Somil Gupta and Ha...
When a business enters the domain of data management, it is easy to get lost in a flurry of promises, brochures, demos and the promise of the future. In the first article...
Source: see last section in this article Of course this question begs for the answer that it is both an art and a science. I view it more like craftsmanship. This article...
Data engirds the entire world. Data is evolving just like any other thing on this globe. Being a part of this tech-oriented world, today we human beings create as much in...
A common challenge for teams working on video machine learning applications is how to scale and automate their ML lifecycle when working with these types of large unstruc...
Logical data warehouse’, ‘data fabric’, and ‘data mesh’ are just a few of the names for new modern data architecture paradigms that are being promoted as the wa...