Web scraping, also known as web harvesting and web data extraction, basically refers to collecting data from websites via the Hypertext Transfer Protocol (HTTP) or ...
Introduction Due to COVID19, we already have 1000 deaths in Italy. Two months ago, these folks, many of them elderly, would have been celebrating Christmas and looking fo...
Much like we have Chemical Engineering and Electrical Engineering and Mechanical Engineering, it is time to formalize of field of Data Engineering. This is a special tw...
As I write this blog, we are still in the early stages of the coronavirus crisis. It is a scary situation which has caused hoarding, panic, fake news, lies, stock marke...
1. Introduction Pandas First, Pandas is an open source Python library for data analysis. It contains data manipulation and data structures tools designed to make spreadsh...
We are very happy to announce that DSC has found a new home: Tech Target (Nasdaq: TTGT), the global leader in purchase intent-driven marketing and sales services, has a 2...
A few years ago, in a Q&A session following a presentation I gave on data analysis (DA) to a group of college recruits for my then consulting company, I was asked to ...
An unpublished experimental report may have an interesting new way to understand the unexpected result of the Michelson-Morley Experiment of 1887. The new understanding...
Introduction Kubernetes is being described as the next ‘Java’ i.e. it is fast becoming an endemic/ underlying platform for the whole industry just like the Java progr...
Edge computing is a hot topic and carries with it some confusion, particularly around storage. Handling data properly at the edge can ensure a scalable, cost-effective an...