Data Scientists 4.0 The 4th Industrial Revolution was publicly announced in 2011 at the Hannover Fair (1). Since then, many resources have appeared around the so cal...
I have completed the following courses through Coursera: R Programming, Getting and Cleaning Data – Johns Hopkins University; Introduction to SQL – University of Mic...
Spark is a powerful tool which can be applied to solve many interesting problems. Some of them have been discussed in our previous posts. Today we will consider another i...
The vast possibilities of artificial intelligence are of increasing interest in the field of modern information technologies. One of its most promising and evolving direc...
Spark SQL is a part of the Apache Spark big data framework designed for processing structured and semi-structured data. It provides a DataFrame API that simplifies and accele...
Spark’s primary abstraction is a distributed collection of items called a Resilient Distributed Dataset (RDD). It is a fault-tolerant collection of elements which allow...
My nephew’s a very impressive young man. Five years ago, he received a PhD in Biochemistry/Molecular Biology from a prestigious university, earning numerous teachin...
2018 is set to be the year data finally delivers for both businesses and consumers. Alex Comyn, chief strategy officer at Amaze, explores 8 key trends that are set to imp...
Critically reading scientific papers is essential for data scientists working in some areas – especially those working in health. With that in mind, here are some key c...
Clear up the confusion about how all-encompassing terms like artificial intelligence, machine learning, and deep learning differ. Machine learning and artificial intellig...