“What if we add these variables?..” is a deadly type of a question that can ruin your analytic project. Now, while curiosity is the best friend of a data scientist, there’s a curse that comes with it – some call it analysis paralysis, others – just over-analysis, but I call these situations “analytic rabbit holes”. As you start any data science project – be it an in-depth statistical research, machine learning model, or a simple business analysis – there…Continue
Added by Karolis Urbonas on March 28, 2017 at 2:30am — No Comments
Analytics needs a evangelist! Without such a person, you just don’t get the impact that Analytics actually is capable of providing! Mostly this evangelist needs to be right at the top, the CEO! I have worked with a range of industries & everywhere the degree of impact shoots up once you have a CXO who is evangelising this change.
Of course, some CMOs have led their organizations into embracing the practice, including John Costello, former exec VP-CMO of Home…Continue
Added by Ajay Kelkar on March 27, 2017 at 8:30pm — No Comments
This article was posted on Data Flair. Below is a quick overview of the original article.
This tutorial provides introduction to Apache Spark, what are its ecosystem components, Spark abstraction – RDD, transformation and action. The objective of this introductory guide is to provide detailed overview of…Continue
Added by Emmanuelle Rieuf on March 27, 2017 at 4:00pm — No Comments
Is big data just for the big players? Many SME owners shy away from taking advantage of big data in their business operations. However, commentators (including experts at Forbes) have been challenging the presumptions behind this practice. Big data is indeed useful for smaller businesses, they argue. All that is needed is a concrete strategy for how best to deploy big data in the service of smaller…Continue
Added by James Pointon on March 26, 2017 at 11:00pm — No Comments
In order to prevent my programs from freezing up while running long calculations, I generally run the calculations on separate threads. In Java, this process can be accomplished by separating the GUI from processing. In the code below, a thread for an instance of MyProcessing would be invoked using start(): e.g. “(new MyProcessing()).start();” would run indefinitely until T is made null. T can be made null by calling stop() or by directly making T null. Often when the GUI is closing, I…Continue
Added by Don Philip Faithful on March 25, 2017 at 9:42am — No Comments
Added by Sigrid Keydana on March 25, 2017 at 7:52am — No Comments
Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week.
Featured Resources and Technical ContributionsContinue
Added by Vincent Granville on March 25, 2017 at 7:30am — No Comments
These videos display data sets with a time dimension: each frame in the video shows the data set at a given time, thus showing how the data evolved over time. To learn about how these videos were produced, click here and also here and…Continue
Fintech industries have been catching up quite well in blending advance analytics with RPA (Robotic Process Automation) implementation in its business process in the journey of digital transformation. Human time is too expensive to be wasted in carrying out mundane and repetitive tasks. Data and advance Analytics are proving to be a huge differentiator in most of the businesses primarily in BFSI Sector. Business decisions…Continue
Added by Niraj Kumar on March 24, 2017 at 11:30pm — No Comments
This guest blog is contributed by Evans Walsh
For the remaining part of 2017, companies will have to come to terms with the new reality in town; it is not all about what you know. Instead, it is what you do with it. Various gurus have already said that 2017 is the year when Google Analytics and big data will go main stream. It is the year when analytics will not only reflect performances but also drive major businesses.
For companies that might be falling behind, this…Continue
Added by Shay Pal on March 24, 2017 at 6:30pm — No Comments
This article was written by Matthew Rubashkin. With a background in optical physics and biomedical research, Matthew has a broad range of experiences in software development, database engineering, and data analytics.
At SVDS, our R&D team has been investigating different deep learning technologies, from recognizing images of trains to speech recognition. We needed to build a pipeline for ingesting…Continue
Added by Emmanuelle Rieuf on March 24, 2017 at 12:30pm — No Comments
There was a recent publication of a story lamenting the shortage of Indian talent in Artificial Intelligence (AI) and related fields. While the article largely focused on the challenges tech startups face while recruiting AI talent, it’s clear from the conversations we have been having with established enterprises that the supply-demand imbalance for AI talent is as acute across company sizes and industries.
What’s the big problem?
As the article…Continue
Added by Rishabh Kaul on March 24, 2017 at 3:00am — No Comments
Last Sunday at Trivadis Tech Event, I talked about R for Hackers. It was the first session slot on Sunday morning, it was a crazy, nerdy topic, and yet there were, like, 30 people attending! An emphatic thank you to everyone who came!
R a crazy, nerdy topic, - why that, you'll be asking? What's so nerdy about using R?
Well, it was about R. But it was neither an introduction ("how to get things done quickly with R"), nor was it even about data science. True, you…
Added by Sigrid Keydana on March 24, 2017 at 2:30am — No Comments
Here is our selection of featured articles and resources posted since Monday:
Added by Vincent Granville on March 23, 2017 at 8:42am — No Comments
Recently, an HR expert told me that to progress in my career I’d need to spend at least 50% of my time managing laterally. Laterally as in not “up” to my boss or “down” to the team I lead.
My immediate instinct was to discard this flat out: after all, how would someone even measure this? Designing an experiment so large as to statistically prove the 50% is extremely tough. There are hundreds of factors contributing to every…Continue
Added by Catalin Ciobanu on March 23, 2017 at 5:30am — No Comments
An exploratory study must always be designed and executed in order to answer a number of a-priory questions. Our experience in dozens of scientific projects has allowed us identify the 5 mostly shared kinds of questions that…Continue
Added by Ray G. Butler on March 23, 2017 at 4:30am — No Comments
Machine learning is a very hot topic for many key reasons, and because it provides the ability to automatically obtain deep insights, recognize unknown patterns, and create high performing predictive models from data, all without requiring explicit programming…Continue
Added by Emmanuelle Rieuf on March 22, 2017 at 3:00pm — No Comments
From the previous post on “Poor Data Management Practices“, the discussion ended with a high level approach to one possible solution for data silos. Traditional approaches for solving the data silo problem can cost millions of dollars (even for a moderately sized company), and typically requires a huge effort in integration work (e.g., data modeling, system…Continue
Added by Randall Shane on March 22, 2017 at 1:30pm — No Comments
From my experience, English speakers can find the most jobs in the U.S. (West Coast, obviously), United Kingdom (London), Ireland, Netherlands (Amsterdam), Switzerland, and Belgium. New Zealand and Australia are pretty popular among developers who love the laid-back lifestyle.
But the scenarios change when we talk about non-English speaking nations. Japan is growing exponentially; Russia and China have a huge culture of programming, and IT companies are growing rapidly in these…Continue
Added by ARPIT MISHRA on March 22, 2017 at 1:00am — No Comments
This is part of a new series of articles: once or twice a month, we post previous articles that were very popular when first published. These articles are at least 6 month old but no more than 12 month old. The previous digest in this series was posted here a while back.
20 Great Blogs Posted in the last 12…Continue
Added by Vincent Granville on March 21, 2017 at 3:48pm — No Comments