Here is our selection of featured articles, technical contributions, and forum questions posted since Monday:
Added by Vincent Granville on January 31, 2019 at 10:00am — No Comments
Learn about CART in this guest post by Jillur Quddus, a lead technical architect, polyglot software engineer and data scientist with over 10 years of hands-on experience in architecting and engineering distributed, scalable, high-performance, and secure solutions used to combat serious organized crime, cybercrime, and fraud.
Although both linear regression models allow and logistic regression models allow us to predict a categorical outcome, both of these models assume…Continue
Added by Packt Publishing on January 31, 2019 at 4:09am — No Comments
The first book is posted on data science…Continue
This article is about Intuitive explanation of Degrees of Freedom and How Degrees of Freedom affects Sudoku.
A lot of aspiring Data Scientists take courses on statistics and get befuddled with the concept of Degrees of Freedom. Some memorize it by rote as ‘n-1'.
But there is a intuitive reason why it is ‘n-1’.…Continue
Artificial Intelligence and the technology behind it are growing at a furious pace. Marketers have realized its vast potential and are striving to extract the technology’s opportunities in full. There are numerous advancements being made in this regard, and many organizations have taken center stage of the AI world with in depth data analysis and data…Continue
Added by Ronald van Loon on January 29, 2019 at 10:22pm — No Comments
Diving into the many underlying trends throughout the entire 1.5 Billion rows of NYC Taxi data with Pivot Billion…
Added by Benjamin Waxer on January 29, 2019 at 7:26am — No Comments
When building applications that ingest a large amount of customer data sets, what is your preferred method of data transfer? Which APIs do you leverage to acquire and transmit?
Added by Fahad Zaidi on January 29, 2019 at 4:19am — No Comments
This article was written by Sondos Atwi.
What is Cross-Validation?
In Machine Learning, Cross-validation is a resampling method used for model evaluation to avoid testing a model on the same dataset on which it was trained. This is a common mistake, especially that a separate…Continue
Added by Andrea Manero-Bastin on January 28, 2019 at 11:30pm — No Comments
Summary: Not enough labeled training data is a huge barrier to getting at the equally large benefits that could be had from deep learning applications. Here are five strategies for getting around the data problem including the latest in One Shot Learning.
I had an interesting discussion with one of my son's friends at a neighborhood gathering over the holidays. He's just reached the halfway point of a Chicago-area Masters in Analytics program and wanted to pick my brain on the state of the discipline.
Of the four major program foci of business, data, computation, and algorithms, he acknowledged…Continue
Added by steve miller on January 28, 2019 at 8:24am — No Comments
Understanding customer transactional behaviour pays well for any business. With the tsunami of start ups in recent times and the immense money flow in businesses, customers find lucrative offers from companies for acquisition, retention & referrals strategies. Understanding transactional behaviour of a customer has become even more complex with the invent of new business houses everyday. Although, with the rise of powerful machines, one can…Continue
Added by PS Dhillon on January 28, 2019 at 4:00am — No Comments
First days after the celebration of the New Year is the time when looking back we can analyze our actions, promises and draw conclusions whether our predictions and expectations came true. As 2018 came to its end, it is perfect time to analyze it and to set trends for the next year. The amount of data generated every minute…Continue
Added by Igor Bobriakov on January 28, 2019 at 2:00am — No Comments
Guest blog by Igor Bobriakov.
First days after celebration of the New Year is the time when looking back we can analyze our actions, promises and draw conclusions whether our predictions and expectations came true. As 2018 came to its end, it is perfect time to analyze it and to set trends for the next year.…Continue
Added by Capri Granville on January 27, 2019 at 9:30am — No Comments
Added by Vincent Granville on January 27, 2019 at 9:00am — No Comments
One can seriously argue about what programming language is the best for data analysis, but there is one universal metric that can define your choice: speed of calculations. Therefore, the word "best" in the title means the languages that lead to most performant applications. If most performant program can also be written in an easy-to-use, easy-to-learn, dynamically-typed…
Added by jwork.ORG on January 26, 2019 at 2:54pm — No Comments
Added by Krishna Pera on January 26, 2019 at 5:45am — No Comments
New home construction plays a significant role in housing economy, while simultaneously impacting other sectors such as timber, furniture and home appliances. New house sales is also an important indicator of country’s overall economic health and direction. In the last 50 years there has been few significant bumps and turning points in this sector that shaped the trajectory of the overall economy. Here I review the…Continue
Added by Mab Alam on January 25, 2019 at 8:15pm — No Comments
Call it a “Forrest Gump moment;” an instance of being in the right place at the right time for no other reason than just plain luck. A “Forrest Gump moment” is based upon Tom Hanks’ character in the movie “Forrest Gump,” a guy who always seemed to be in the right place at the right time meeting Presidents Kennedy, Johnson and Nixon at critical points in American history.
I too have had a Forrest Gump moment in meeting President Reagan, however, my deeper Forrest Gump…Continue
Added by Bill Schmarzo on January 25, 2019 at 1:50pm — No Comments
SIP application server (AS) text logs analysis may help in detection and, in some specific situations, prediction of different types of issues within a VoIP network. SIP server text logs contain the information which is difficult to obtain or even cannot be obtained from other sources, such as CDRs or signaling traffic captures.
The following parameters, among others, can help in estimating…Continue
Added by Ilya Selitser on January 25, 2019 at 12:00am — No Comments
This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, decision trees, ensembles, correlation, Python, R, Tensorflow, SVM, data reduction, feature selection, experimental design, cross-validation, model fitting, and many more. To keep receiving these articles, sign up on…Continue
Added by Vincent Granville on January 24, 2019 at 12:30pm — No Comments