In a previous post, I have provided a discussion of model stacking, a popular approach in data science competitions for boosting predictive performance. Since then, the post has attracted some attention, so I have decided to put together a Python package which provides a simple API to stack models with minimal effort.
In this post, I will present the …Continue
Added by Burak Himmetoglu on January 14, 2019 at 10:42pm — No Comments
Feature selection and engineering are the most important factors which affect the success of predictive modeling. This remains true even today despite the success of deep learning, which comes with automatic feature engineering. Parsimonious and interpretable models provide simple insights into business problems and therefore they are deemed very valuable. Furthermore, in many occasions the underlying size and structure of the data being analyzed may not allow the use…Continue
Added by Burak Himmetoglu on September 19, 2018 at 9:58pm — No Comments
Time-series data arise in many fields including finance, signal processing, speech recognition and medicine. A standard approach to time-series problems usually requires manual engineering of features which can then be fed into a machine learning algorithm. Engineering of features generally requires some domain knowledge of the discipline where the data has originated from. For example, if one is dealing with signals (i.e. classification of EEG signals), then possible features would involve…Continue
In this blog post, I will discuss feature engineering using the Tidyverse collection of libraries. Feature engineering is crucial for a variety of reasons, and it requires some care to produce any useful outcome. In this post, I will consider a dataset that contains description of crimes in San Francisco between…Continue
Added by Burak Himmetoglu on April 10, 2017 at 7:30am — No Comments
There are many great tutorials on neural networks that one can find online nowadays. Simply searching for the words “Neural Network” will produce numerous results on GithubGist. Even tough there are many examples floating around on the web, I decided to have my own Introduction to Neural Networks!
In my tutorial, I specifically tried to illustrate the use of Python classes to define layers in the network as objects. Each layer object has forward and backward propagation methods which…Continue
Recently, I have been working on the Neural Networks for Machine Learning course offered by Coursera and taught by Geoffrey Hinton. Overall, it is a nice course and provides an introduction to some of the modern topics in deep learning. However, there are instances where the student has to do lots of extra work in order to understand the topics covered in full detail.
One of the assignments in…Continue
Added by Burak Himmetoglu on December 17, 2016 at 10:00am — No Comments
This blog was originally published on my website.
If you have ever competed in a Kaggle competition, you are probably familiar with the use of combining different predictive models for improved accuracy which will creep your score up in the leader board. While it is widely used, there are only a few resources that I am aware of where a clear description is available (One that I know of is …Continue
Added by Burak Himmetoglu on December 1, 2016 at 6:00pm — No Comments