Burak Himmetoglu's Blog (7)

Pancake: A Python package for model stacking

In a previous post, I discussed model stacking, a popular approach in data science competitions for boosting predictive performance. Since then, the post has attracted some attention, so I have decided to put together a Python package that provides a simple API for stacking models with minimal effort.

In this post, I will present the …

Added by Burak Himmetoglu on January 14, 2019 at 10:42pm — No Comments

An overview of feature selection strategies

Introduction

Feature selection and engineering are the most important factors affecting the success of predictive modeling. This remains true even today, despite the success of deep learning, which comes with automatic feature engineering. Parsimonious and interpretable models provide simple insights into business problems and are therefore considered very valuable. Furthermore, on many occasions the underlying size and structure of the data being analyzed may not allow the use…

Added by Burak Himmetoglu on September 19, 2018 at 9:58pm — No Comments

Time series classification with Tensorflow

Time-series data arise in many fields, including finance, signal processing, speech recognition and medicine. A standard approach to time-series problems usually requires manual engineering of features, which can then be fed into a machine learning algorithm. Engineering features generally requires some domain knowledge of the discipline in which the data originated. For example, if one is dealing with signals (e.g. classification of EEG signals), then possible features would involve…

Added by Burak Himmetoglu on August 22, 2017 at 7:00am — 6 Comments

Feature Engineering with Tidyverse

In this blog post, I will discuss feature engineering using the Tidyverse collection of libraries. Feature engineering is crucial for a variety of reasons, and it requires some care to produce any useful outcome. In this post, I will consider a dataset that…

Added by Burak Himmetoglu on April 10, 2017 at 7:30am — No Comments

Yet another introduction to Neural Networks

There are many great tutorials on neural networks that one can find online nowadays. Simply searching for the words “Neural Network” will produce numerous results on GitHub Gist. Even though there are many examples floating around on the web, I decided to write my own Introduction to Neural Networks!

In my tutorial, I specifically tried to illustrate the use of Python classes to define layers in the network as objects. Each layer object has forward and backward propagation methods which…

Added by Burak Himmetoglu on February 7, 2017 at 2:30pm — 3 Comments

Deciphering the Neural Language Model

Recently, I have been working on the Neural Networks for Machine Learning course offered by Coursera and taught by Geoffrey Hinton. Overall, it is a nice course and provides an introduction to some of the modern topics in deep learning. However, there are instances where the student has to do lots of extra work in order to understand the topics covered in full detail.

One of the assignments in…

Added by Burak Himmetoglu on December 17, 2016 at 10:00am — No Comments

Stacking models for improved predictions: A case study for housing prices

This blog was originally published on my website.

If you have ever competed in a Kaggle competition, you are probably familiar with the practice of combining different predictive models for improved accuracy, which will creep your score up the leaderboard. While it is widely used, there are only a few resources that I am aware of where a clear description is available (One that I know of is …

Added by Burak Himmetoglu on December 1, 2016 at 6:00pm — No Comments

© 2019 Data Science Central ®
