[Cheat Sheet] Python Basics For Data Science
The use of Python as a data science tool has been on the rise over the past few years: 54% of the respondents of the latest O’Reilly… Read More »[Cheat Sheet] Python Basics For Data Science
The use of Python as a data science tool has been on the rise over the past few years: 54% of the respondents of the latest O’Reilly… Read More »[Cheat Sheet] Python Basics For Data Science
This is part of a new series of articles: once or twice a month, we post previous articles that were very popular when first published.… Read More »22 Great Blogs Posted in the last 12 Months
Summary: Convolutional Neural Nets are getting all the press but it’s Recurrent Neural Nets that are the real workhorse of this generation of AI. We’ve… Read More »Recurrent Neural Nets – The Third and Least Appreciated Leg of the AI Stool
(This post originally appeared on recurrentnull.wordpress.com, as first part in a series on sentiment analysis of movie reviews.) Imagine I show you a book review,… Read More »Sentiment Analysis of Movie Reviews (1):Bag-of-Words Models
Recently, in a previous post, we reviewed a path to leverage legacy Excel data and import CSV files thru MySQL into Spark 2.0.1. This may apply… Read More »Migrating an Excel Spreadsheet Directly to HDFS and Spark 2.0.1 (Part 2)
Data transfer – what it is? Whilst you are online, everything is about transfer of data – thus, emails and web pages are basically a… Read More »Home Internet Data Usage – FAQS
This article was written by Ariful Mondal. Artful is a senior manager, data science and big data analytics consultant at Tata Consultancy Services. 1. Introduction This is… Read More »Classifications in R: Response Modeling/Credit Scoring/Credit Rating using Machine Learning Techniques
A theme in my blogs is how the “structure” of data – rather than just the “content” – affects what that data can say and… Read More »Structural Accommodation
This post is a brief review of leading Data Integration tools in the market. Heavily referencing from the Gartner 2016 report and peer reviews from… Read More »Data Integration Tools – Market Study
Contributed by Rob Castellano. He is currently in the NYC Data Science Academy 12 week full time Data Science Bootcamp program taking place between April 11th to July 1st, 2016. This… Read More »Cultural Institutions of New York City: Data, Analysis, R Code, Visu