Michael Malak
  • Male
  • Denver, CO
  • United States
Michael Malak's Friends

  • Sam Johnson

Michael Malak's Page

Latest Activity

Alfred liked Michael Malak's blog post Choropleth in D3.js and Pandas (iPython Notebook)
Apr 28, 2016
Michael Malak replied to L.V.'s discussion The scariest use of machine learning
"In terms of killing thousands by flying machines based on machine learning, this is already happening. http://arstechnica.co.uk/security/2016/02/the-nsas-skynet-program-may-be-killing-thousands-of-innocent-people/ In this case, your nightmare…"
Mar 28, 2016
Michael Malak commented on Zygimantas Jacikevicius's blog post Top 5 graph visualisation tools
"I believe you have confused GraphX for .NET http://www.panthernet.ru/en/projects-en/graphx-en with Apache Spark GraphX http://spark.apache.org/graphx/ From what I can tell, the two projects have nothing to do with each other, other…"
Feb 10, 2016
Gürol Canbek liked Michael Malak's blog post Free NAS eBook: "Frontiers in Massive Data Analysis"
Jan 18, 2016
Michael Malak commented on Vincent Granville's blog post How to compete against data scientists charging $30/hour
"Vincent: you inspired my blog post today: The End of Data Science As We Know It"
Mar 6, 2014
Michael Malak posted a blog post

5 Data Science Sins To Beware

InformationWeek has an interview this week with resident Data Science Central blogger Michael Walker about the most common traps awaiting data scientists: http://www.informationweek.com/big-data/news/big-data-analytics/5-data-science-sins-to-beware/240162426
Oct 10, 2013
Michael Malak's blog post was featured

Free NAS eBook: "Frontiers in Massive Data Analysis"

A new 191-page PDF eBook published by the National Academies of Sciences Press is available, "Frontiers in Massive Data Analysis," and can be downloaded for free (after free website registration): http://www.nap.edu/catalog.php?record_id=18374 The first 9 of the 10 chapters offer a comprehensive survey of state-of-the-art big data architectures, machine learning, and analysis techniques. Chapter 10 really shines as it offers a new…
Sep 23, 2013
Michael Malak posted a blog post

Free NAS eBook: "Frontiers in Massive Data Analysis"

A new 191-page PDF eBook published by the National Academies of Sciences Press is available, "Frontiers in Massive Data Analysis," and can be downloaded for free (after free website registration): http://www.nap.edu/catalog.php?record_id=18374 The first 9 of the 10 chapters offer a comprehensive survey of state-of-the-art big data architectures, machine learning, and analysis techniques. Chapter 10 really shines as it offers a new…
Sep 23, 2013
Michael Malak liked Stephen Penn, DM, PMP's blog post How to Avoid Political Blunders in Analytical Discussions
Aug 6, 2013
Michael Malak's blog post was featured

Choropleth in D3.js and Pandas (iPython Notebook)

There have been various attempts to integrate the D3.js visualization framework into iPython Notebook, in order to provide more visualization options than available with the standard Matplotlib. In my blog post today, I take one of the better integration attempts out there, port it from Windows to the Mac, and demonstrate:
1. Passing a Pandas Dataframe from iPython Notebook into the D3.js Javascript
2. Generating geo color maps in D3.js (not a built-in capability) by pulling geo shape data from…
Jul 30, 2013
Michael Malak posted a blog post

Choropleth in D3.js and Pandas (iPython Notebook)

There have been various attempts to integrate the D3.js visualization framework into iPython Notebook, in order to provide more visualization options than available with the standard Matplotlib. In my blog post today, I take one of the better integration attempts out there, port it from Windows to the Mac, and demonstrate:
1. Passing a Pandas Dataframe from iPython Notebook into the D3.js Javascript
2. Generating geo color maps in D3.js (not a built-in capability) by pulling geo shape data from…
Jul 29, 2013
Michael Malak commented on Michael Walker's blog post Data Science Summer Reading List 2013
"Doing Data Science won't be available until Sep. 22, which is after the fall equinox :-) Delivering Business Analytics: Practical Guidelines for Best Practice by Evan Stubbs, published just this past February, is a good book on managing a data…"
Jul 11, 2013
Michael Malak posted a blog post

SparkGrams: compact in-spreadsheet histograms

My new blog post on what I coined as "sparkgrams". Included is an implementation in YUI3 for custom website presentations of data, but I wish R and iPython Notebook had similar functionality. http://technicaltidbit.blogspot.com/2013/06/histogram-thumbnails-inside-yui3-data.html
Jun 18, 2013
Michael Malak posted a blog post

Spark Streaming: Real-time Hadoop

Spark and Spark Streaming are two components of the "Berkeley Data Analytics Stack" (BDAS). Spark Streaming is one of the few open source options available for "Real-time Big Data". See my slides and 35-minute presentation from last night, which was part of Global Big Data Week: http://technicaltidbit.blogspot.com/2013/04/presentation-on-spark.html
Apr 24, 2013
Michael Malak liked Michael Malak's blog post Automatically deskew before machine learning in R
Mar 21, 2013
Joel Pulliam commented on Michael Malak's blog post Automatically deskew before machine learning in R
"Well done!"
Mar 18, 2013

Profile Information

Short Bio
Spark developer for two Fortune 200 firms since 2013Q1. Author of upcoming book Spark GraphX In Action. VP Data Science Association.
My Web Site Or LinkedIn Profile
http://technicaltidbit.com
Professional Status
Technical
Your Company:
Oracle
Interests:
Networking
What is your Favorite Data Mining or Analytical Website?
http://datascienceassn.org

Michael Malak's Blog

5 Data Science Sins To Beware

Posted on October 10, 2013 at 8:26am 0 Comments

InformationWeek has an interview this week with resident Data Science Central blogger Michael Walker about the most common traps awaiting data scientists:

http://www.informationweek.com/big-data/news/big-data-analytics/5-data-science-sins-to-beware/240162426

Free NAS eBook: "Frontiers in Massive Data Analysis"

Posted on September 23, 2013 at 9:35am 0 Comments

A new 191-page PDF eBook published by the National Academies of Sciences Press is available, "Frontiers in Massive Data Analysis," and can be downloaded for free (after free website registration):

http://www.nap.edu/catalog.php?record_id=18374

The first 9 of the 10 chapters offer a comprehensive survey of state-of-the-art big data architectures, machine learning, and analysis techniques.

Chapter 10 really…

Choropleth in D3.js and Pandas (iPython Notebook)

Posted on July 29, 2013 at 4:23am 0 Comments

There have been various attempts to integrate the D3.js visualization framework into iPython Notebook, in order to provide more visualization options than available with the standard Matplotlib. In my blog post today, I take one of the better integration attempts out there, port it from Windows to the Mac, and demonstrate:

1. Passing a Pandas Dataframe from iPython Notebook into the D3.js Javascript

2. Generating geo color maps in D3.js (not a built-in…

SparkGrams: compact in-spreadsheet histograms

Posted on June 18, 2013 at 5:17am 0 Comments

My new blog post on what I coined as "sparkgrams".  Included is an implementation in YUI3 for custom website presentations of data, but I wish R and iPython Notebook had similar functionality.

http://technicaltidbit.blogspot.com/2013/06/histogram-thumbnails-inside-yui3-data.html
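For readers who want to try the idea outside a web page, here is a minimal sketch of a compact, cell-sized histogram in Python. It is not the YUI3 implementation from the linked post: the function name `sparkgram`, the equal-width binning, and the use of Unicode block characters in place of graphical thumbnails are all assumptions made for illustration.

```python
# Eight block characters, from shortest to tallest bar.
BLOCKS = "▁▂▃▄▅▆▇█"

def sparkgram(values, bins=8):
    """Return a one-line text histogram of `values` (hypothetical helper).

    Values are grouped into `bins` equal-width bins; each bin is drawn as
    one block character whose height is proportional to the bin's count.
    Empty bins are drawn with the shortest block.
    """
    if not values:
        return ""
    lo, hi = min(values), max(values)
    # Fall back to a width of 1.0 when every value is identical.
    width = (hi - lo) / bins or 1.0
    counts = [0] * bins
    for v in values:
        # Clamp so the maximum value lands in the last bin.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    peak = max(counts)
    return "".join(BLOCKS[round(c / peak * (len(BLOCKS) - 1))] for c in counts)

if __name__ == "__main__":
    print(sparkgram([1, 2, 2, 3, 3, 3, 4, 4, 5, 9], bins=6))
```

Because the result is a short string, it can sit in a table cell next to a column's summary statistics, which is the kind of in-cell use the post's title suggests.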

© 2018 Data Science Central™