Data Analytics favorite Apache Spark, is progressing as a reference standard for Big Data, and a “fast and general engine for large-scale data processing”. In our previous post, we detailed how to expand ML tools using a PySpark kernel and leverage the …Continue
Added by Marc Borowczak on June 9, 2016 at 10:30am — No Comments
This article on a complete tutorial to learn Data Science with Pyhon from scratch, was posted by Kunal Jain. Kunal is a post graduate from IIT Bombay in Aerospace Engineering. He has spent more than 8 years in field of Data Science. He learned basics of Python within a week. And, since then, he has not only explored this language to the depth, but also has helped many other to learn this language.
Python was originally a general purpose language. But, over the years, with strong…
A rather comprehensive list of algorithms can be found here. Many are posted and available for free on Github or Stackexchange. Algoritmia provides developers with over 800 algorithms, though you have to pay a fee to access them.…Continue
Added by Emmanuelle Rieuf on June 9, 2016 at 9:30am — No Comments
What are opioids?
Opioids are medications used to treat moderate to severe pain that may not respond well to other pain medications. They reduce the sending of pain messages to the brain and reduce feelings of pain. They include codeine, fentanyl, hydrocodone, methadone, morphine and oxycodone. Hydrocodone medications are the most commonly prescribed for conditions such as dental and injury-related pain. Morphine is used before and after surgery for severe…Continue
Added by Salil Sheth on June 9, 2016 at 9:07am — No Comments
We are pleased to announce a new free e-book from Manning Publications: Exploring Data Science. Exploring Data Science is a collection of five chapters hand picked by John Mount and Nina Zumel, introducing you to various areas in data science and explaining which methodologies work best for each.…Continue
Added by John Mount on June 9, 2016 at 9:05am — No Comments
This blog was originally published on the AYLIEN Text Analysis blog.
We wanted to gather and analyze news content in order to look for similarities and differences in the way two journalists write headlines for their respective news articles and blog posts. The two reporters we selected operate in, and write about, two very different industries/topics and have two very different writing…Continue
This video was built as a result of our internal hackathon using Teradata Listener to absorb real time small messages from Transformers and other devices on the Power Grid in Southern California. The video demonstrates a real time predictive analytic showcasing proactive repairs of the power grid to reduce costs and avoid disruptions of power service.…
Added by John Thuma on June 9, 2016 at 1:00am — No Comments
This book was written for those who need to know how to collect, analyze and present data. It is meant to be a first course for practitioners, a book for private study or brush-up on statistics, and supplementary reading for general statistics classes. The book is untraditional, both with respect to the choice of topics and the presentation: Topics were determined by what is most useful for practical statistical work, and the presentation is as non-mathematical as possible. The book contains…Continue
Added by Birger S. Madsen on June 8, 2016 at 10:32pm — No Comments
I teach a masters program on Smart cities (Big Data/Data Science/IoT analytics) at Citysciences (part of the University of Madrid)
I am pleased to announce the AI / Deep Learning Lab for Future Cities - at University of…Continue
This article on "CERN just released 300 terabytes worth of data to the public", was posted by Alfredo Carpineti from IFLScience. Alfredo is an Italian Scientist and Science Communicator with a Ph.D. in Astrophysics and a Master of “Quantum Fields And Fundamental Forces”.
If you’ve ever dreamed of working on the largest experiment in the world, you can now make that dream a reality from the comfort of your own home. CERN has just released more than 300 terabytes (TB) of high-quality…Continue
Added by Emmanuelle Rieuf on June 8, 2016 at 10:00am — No Comments
This is a collection of 10 great GitHub repositories focusing on IPython, TensorFlow, Theano and related topics, for data scientists. The last one is not on GitHub.
Predictive analytics continues to be a top priority for all types of organizations. And IBM SPSS Modeler continues to be the predictive analytics workbench leading businesses rely on to make predictions with confidence.
IBM is partnering with survey research firm TechValidate to gain a better understanding of how organizations are using SPSS Modeler to help them make…Continue
Added by Christine O'Connor on June 8, 2016 at 8:30am — No Comments
This article "Revenge of the nerds" was written by the Data Team from The Economist. It is a daily chart that explains which degrees give the best financial returns. It concerns all degrees from American Universities (Sample of 452 institutions; based on 2012-13 costs.)
More blue points at the top means higher return for engineering, computer science and maths degrees. Note that admission rate does not seem to have a big impact on ROI, as if high admission rate comes with lower…Continue
Added by Emmanuelle Rieuf on June 8, 2016 at 8:30am — No Comments
For companies newly endeavoring in establishing capabilities in Data Science, it is important to keep a few crucial points in mind. Clean data, applicable models, and business intuition are all key to success. Do not remove any of them from the equation. Data Science is essentially about identifying and/or creating the cleanest possible data set, then searching mathematically for patterns within it. The goal should be to help business users make important data-driven…Continue
Added by Gaurav Agrawal on June 8, 2016 at 6:01am — No Comments
In the book Hadoop: The definitive guide, Tom white quotes Grace Hopper, “In pioneer days they used oxen for heavy pulling, and when one ox couldn’t budge a log, they didn’t try to grow a larger ox. We shouldn’t be trying for bigger computers, but for more systems of computers.” For long Hadoop has been the data analytics system preferred by businesses all over. The recent entry of the spark engine has however given businesses an option other than Hadoop for data analytics…Continue
Added by Tanmay Bhandari on June 7, 2016 at 7:29pm — No Comments
Summary: In this review of Mary Meeker’s annual Internet Trends report for 2016 we’ll look for the advanced analytics that makes these trends possible.
The Affordable Care Act expanded eligibility for the Medicaid program with the hope of enrolling millions of low-income U.S. residents who could not access insurance. California was one of the states that chose to expand. Researchers at the University of California-San Francisco conducted a study that investigated the trends in the association between insurance coverage and usage of emergency departments among adults 18-64 from 2005 through 2010. They found that ED utilization in…Continue
Added by Salil Sheth on June 7, 2016 at 8:34am — No Comments
Kcore Analytics is a website that developed an end-to-end framework to find the Influencers in complex social networks like Twitter. The big innovation in their approach is extracting, solely from influencers, summarized data to forecast global trends, from the markets to consumer products to social movements and revolutions.
Here is the list of the top 10 influencers in Data Science (screenshot): …Continue
Added by Rohit Yadav on June 7, 2016 at 12:20am — No Comments
This article is an introduction of 5 courses in Data Science Specialization from Coursera, developed and taught by leading professors. Coursera is an educational technology company that offers massive open online courses (MOOCs).
This Specialization, created by Johns Hopkins University, covers the concepts and tools you'll need throughout the entire data science pipeline, from asking the right kinds of questions to making inferences and publishing results. In the final Capstone…Continue
Added by Emmanuelle Rieuf on June 6, 2016 at 1:00pm — No Comments