All Blog Posts (6,678)

Weekly Digest, December 16

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  …

Added by Vincent Granville on December 15, 2019 at 4:12pm — No Comments

Crossing the AI Chasm with Infographics

AI is a game changer. Period. And being a data and analytics guy, I could not be more excited about it.

The McKinsey research study “Notes from the AI frontier: Applications and value of deep learning” provided some valuable insights into where and how Artificial Intelligence (i.e., Deep Learning / Neural Networks (CNNs, RNNs, GANs), Reinforcement Learning and Deep Reinforcement Learning) will derive and drive new sources of customer, product and…

Added by Bill Schmarzo on December 15, 2019 at 11:58am — No Comments

Math for Data Science in One Picture: What do you REALLY need to study?

In my last blog post, I covered the statistics you need to know for data science. But of course, stats isn't the only math-related knowledge you need. Rather than offer my own biased opinion about the importance of this subject vs. that one, I performed a meta-analysis of popular opinion to see what data scientists and educators are saying (see…

Added by Stephanie Glen on December 14, 2019 at 7:17am — No Comments

Rule of thumb: Which AI / ML algorithms to apply to business problems

How do you know which AI/ML algorithm to apply to which business problem?

This is a common question.

I found a good reference for it –…

Added by ajit jaokar on December 13, 2019 at 10:18am — No Comments

Using "record id's" to facilitate processing in Python-Pandas and R-data.table.

Both R and Python-Pandas are array-oriented platforms that support fast filtering through vectors of record-id's. In Python-Pandas, such vectors are implemented via Pandas's powerful index construct; in R-data.table, they're accessible through the "which" and "row.name" functions. In both instances, joins to record-id vectors generate fast subsetted access.
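
As a minimal sketch of the Pandas side of this idea: the DataFrame, its column names, and the helper `subset_by_ids` below are hypothetical, invented for illustration and not taken from the post. A filter is expressed as a vector of record ids drawn from the index; such vectors can be combined and reused to subset the frame.

```python
import pandas as pd

# Hypothetical example data; the index plays the role of the record id.
df = pd.DataFrame(
    {
        "state": ["CA", "NY", "CA", "TX", "NY"],
        "amount": [120.0, 80.0, 45.0, 300.0, 210.0],
    },
    index=pd.Index([101, 102, 103, 104, 105], name="record_id"),
)

# Each filter yields a vector of record ids (a pandas Index), not a copy of the data.
ca_ids = df.index[df["state"] == "CA"]
large_ids = df.index[df["amount"] > 100]

# Record-id vectors combine with set operations on the index.
ca_and_large = ca_ids.intersection(large_ids)


def subset_by_ids(frame: pd.DataFrame, ids: pd.Index) -> pd.DataFrame:
    """Return the rows of `frame` whose index labels appear in `ids`."""
    return frame.loc[ids]


result = subset_by_ids(df, ca_and_large)
print(result)
```

Keeping the ids rather than the filtered rows means the same selection can be applied later to any frame that shares the index.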

How is the record-id vector approach helpful? For starters, the analyst can encapsulate common…

Added by steve miller on December 13, 2019 at 5:51am — No Comments

Thursday News, December 12

Here is our selection of featured articles and technical resources posted since Monday:

Technical Resources

Added by Vincent Granville on December 12, 2019 at 12:00pm — No Comments

The Rise of Fake News: A Machine Learning Challenge

Guest blog by Faruqui Ismail and NookaRaju Garimella.

Reporters with various forms of "fake news" from an 1894 illustration by Frederick Burr…

Added by Vincent Granville on December 12, 2019 at 10:31am — No Comments

2020 Trends, Predictions and Challenges for Data Management and Privacy

As we move into 2020, data management will continue to advance and develop efficiencies that make getting data ready for business purposes faster and more reliable than ever. While the data management space is diverse in its practices, four trends will be at the forefront in 2020:

  • Data Orchestration – The uniting of data integration, API integration, and data movement to support DataOps techniques. This involves combining multiple…

Added by Todd Wright on December 12, 2019 at 6:00am — No Comments

Performance evaluation of cloud computing platforms for Machine Learning

A use case on Logistic regression training

Over the last few years there have been several efforts to build more powerful computing platforms to meet the challenges posed by emerging applications like machine learning. General-purpose CPUs have been developed with specialized ML modules, GPUs and FPGAs with specialized engines are…

Added by Chris Kachris on December 12, 2019 at 4:30am — No Comments

Web Scraping with a Headless Browser: A Puppeteer Tutorial

Web development has moved at a tremendous pace in the last decade, with many frameworks emerging for both backend and frontend development. Websites have become smarter, and so have the underlying frameworks used to develop them. These advances have also driven the development of the browsers themselves. Most browsers are now available in a “headless” version, where a user can interact with a website without any UI. You…

Added by Sandra Moraes on December 11, 2019 at 8:00pm — No Comments

Optimal Binning for Scoring Modeling (R Package)

What is Binning?

Binning is the term used in scoring modeling for what is also known in Machine Learning as Discretization, the process of transforming a continuous characteristic into a finite number of intervals (the bins), which allows for a better understanding of its distribution and its relationship with a binary variable. The bins generated by this process will eventually become the attributes of a…
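
To make the definition concrete, here is a small Python sketch of discretization with pandas. It is not the R package the post describes and does not implement an optimal-binning algorithm: the data, the column names, and the cut points in `edges` are all hypothetical and hand-picked for illustration.

```python
import pandas as pd

# Hypothetical scoring data: a continuous characteristic (age) and a binary outcome.
data = pd.DataFrame(
    {
        "age": [22, 25, 31, 38, 44, 47, 52, 58, 63, 70],
        "default": [1, 1, 0, 1, 0, 0, 0, 1, 0, 0],
    }
)

# Hand-picked cut points (an assumption); an optimal-binning method would
# choose these from the data instead.
edges = [18, 30, 45, 60, 75]

# pd.cut maps each value to a right-closed interval: (18, 30], (30, 45], ...
data["age_bin"] = pd.cut(data["age"], bins=edges)

# Summarise the relationship between each bin and the binary variable.
summary = (
    data.groupby("age_bin", observed=True)["default"]
    .agg(count="count", events="sum", event_rate="mean")
)
print(summary)
```

The per-bin event rate is what lets an analyst see how the binary variable varies across the intervals of the continuous characteristic.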

Added by Vincent Granville on December 11, 2019 at 11:12am — No Comments

The Rise of Fake News. A Machine Learning challenge!

By Faruqui Ismail and NookaRaju Garimella

Reporters with various forms of "fake news" from an 1894 illustration by Frederick Burr Opper

We’ve always pictured the rise of artificial intelligence as…

Added by Faruqui Ismail on December 11, 2019 at 8:00am — No Comments

Machine Learning Market is Rising Due to Rapid Increase in Unstructured Data

The Global Machine Learning Market is expected to expand at 42.08% CAGR during the forecast period 2018–2024. Machine learning is a branch of artificial intelligence (AI) that uses statistical techniques for analytical model building, giving computers the ability to learn from data instead of being…

Added by Ehtesham Peerzade on December 11, 2019 at 2:30am — No Comments

It Is Never Too Late To Learn!

The article by Stephanie Glen in the November 30 DSC Newsletter is spot on! I am a 77-year-old Data Scientist, and I have done my best work since I “retired” in 2009. Since then, I have published 3 books on Data Science topics with Academic Press, and a 4th book is in press at Cambridge University Press. I began teaching Data Science at the University of California at Irvine in 2012. All of my students are international (in an international program at UCI), and almost all of them…

Added by Robert Nisbet on December 10, 2019 at 6:14pm — No Comments

Make Crucial Predictions as Data Comes

If you walk the hottest IT streets these days, you have likely heard about Streaming Machine Learning, i.e., moving AI towards a streaming scenario and exploiting real-time capabilities along with new Artificial Intelligence techniques. You will also notice the lack of research on this topic, despite the growing interest in it.

If we investigate a little deeper, we realize that…

Added by Valeria on December 10, 2019 at 7:30am — No Comments

Why Event Stream Processing Is Leading the New Big Data Era

Big Data is probably one of the most misused terms of the last decade. It was widely promoted, discussed, and spread around by business managers, technical experts, and experienced academics. Slogans like “Data is the new oil” were widely accepted as unquestionable truth.

These beliefs pushed Hadoop technologies forward. Its stack, formerly developed by Yahoo! and now owned by the Apache Software Foundation, was recognized as “The” Big Data…

Added by Valeria on December 10, 2019 at 7:21am — No Comments

Deep Analytics: Risk Management with AI

We first provide a mini-tutorial on Adjoint Algorithmic Differentiation (AAD) (also known as back-propagation in machine learning). We then illustrate how neural networks may be used to compute dynamic values and risks of trading books with applications to risk management of derivatives, valuation adjustments (XVA), counterparty credit risk, FRTB and SIMM margin valuation adjustments (MVA). We also describe new techniques to substantially improve deep learning on simulated data, and…
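
For readers unfamiliar with the term, the idea behind adjoint (reverse-mode) differentiation can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the authors' tutorial or implementation: the `Var` class, the `exp` and `backward` helpers, and the example function f(x, y) = x·y + exp(x) are all hypothetical.

```python
import math


class Var:
    """A scalar node that records how it was computed."""

    def __init__(self, value, parents=()):
        self.value = value
        self.parents = parents  # pairs of (parent node, local partial derivative)
        self.adjoint = 0.0      # d(output)/d(this node), filled in by backward()

    def __add__(self, other):
        return Var(self.value + other.value, ((self, 1.0), (other, 1.0)))

    def __mul__(self, other):
        return Var(self.value * other.value,
                   ((self, other.value), (other, self.value)))


def exp(x):
    """Exponential of a Var; its local derivative equals its own value."""
    y = math.exp(x.value)
    return Var(y, ((x, y),))


def backward(output):
    """Propagate adjoints from the output back to the inputs."""
    order, seen = [], set()

    def visit(node):
        # Depth-first traversal that appends each node after its parents.
        if id(node) not in seen:
            seen.add(id(node))
            for parent, _ in node.parents:
                visit(parent)
            order.append(node)

    visit(output)
    output.adjoint = 1.0
    # Walk the graph in reverse topological order, applying the chain rule.
    for node in reversed(order):
        for parent, local in node.parents:
            parent.adjoint += node.adjoint * local


x, y = Var(2.0), Var(3.0)
f = x * y + exp(x)
backward(f)
print(x.adjoint, y.adjoint)  # df/dx = y + exp(x), df/dy = x
```

A single backward sweep yields the derivative with respect to every input at once, which is why the technique suits problems with many inputs and few outputs.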

Added by Antoine Savine on December 10, 2019 at 1:30am — No Comments

Fun with maps: Part 2

Last time we created a beautiful map with a lot of features, see here. This time I will show you how to customize different things. I use the same data.

import folium
map1 = folium.Map(location=[10, 20], zoom_start=2, tiles='http://tile.stamen.com/toner-lite/{z}/{x}/{y}.png', attr="Dr.Katharina Glass")

Let’s start with the marker.…

Added by Dr. Katharina Glass on December 9, 2019 at 10:00pm — No Comments

CPU Vendors Compete Over Memory Bandwidth to Achieve Leadership in Real-World Application Performance

By Rob Farber

Now is a great time to be procuring systems, as vendors are finally addressing the memory bandwidth bottleneck. Succinctly, memory performance dominates the performance envelope of modern devices, be they CPUs or GPUs. [i] It does not matter if…

Added by Rob Farber on December 9, 2019 at 10:00am — No Comments

Statistics for Data Science in One Picture

There's no doubt about it, probability and statistics is an enormous field, encompassing topics from the familiar (like the average) to the complex (…

Added by Stephanie Glen on December 9, 2019 at 7:48am — No Comments

© 2019 Data Science Central ®