December 2019 Blog Posts (87)

5 Ways Big Data Is Impacting Healthcare Providers

The ability of companies to provide great healthcare has always been reliant upon data. Before the widespread adoption of technology, important information came from medical exams, checkups, and direct communication between physicians. …


Added by Luke Fitzpatrick on December 15, 2019 at 11:30pm — No Comments

The Rise of Fake News. A Machine Learning challenge!

By Faruqui Ismail and Nooka Raju Garimella

Reporters with various forms of "fake news" from an 1894 illustration by Frederick Burr Opper


We’ve always pictured the rise of artificial…


Added by Faruqui Ismail on December 15, 2019 at 10:24pm — No Comments

Weekly Digest, December 16

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  …


Added by Vincent Granville on December 15, 2019 at 4:00pm — No Comments

Crossing the AI Chasm with Infographics

AI is a game changer. Period. And being a data and analytics guy, I could not be more excited about it.

The McKinsey research study “Notes from the AI frontier: Applications and value of deep learning” provided some valuable insights into where and how Artificial Intelligence (i.e., Deep Learning / Neural Networks (CNNs, RNNs, GANs), Reinforcement Learning and Deep Reinforcement Learning) will derive and drive new sources of customer, product and…


Added by Bill Schmarzo on December 15, 2019 at 11:58am — No Comments

Math for Data Science in One Picture: What do you REALLY need to study?

In my last blog post, I covered the statistics you need to know for data science. But of course, stats isn't the only math-related knowledge you need. Rather than offer my own biased opinion about the importance of this subject vs. that one, I performed a meta-analysis of popular opinion to see what data scientists and educators are saying (see…


Added by Stephanie Glen on December 14, 2019 at 7:00am — No Comments

Rule of thumb: Which AI / ML algorithms to apply to business problems


How do you know which AI/ML algorithm to apply to which business problem?

This is a common question.

I found a good reference for it –…


Added by ajit jaokar on December 13, 2019 at 10:18am — 2 Comments

Using "record id's" to facilitate processing in Python-Pandas and R-data.table.


Both R and Python-Pandas are array-oriented platforms that support fast filtering through vectors of record-id's. In Python-Pandas, such vectors are implemented via Pandas's powerful index construct; in R-data.table, they're accessible through the "which" and "row.name" functions. In both instances, joins to record-id vectors generate fast subsetted access.
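As a rough illustration of the Pandas side of this idea, here is a minimal sketch, assuming a toy DataFrame whose index serves as the record id. The column names, the values, and the `record_id` label are hypothetical and are not taken from the original post; the sketch only shows how filters can be stored as id vectors and combined before subsetting.

```python
import pandas as pd

# Hypothetical toy data; names and values are assumptions for illustration.
df = pd.DataFrame(
    {
        "state": ["CA", "NY", "CA", "TX", "NY"],
        "amount": [120.0, 75.5, 80.0, 142.0, 99.9],
    },
    index=pd.RangeIndex(start=100, stop=105, name="record_id"),
)

# Each filter is captured once as a vector of record ids (a pandas Index).
ca_ids = df.index[df["state"] == "CA"]
large_ids = df.index[df["amount"] > 100]

# Id vectors combine with set operations and then drive label-based subsetting.
ca_and_large = ca_ids.intersection(large_ids)
ca_or_large = ca_ids.union(large_ids)
subset = df.loc[ca_and_large]
```

Keeping the filters as id vectors means the same selection can be reused against any other frame that shares the index, without recomputing the conditions.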

How is the record-id vector approach helpful? For starters, the analyst can encapsulate common…


Added by steve miller on December 13, 2019 at 5:51am — No Comments

Sports Telemetry in Real-Time

The use of telemetry in F1 motor racing as a way to monitor car setup and performance dates back to the 80s. The first electronic systems, installed onboard the car, collected information for only one lap, and the data were then downloaded when the car was back in the garage. The explosion of computing capabilities in the 90s contributed to the growth of intelligent data usage in F1 and the…


Added by Valeria on December 13, 2019 at 1:00am — 2 Comments

Thursday News, December 12

Here is our selection of featured articles and technical resources posted since Monday:

Technical Resources


Added by Vincent Granville on December 12, 2019 at 12:00pm — No Comments

The Rise of Fake News: A Machine Learning Challenge

Guest blog by Faruqui Ismail and NookaRaju Garimella.

Reporters with various forms of "fake news" from an 1894 illustration by Frederick Burr…


Added by Vincent Granville on December 12, 2019 at 10:31am — No Comments

2020 Trends, Predictions and Challenges for Data Management and Privacy

As we move into 2020, data management will continue to advance and develop efficiencies that will make the job of having data ready for business purposes faster and more reliable than ever. While the data management space is a diverse field in its practices, there are four trends that will be at the forefront in 2020:

  • Data Orchestration – The uniting of data integration, API integration, and data movement to support DataOps techniques. This involves combining multiple…

Added by Todd Wright on December 12, 2019 at 6:00am — No Comments

Performance evaluation of cloud computing platforms for Machine Learning

A use case on Logistic regression training

Over the last few years there have been several efforts to build more powerful computing platforms to face the challenges imposed by emerging applications like machine learning. General-purpose CPUs have been developed with specialized ML modules, GPUs and FPGAs with specialized engines are…


Added by Chris Kachris on December 12, 2019 at 4:30am — No Comments

Web Scraping with a Headless Browser: A Puppeteer Tutorial

Web development has moved at a tremendous pace in the last decade, with many frameworks emerging for both backend and frontend development. Websites have become smarter, and so have the underlying frameworks used to develop them. These advancements have driven the development of the browsers themselves, too. Most browsers are now available in a “headless” version, where a user can interact with a website without any UI. You…


Added by Sandra Moraes on December 11, 2019 at 8:00pm — No Comments

Optimal Binning for Scoring Modeling (R Package)

What is Binning?

Binning is the term used in scoring modeling for what is also known in Machine Learning as Discretization, the process of transforming a continuous characteristic into a finite number of intervals (the bins), which allows for a better understanding of its distribution and its relationship with a binary variable. The bins generated by this process will eventually become the attributes of a…
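To make the idea of discretization concrete, here is a minimal sketch in Python. It uses simple equal-frequency binning, which is an assumption chosen for illustration and not the optimal-binning algorithm of the R package the post describes. The `ages` and `defaulted` data and both helper functions are hypothetical.

```python
from bisect import bisect_right


def equal_frequency_cutpoints(values, n_bins):
    """Return n_bins - 1 cut points splitting the sorted values into equal-sized groups."""
    ordered = sorted(values)
    n = len(ordered)
    return [ordered[(i * n) // n_bins] for i in range(1, n_bins)]


def assign_bin(value, cutpoints):
    """Return the index of the interval holding value; a cut point belongs to the bin on its right."""
    return bisect_right(cutpoints, value)


# Hypothetical continuous characteristic and binary target (1 = event).
ages = [22, 25, 31, 35, 38, 41, 47, 52, 58, 63, 67, 71]
defaulted = [1, 1, 0, 1, 0, 0, 0, 1, 0, 0, 0, 0]

cutpoints = equal_frequency_cutpoints(ages, n_bins=3)

# Count observations and events per bin to relate the bins to the binary variable.
counts = [0] * 3
events = [0] * 3
for age, flag in zip(ages, defaulted):
    b = assign_bin(age, cutpoints)
    counts[b] += 1
    events[b] += flag

event_rates = [e / c for e, c in zip(events, counts)]
```

The per-bin event rate is the kind of summary that links each interval to the binary variable; an optimal-binning method would choose the cut points to sharpen that relationship instead of fixing equal bin sizes.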


Added by Vincent Granville on December 11, 2019 at 11:12am — No Comments

Machine Learning Market is Rising Due to Rapid Increase in Unstructured Data

The Global Machine Learning Market is expected to expand at 42.08% CAGR during the forecast period 2018–2024. Machine learning is a branch of artificial intelligence (AI) that uses statistical techniques for analytical model building, giving computers the ability to learn from data instead of being…


Added by Ehtesham Peerzade on December 11, 2019 at 2:30am — 1 Comment

It Is Never Too Late To Learn!

The article by Stephanie Glen in the November 30 DSC Newsletter is spot on! I am a 77-year-old Data Scientist, and I have done my best work since I “retired” in 2009. Since then, I have published 3 books on Data Science topics with Academic Press, and a 4th book is in press at Cambridge University Press. I began teaching Data Science at the University of California at Irvine in 2012. All of my students are international (in an international program at UCI), and almost all of them…


Added by Robert Nisbet on December 10, 2019 at 6:14pm — No Comments

Make Crucial Predictions as Data Comes

If you have walked the hottest IT streets these days, you've likely heard about Streaming Machine Learning, i.e. moving AI towards streaming scenarios and exploiting real-time capabilities along with new Artificial Intelligence techniques. You will also have noticed the lack of research on this topic, despite the growing interest in it.

If we investigate a little deeper, we realize that…


Added by Valeria on December 10, 2019 at 7:30am — No Comments

Why Event Stream Processing Is Leading the New Big Data Era

Big Data is probably one of the most misused words of the last decade. It was widely promoted, discussed, and spread around by business managers, technical experts, and experienced academics. Slogans like “Data is the new oil” were widely accepted as unquestionable truth.

These beliefs pushed Hadoop technologies forward. Its stack, formerly developed by Yahoo! and now owned by the Apache Software Foundation, was recognized as “The” Big Data…


Added by Valeria on December 10, 2019 at 7:21am — No Comments

Deep Analytics: Risk Management with AI

We first provide a mini-tutorial on Adjoint Algorithmic Differentiation (AAD) (also known as back-propagation in machine learning). We then illustrate how neural networks may be used to compute dynamic values and risks of trading books with applications to risk management of derivatives, valuation adjustments (XVA), counterparty credit risk, FRTB and SIMM margin valuation adjustments (MVA). We also describe new techniques to substantially improve deep learning on simulated data, and…


Added by Antoine Savine on December 10, 2019 at 1:30am — No Comments

Fun with maps: Part 2

Last time we created a beautiful map with a lot of features, see here. This time I will show you how to customize different things. I use the same data.

import folium
map1 = folium.Map(location=[10, 20], zoom_start=2, tiles='http://tile.stamen.com/toner-lite/{z}/{x}/{y}.png', attr="Dr.Katharina Glass")

Let’s start with marker.…


Added by Dr. Katharina Glass on December 9, 2019 at 10:00pm — No Comments

© 2021 TechTarget, Inc.