.

We tried to do XYZ. Did it make a difference?”

Whether you are in the for-profit world or the not-for profit world, this is a very basic question that many people try to answer.

You could be working at a bank trying to figure out which offer is most appealing to customers, at an online retailer figuring out which ad display gets the most clicks, at the Department of Education trying to test the effect of smaller class sizes, at the city government office trying to see if the new bike lane programs really are safer, at an online media provider (Netflix, Youtube, Spotify,..) trying to find the best algorithm for making recommendations, a pharmaceutical company is running a clinical trial comparing their drug’s effectiveness versus the competitor, a pharmaceutical company wants to see if its newly released drug has strong impact in the real world…

All of these examples have a few things in common.

- A change was made whether it be a new algorithm, a different ad display, new bike lanes, smaller class sizes or testing a new drug treatment. These are all examples of Programs.
- There is a desire to determine if this change made a difference in an outcome of interest whether that outcome is more sales, safer roads, better educated students or more patients being cured. These are all examples of Program Evaluation where the goal is to find out not only was there a change in the outcome of interest but to be able to say that the program
the difference. As part of this Program Evaluation, we want to know the direction of the change (did it make things better or worse?, the magnitude of the change, and, in some cases, whether the change was statistically significant.__caused__

Economists and Evaluation Specialists (those with degrees in Monitoring and Evaluation) study many techniques to do program evaluation including Randomized Experiments as well as Quasi-Experimental Methods such as Propensity Score Methods, Instrument Variable, Interrupted Time Series, Regression Discontinuity, Heckman’s 2-Stage Model...

Most data scientists can do A/B Testing like there is no tomorrow. It is a standard part of the Data Scientists toolkit. When successful, the A/B Testing creates a random assignment so that the two groups, A and B, are, on average, very similar in all observable and unobservable characteristics. The program evaluation then simply consists of checking the quality of the randomization (yes, this step get skipped by many people but, it should not be skipped) then comparing the outcomes in Group A to Group B. This is like the way a clinical trial is designed and implemented for a drug.

But what if the randomization failed? What if the groups are different? What if other experiments were going on at the same time that impacted the assignment?

What if randomization is not possible?

In these situations, the toolbox of Program Evaluation becomes critical to determining if the program made a difference in the outcome of interest, whether that be higher click-through rates, increased sales, safer roads, more effective drugs or better education.

The desired skills for a Data Scientist already include quite a long list. Knowing that we can’t add an infinite number of required skills to the Data Scientist Toolbox, what do you think about a basic course in Program Evaluation? Would some training in Program Evaluation be helpful to round out a Data Scientists training?

Interested in your insights on this topic. #datascience #programevaluation

- A History and Timeline of Big Data
- AI voice technology has benefits and limitations
- Strong data governance frameworks are fuel for analytics
- Top 12 most commonly used IoT protocols and standards
- What is the status of quantum computing for business?
- How parallelization works in streaming systems
- An Eggplant automation tool tutorial for Functional, DAI
- Circular economy model enables sustainability and resilience

Posted 29 March 2021

© 2021 TechTarget, Inc. Powered by

Badges | Report an Issue | Privacy Policy | Terms of Service

**Most Popular Content on DSC**

To not miss this type of content in the future, subscribe to our newsletter.

- Book: Applied Stochastic Processes
- Long-range Correlations in Time Series: Modeling, Testing, Case Study
- How to Automatically Determine the Number of Clusters in your Data
- New Machine Learning Cheat Sheet | Old one
- Confidence Intervals Without Pain - With Resampling
- Advanced Machine Learning with Basic Excel
- New Perspectives on Statistical Distributions and Deep Learning
- Fascinating New Results in the Theory of Randomness
- Fast Combinatorial Feature Selection

**Other popular resources**

- Comprehensive Repository of Data Science and ML Resources
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- 100 Data Science Interview Questions and Answers
- Cheat Sheets | Curated Articles | Search | Jobs | Courses
- Post a Blog | Forum Questions | Books | Salaries | News

**Archives:** 2008-2014 |
2015-2016 |
2017-2019 |
Book 1 |
Book 2 |
More

**Most popular articles**

- Free Book and Resources for DSC Members
- New Perspectives on Statistical Distributions and Deep Learning
- Time series, Growth Modeling and Data Science Wizardy
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- Comprehensive Repository of Data Science and ML Resources
- Advanced Machine Learning with Basic Excel
- Difference between ML, Data Science, AI, Deep Learning, and Statistics
- Selected Business Analytics, Data Science and ML articles
- How to Automatically Determine the Number of Clusters in your Data
- Fascinating New Results in the Theory of Randomness
- Hire a Data Scientist | Search DSC | Find a Job
- Post a Blog | Forum Questions

## You need to be a member of Data Science Central to add comments!

Join Data Science Central