June 2018 Blog Posts (93)

Two More Math Problems: Continued Fractions, Nested Square Roots, Digits of Pi

These problems are for college undergrads after a first course in calculus. They are provided with solutions, and could be used by college professors as exercises or exam questions.

1. Digits of Pi/4

Prove that in base b, if b is an even integer, n > 3, and x = Pi/4, then the n-th digit of x, denoted as a(n), is given by the formula below. We start…


Added by Vincent Granville on June 25, 2018 at 7:30pm — No Comments

The Difference Between Managing Large and Small Data Science Teams

Summary:  As advanced analytics and data science have matured into must-have skills, data science groups within large companies have themselves become much larger.  This has led to some unique problems and solutions that you’ll want to consider as your own DS group grows larger. 



Added by William Vorhies on June 25, 2018 at 2:45pm — 5 Comments

How will a bank recognize that it is falling behind in artificial intelligence?

Driven by developments in artificial intelligence and big data, the whole financial industry is undergoing a fundamental change that will become even more pronounced in the coming years. The associated changes entail many opportunities, but also numerous risks. It is already foreseeable that there will be both winners and losers, especially since the maturity of artificial intelligence use varies widely from bank to bank.

But how can a bank actually notice that it is being…


Added by Dr. Dimitrios Geromichalos on June 25, 2018 at 12:30am — No Comments

Harnessing Technology To Kill User Privacy

Privacy! Does it really exist today, when technology allows monitoring of even your personal movements in your dark bedroom through solid walls?

Privacy – What is it?

"Let's race for data intelligence" is the mantra of every company on the planet today. Do we really have privacy anymore? More than 90% of us are not even aware of what data is being collected, or to what extent.

Some reliable sources on the internet claim…


Added by Vinod Sharma on June 24, 2018 at 10:30pm — No Comments

Weekly Digest, June 25

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week.


  • Springboard provides you not only with the content that will help you get a data science job quickly, but also sets you up to be more impactful in the role that you land.…

Added by Vincent Granville on June 24, 2018 at 3:00am — No Comments

DataMelt published Java API documentation

DataMelt, a computational platform for data analysis, has organized its Java documentation:


Added by jwork.ORG on June 23, 2018 at 5:24pm — No Comments

Market Alignment - An Application of Systems Theory for Organizations

The main components of systems theory that readers might remember are “inputs,” “processes,” and “outputs.” The part that tends to get neglected is “feedback mechanisms.” These mechanisms tell the system the extent to which operations fit the environment. If there is a lack of fitness, there is stress. One adaptive impulse is to make processes more complex and intelligent, sometimes described as the fight response. Another impulse is to give up and run away - i.e. the flight…


Added by Don Philip Faithful on June 23, 2018 at 9:00am — 1 Comment

Multivariate Regression with Neural Networks: Unique, Exact and Generic Models

Michael Nielsen provides a visual demonstration in his web book Neural Networks and Deep Learning that a 1-layer deep neural network can match any function y = f(x). It is just a matter of the number of neurons needed to get a prediction that is arbitrarily close: the more neurons, the better the approximation. The Universal Approximation Theorem also supplies a rigorous proof of this. But the known issues…


Added by Ashok Chilakapati on June 22, 2018 at 2:30pm — No Comments

Contamination of the Control Group

Continuing my thoughts on random experiments and what can go wrong:

One common problem is contamination of the control group.

The only difference between the treatment and control groups should be the treatment.  That said, this isn't always true.  How the treatment is administered can affect the control group.  Think about health examples where, for example, deworming…


Added by Howard Friedman on June 22, 2018 at 3:55am — No Comments

Encoding fixed length high cardinality non-numeric columns for a ML algorithm

In this post, Encoding high cardinality text data for a ML algorithm, the author compares four ways to encode non-numerical tabular data. This skill is useful, and often necessary, for using years' worth of tabular data in machine learning and deep learning algorithms.

One of the ideas, Character Encoding, is an…


Added by Nitin Pasumarthy on June 22, 2018 at 12:30am — No Comments

Gradients support in PyTorch

In this article by Maxim Lapan, the author of Deep Reinforcement Learning Hands-On, we discuss gradients in PyTorch.

Gradient support in tensors is one of the major changes in PyTorch 0.4.0. In previous versions, graph tracking and gradient accumulation were done in a separate, very thin class, Variable, which worked as a wrapper around the tensor and automatically saved the history of computations in order to be able to…


Added by Packt Publishing on June 22, 2018 at 12:30am — No Comments

Will the job outlook for data scientists severely decline after 2020?

Interesting question posted on Quora recently. Here is my take on this.

Just put the next buzz word on your resume when you graduate, maybe AI engineer? I completed my PhD in computational statistics 25 years ago. It was in fact data science (image remote sensing), but under a different name. I changed my job title from statistician to data scientist many years ago, and I may do it again if needed. There is more and more data to process, so the need will grow, but it will grow very…


Added by Vincent Granville on June 21, 2018 at 12:30pm — No Comments

Using Topological Data Analysis to Understand the Behavior of Convolutional Neural Networks

TLDR: Neural Networks are powerful but complex and opaque tools. Using Topological Data Analysis, we can describe the functioning and learning of a convolutional neural network in a compact and understandable way. The implications of the finding are profound and can accelerate the development of a wide range of applications from self-driving everything to GDPR.


Neural networks have demonstrated a great…


Added by Jonathan Symonds on June 21, 2018 at 9:30am — No Comments

Thursday News: AI, Decision Trees, Feature Selection, World Cup Predictions, Data Science and Biology

Here is our selection of featured articles and technical resources posted since Monday:


Added by Vincent Granville on June 21, 2018 at 8:14am — No Comments

When Randomization Fails

Randomization, including A/B testing, often fails, for a variety of reasons. When that happens, you will likely find yourself in a situation where the treatment and control groups differ significantly.

The first thing to remember is that some unavoidable differences can occur in studies. Are the differences practically significant? Are they statistically…


Added by Howard Friedman on June 21, 2018 at 5:41am — No Comments

What are Artificial Intelligence, Machine Learning, and Data Science, and what is the difference between them?

What is artificial intelligence (AI)?

Most of us cannot imagine a single day without a computer. With the rapid development of technology, various devices that simplify people's lives have become more accessible. The same is true of modern computers, which are capable of impressively fast information processing.

Modern business uses the full potential of information technology. This allows you to store important data and manage it…


Added by Barbara Elliott on June 21, 2018 at 1:00am — No Comments

Free eBooks on Data Visualization and Machine Learning

What You Need to Know about Machine Learning

By Gabriel A. Canepa

This eBook offers you the perfect place to lay the foundation for your work in the world of Machine Learning, providing the basic understanding, knowledge, and skills…


Added by Packt Publishing on June 21, 2018 at 12:50am — 2 Comments

The Top 3 Openstack Benefits and Challenges

Over the past decade, we have seen a shift towards virtualization, a stepping stone to complete cloud utilization. However, even by leveraging virtualization, you still need the appropriate building blocks for a private cloud environment. This involves going beyond simply virtualizing compute and other central data center subsystems (i.e., network and storage), and requires you to enable flexible APIs in order to automate resource provisioning and…


Added by lakshmi yarlagadda on June 20, 2018 at 11:30pm — No Comments

How to unlock value from Enterprise Asset Analytics?

The digital world is all around us! Disruptive business models are the new phenomenon! It means different things (not just internet of things) to different people and businesses.

Some industries are traditionally slow in adopting new technologies. For example, AOVC (Asset Oriented Value Chains – Natural Resources, Metals and Chemicals) have not explored digital to its full potential. Analytics is one of the key components of this revolution. It is not just about capturing…


Added by Amit Supe on June 20, 2018 at 10:00pm — No Comments

Applying Agile IT Methodology to Data Science Projects

If you keep up with the latest trends in the business world, you will have seen the term Data Science appear frequently. It is a steadily growing field, and new developments keep occurring. Data Science delivers multiple benefits across a range of industries. Small and large businesses alike are catching up, discovering high potential for growth using data analytics.

Data Science Projects

Businesses that seek to improve their solutions for customers by…


Added by VAMSI NELLUTLA on June 20, 2018 at 5:30pm — No Comments
