All Blog Posts Tagged 'data' (678)

How do I prepare for Data Scientist interviews?

If you are a recent graduate or someone preparing for your first data scientist position, then here are some tips to help you ace your interview!…

Added by Ann Rajaram on July 11, 2019 at 2:52pm — No Comments

Deploying Python application using Docker and AWS

The use of Docker in conjunction with AWS can be highly effective when it comes to building a data pipeline.

Have you ever been in this situation? You are building a model in Python that you need to send to a third party, e.g. a client or colleague. However, the person on the other end cannot run the code! Maybe they don't have the right libraries installed, or their system is not configured correctly.

Whatever the reason, Docker alleviates this…

Added by Michael Grogan on July 5, 2019 at 8:30am — No Comments

Top 8 Data Science Use Cases in Construction

Introduction

With every article, we keep proving that data science has found broad application in numerous business areas. Now it is the construction industry's turn. The world is overloaded with data, which results in a steady improvement in…

Added by Igor Bobriakov on July 5, 2019 at 5:00am — No Comments

Data Quality Case Studies: How We Saved Clients Real Money Thanks to Data Validation

Machine learning models grow more powerful every week, but the earliest models and the most recent state-of-the-art models share the exact same dependency: data quality. The maxim “garbage in – garbage out”, coined decades ago, continues to apply today. Recent examples of data verification shortcomings abound, including JP Morgan/Chase’s 2013 fiasco and this lovely…

Added by Michał Frącek on July 4, 2019 at 4:21am — No Comments

Multilevel Modelling of U.S. Home Loan Data

The housing market has undergone quite a change in the past decade, with more stringent lending criteria for housing having been enforced.

A key objective of financial institutions is to minimise the risk of mortgage lending by ensuring that the debtor is ultimately able to repay the loan.

In this example, multilevel modelling techniques are used to analyse data from the Federal Home Loan Bank…

Added by Michael Grogan on July 3, 2019 at 3:01am — No Comments

How Long Does It Take to Learn Python for Data Science?

Python is among the most loved, dreaded, and wanted programming languages for developers, according to the StackOverflow survey. Popular among professional software developers, Python was ranked the world’s seventh most popular programming language.

The PYPL Popularity of Programming Language Index (a study that monitors the frequency of searches about popular programming languages to learn) showed that there was a growth of 17.1% during the last…

Added by Yoey Thamas on July 2, 2019 at 12:29am — No Comments

Where’s the Love – Trends in Data Science Career Opportunities

Summary:  The annual Burtch Works salary survey tells us a lot about which industries are using the most data scientists and the difference between higher and lower skilled data scientists.  Salary increases show us whether demand is increasing, and finally we take a shot at determining which skills are most in demand.

 …

Added by William Vorhies on July 1, 2019 at 8:00am — No Comments

Recognizing Animals in Photos: Building an AI model for Object Recognition

Our model for recognizing specific animals in images is a neural network consisting of multiple layers, and the initial layers are already good at understanding the world in general. So instead of “re-inventing the wheel,” we only need to train the final layers.

I was excited to work on a recent project with one of our partners, Wild Detect, because it aligns with one of our goals at Appsilon — to use data science consulting to aid in the…

Added by Michał Frącek on June 25, 2019 at 1:13am — No Comments

What Is Big Data And What Are Its Importance

Big Brother? No, no, no, Big Data.

If someone mentions the Big Data concept in conversation, do not assume they are addicted to reality TV (as its audience calls it) or trash TV (as its detractors call it). It sounds similar, but that is Big Brother. Still, in the background and tangentially, you can find certain similarities, or relationships, between the television phenomenon Big Brother and the latest…

Added by Ricky Brown on June 14, 2019 at 3:14am — No Comments

Why Every Hadoop Professional Needs Data Science Skills?

Value of adopting Data Science Skills

Data Science is responsible for giving meaning to the large amounts of complex data called big data. It draws on different fields of work in statistics and computation to interpret data for decision-making.

Advances in the internet and social media are increasing access to big data. Extracting meaningful information requires data science to use AI and ML. Big data is used in every…

Added by Yoey Thamas on June 4, 2019 at 2:33am — No Comments

Implementing Knowledge Graphs in Enterprises - Some Tips and Trends

Tips

  1. Don't try to put the cart before the horse: realize that efficient data preparation (and thus interoperable standards) and data quality, especially in the enterprise environment, are a basic requirement for…

Added by Andreas Blumauer on May 21, 2019 at 5:33am — No Comments

Causality – The Next Most Important Thing in AI/ML

Summary: Finally there are tools that let us transcend ‘correlation is not causation’ and identify true causal factors and their relative strengths in our models. This is what prescriptive analytics was meant to be.

Just when I thought we’d figured it all out, something comes along to make…

Added by William Vorhies on April 22, 2019 at 8:47am — 4 Comments

What are the prerequisite skills required to become a data scientist in 2019?

According to the “Future of Jobs 2018” report, advances in machines and artificial intelligence are likely to replace 75 million jobs by 2020. Data scientist remains one of the trending careers, and those interested in pursuing data science as a career must first have the right skill set to land a job in this field.

As more companies become data-driven, professionals skilled in data science…

Added by Yoey Thamas on April 7, 2019 at 8:57pm — 1 Comment

Credit risk transactional data

Hello,

I am working on transactional data from a bank's portfolio, and I would like to spot outliers for both credit risk and fraudulent activity. What methods would you recommend?

Thanks!

Added by Alex Din on April 4, 2019 at 8:00am — No Comments

Practice Data Storytelling with Your Own Instagram Data!

This month, Panoply is running an Instagram Data Challenge! The gist is, you can pull your own Instagram data into a Panoply database with a few clicks, make a visualization or tell a data story with the data about your own posts, and enter to win one of several $1000 awards.

To kick off the challenge, I did a bit of exploration…

Added by Trevor Fox on April 4, 2019 at 7:16am — No Comments

Most Popular Content on DSC

We have been in existence for over 10 years now, with content in many different places, lists, categories, and databases. This is an attempt to put everything together in one place, and help our readers (re-)discover some great articles and resources that were lost on the Internet over the years, but still sit on our web servers. We are making them come back to life. We are in the process of organizing it in a way that is user-friendly.  Some of the resources below are very recent, and some…

Added by Vincent Granville on April 3, 2019 at 5:30pm — No Comments

Retailers: Better Market Basket Analysis Will Keep You Ahead Of Competition

“It’s much easier to double your business by doubling your conversion rate than by doubling your traffic.” —Jeff Eisenberg, CEO of BuyerLegends.com

Only now have small and medium retailers started to realize why Amazon and other e-commerce companies…

Added by Hemant Warudkar on April 1, 2019 at 7:30pm — No Comments

Long-range Correlations in Time Series: Modeling, Testing, Case Study

We investigate a large class of auto-correlated, stationary time series, proposing a new statistical test to measure departure from the base model, known as Brownian motion. We also discuss a methodology to deconstruct these time series, in order to identify the root mechanism that generates the observations. The time series studied here can be discrete or continuous in time; they can have various degrees of smoothness (typically measured using the Hurst exponent) as well as long-range or…

Added by Vincent Granville on April 1, 2019 at 9:00am — 1 Comment

Top 8 Data Science Use Cases in Gaming

The gaming industry is on the rise. With more than 2 billion players all over the world, the gaming industry is a source of enormous revenues and…

Added by Igor Bobriakov on March 31, 2019 at 10:00pm — No Comments

© 2019 Data Science Central ®