Dr. Moloy De's Blog (5)

Forecasting with Lead Regression

Our client was a pioneering company in producing and delivering roof shingles. Their main plant is in Minnesota, and they have around 25 more plants across the US. The client installed hundreds of sensors along the assembly line that stream nanosecond-level data to their Spark data lake.

The viscosity of the input fluid is an important factor in maintaining the production quality of roof shingles. The data show unwanted peaks (outliers) in viscosity, which the client wanted to eliminate. Following…

Added by Dr. Moloy De on June 21, 2019 at 3:10am — No Comments

Optimal Allocation of Ambulances

Our client was an emergency response management organization that handles medical, police, and fire emergencies through the "1-0-8 Emergency service". The organization currently runs around 690 ambulances. An analysis is run on historical data extracted from the client's management database. The live data input is fed to a simulation model that proposes an optimal ambulance allocation, giving the organization an opportunity to reduce costs.

In probability theory and statistics, the…

Added by Dr. Moloy De on June 12, 2019 at 6:51am — No Comments

Profiling Store Visitors

Our telecom client was developing a big data product to profile the demographics (age, gender, income, ethnicity, marital status) of store visitors, using feeds from the Wi-Fi routers placed in the stores. The client received a daily feed of router data on its server, which was then uploaded to HDFS / Hive tables in the data lake for analysis.

Maintaining data quality was a serious issue; without it, the reports would have been erroneous. A daily e-mail used to get…

Added by Dr. Moloy De on May 23, 2019 at 9:29pm — No Comments

Telecom Customer Analytics

I was deputed to Lagos, Nigeria in 2011 to work for a telecom giant there. The project at hand was to develop customer analytics modules using SAS on the customer's newly built Oracle data warehouse. We considered developing the following modules.

  1. Customer Churn Analysis
  2. Calculating Product Propensities
  3. Customer Lifetime Value Calculation
  4. Customer Segmentation

Customer Churn…

Added by Dr. Moloy De on May 17, 2019 at 4:31pm — No Comments

Multivariate Outlier Detection

I was given 3 GB of machine-generated data, fed by 120 sensors (5 records every second), in Excel format. The task at hand was to mine interesting patterns, if any, from the data.

I loaded the data into R on my local machine and performed various descriptive and exploratory analyses to gain some insight. The customer was also looking for low-cost maintenance mechanisms for their machines. So I thought if I could study the outliers and provide some information about system…

Added by Dr. Moloy De on May 15, 2019 at 7:30pm — No Comments
