Let's say a niche digital publisher makes $2 million in yearly revenue, delivers 10 million page views in a year, and generates a 50% profit margin. In short, this is $1 million profit per 10 million…
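As a rough sketch of the arithmetic in the teaser above, the figures can be worked through in a few lines of Python. Only the three input numbers come from the text; the variable names and the per-thousand-views breakdown are illustrative assumptions, not part of the original article.

```python
# Inputs taken from the teaser; everything else is derived from them.
yearly_revenue = 2_000_000       # USD per year
yearly_page_views = 10_000_000   # page views per year
profit_margin = 0.50             # 50% of revenue is profit

# Derived figures (names are hypothetical, chosen for this sketch).
yearly_profit = yearly_revenue * profit_margin
revenue_per_page_view = yearly_revenue / yearly_page_views
profit_per_page_view = yearly_profit / yearly_page_views
profit_per_thousand_views = profit_per_page_view * 1000

print(f"Yearly profit:             ${yearly_profit:,.0f}")
print(f"Revenue per page view:     ${revenue_per_page_view:.2f}")
print(f"Profit per page view:      ${profit_per_page_view:.2f}")
print(f"Profit per 1,000 views:    ${profit_per_thousand_views:.2f}")
```

Under these inputs, each page view carries about 20 cents of revenue and 10 cents of profit.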
Here I am talking about the Google algorithm that is used to attribute an article to the original blog where it was posted first, rather than to subsequent re-posts from authorized (syndicated) or…
The gravitational force is inversely proportional to the square of the distance between the two bodies. Why is the exponent in this formula exactly equal to 2, rather than (say) 1.99997054 or 2.00061833?…
Well-rounded, visionary data scientist with a broad spectrum of domain expertise, technical knowledge, and proven success in bringing measurable added value to companies ranging from startups to the Fortune 100, across multiple industries (finance, Internet, media, IT, security) and domains (data science, operations research, machine learning, computer science, business intelligence, statistics, applied mathematics, growth hacking, IoT).
Vincent developed and deployed new techniques such as hidden decision trees (for scoring and fraud detection); automated tagging, indexing and clustering of large document repositories; a scalable, simple, noise-resistant regression known as Jackknife Regression (suited to black-box, real-time or automated data processing); model-free confidence intervals; bucketisation; combinatorial feature selection algorithms; and methods for detecting causation rather than correlation. More generally, he invented a set of consistent, robust statistical and machine learning techniques that can be understood, implemented, interpreted, leveraged and fine-tuned by the non-expert. Vincent also invented many synthetic metrics (for instance, predictive power and L1 goodness-of-fit) that work better than old-fashioned stats, especially on badly behaved, sparse big data. Some of these techniques have been implemented in a Map-Reduce, Hadoop-like environment. Some are concerned with identifying true signal in an ocean of noisy data.
Vincent is a former postdoctoral researcher at Cambridge University and the National Institute of Statistical Sciences. He was among the finalists at the Wharton School Business Plan Competition and at the Belgian Mathematical Olympiads. Vincent has published 40 papers in statistical journals and is an invited speaker at international conferences. Vincent also created the first IoT platform to automate growth and content generation for digital publishers, using a system of APIs for machine-to-machine communications, involving Hootsuite, Twitter, and Google Analytics.
Vincent's profile is accessible at http://bit.ly/1jWEfMP and includes top publications, presentations, and work experience with Visa, Microsoft, eBay, NBC, Wells Fargo, and other organisations.
Guest blog by Justin B. Dickerson, PhD, MBA, PStat, Chief Data Scientist at Snap Advances.
Okay, that headline was meant to get your attention. But lately, I've been thinking about this crazy circus we call data science and how everyone seems to think data scientists are invaluable, treasured, and potentially "un-fireable" in this age of data scientist negative…
Guest blog by Jay Gendron, Associate Data Scientist at Booz Allen Hamilton; Author; Data Analytics; Speaker.
In today’s edition of “Coffeehouse Connect” we take a look at a major predictive event in the United States that occurs each year on February 2.
Today is Groundhog Day. It occurs in…
Starred articles are new additions or updated content, posted between Thursday and Sunday. The weekly digest has six sections: (1) Featured Resources and Technical Contributions, (2) Featured Articles and Case Studies, (3) From our Sponsors, (4) News, Events, Books, Training, Forum Questions, (5) Picture of the Week, and (6) Syndicated Content.
The full version is always published Monday.…
Guest blog post by
There is a feature I really like in Apache Spark. Spark can process data out of memory on my local machine even without a cluster. Good news for those…