Featured Blog Posts – September 2014 Archive (44)

The 22 Skills of a Data Scientist

There have been a number of interesting articles recently discussing the skills a data scientist should or might have. The one entitled The 22 Skills of a Data Scientist is a popular one (see the 22 skills listed below, or click on the link to read the full article). Earlier this morning, I read another one on LinkedIn: …

Added by Vincent Granville on September 29, 2014 at 1:00pm — 7 Comments

Elements of machine learning

The official title of this free book, available in PDF format, is Machine Learning Cheat Sheet. But it's more about the elements of machine learning, with a strong emphasis on classic statistical modeling, and it is rather theoretical: something like a fairly comprehensive handbook of the theoretical foundations of statistical science. Anyway, it's very interesting, and it's free. See the table of contents screenshot below. …

Added by Marcel Remon on September 29, 2014 at 9:30am — 1 Comment

More about Shifting Culture, Less about Investing in Potential

Data science is often brought to companies as a potential game changer: an investment that may pay off if the company's data can be leveraged to provide insight and gain a competitive edge. But bringing analytical offerings to organizations as a "maybe solution" to their pain points misses the mark. Data science is today's answer to our most pressing enterprise and socially innovative challenges, given the data-driven nature of our markets and society as a whole. If an investment in data…

Added by Sean McClure on September 29, 2014 at 9:03am — 1 Comment

New Beginnings in Facial Recognition

As humans, we navigate our lives largely by the recognition of patterns. These patterns include the sound of a mother’s voice, the appearance of a dangerous animal or poisonous food, the familiarity of kin, and the attraction to potential mates. Accurate pattern recognition is key to an animal’s survival and progress, and has allowed humans to become the socially complex and advanced species we are today. 

It should come as no surprise that…

Added by Sean McClure on September 29, 2014 at 9:01am — No Comments

Keeping Corporate Data Safe: 5 Ways Lax BYOD Policies Create Security Risks

The proliferation of smartphones, tablets, and other mobile devices — here come the “wearables” — has opened up new opportunities for businesses to leverage employee-owned technology for competitive advantage. That being said, the use of such devices in the workplace can compromise sensitive data, especially when comprehensive BYOD policies are not implemented and…

Added by Beau Winchester on September 28, 2014 at 11:00am — No Comments

Apache Spark: distributed data processing faster than Hadoop

This post is excerpted from DataScience Hacks by the author himself.

Apache Spark is another Apache-licensed top-level project that can perform large-scale data processing much faster than Hadoop (I am referring to MR1.0 here). This is possible thanks to the Resilient Distributed Dataset (RDD) concept behind this fast data processing. An RDD is basically a collection of objects,…

Added by Pavan Kumar on September 28, 2014 at 7:00am — 1 Comment

Data Instrumentalism

Being the son of a mechanic, I have spent many years handling power tools. I'm especially fond of a couple of hammer-drills in my possession. They can effortlessly drill holes through concrete. At least, this is what my father once claimed. He handed down his most treasured tools to me. I'm big on pliers and screwdrivers. This might be due to my vocational training as a technician. Even today - long after I completed my diploma and continued to further my education - I still carry a licence…

Added by Don Philip Faithful on September 27, 2014 at 7:39am — No Comments

How Tracking Analytics Can Improve Content Marketing

Inbound and content marketing are not going anywhere anytime soon. The content marketing association reports that over 90% of both enterprise B2B and B2C companies are using the tactic. There are a million different ways to leverage content strategy, and here at TechnologyAdvice, we’ve experimented with plenty of them. It’s been a fun, albeit educational, experience to say the…

Added by Keith Cawley on September 25, 2014 at 4:59am — No Comments

Weekly Digest - September 29

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 

Featured

Added by Vincent Granville on September 24, 2014 at 5:30pm — No Comments

Caveat Data Scientist: Public Trust Low for Science

A new paper entitled "Gaining Trust as well as Respect in Communicating Science Topics" published in the Proceedings of the National…

Added by Michael Walker on September 24, 2014 at 5:07pm — No Comments

SKOS as a Key Element in Enterprise Linked Data Strategies

The challenges in implementing linked data technologies in enterprises are not limited to technical issues. Such projects also face organisational hurdles, for instance the development of employee skills in the area of knowledge modelling and the implementation of a linked data strategy which foresees a cost-effective and sustainable infrastructure of high-quality and linked knowledge graphs. SKOS is able to play a key role in enterprise linked data strategies…

Added by Andreas Blumauer on September 21, 2014 at 10:22pm — No Comments

Top 2,500 Websites - top of the top

For explanations about the methodology, including source code and possible improvements, read our main article on this subject. It also provides links to our other three listings.

The field between parentheses represents the year when the website in question was first mentioned - it does not represent when the website was created,…

Added by Vincent Granville on September 20, 2014 at 12:00pm — 2 Comments

Top 2,500 Data Science, Big Data and Analytics Websites

The following comprehensive listings were produced by analyzing our large member database, extracting websites that our members mentioned or liked, and for each website, identifying:

  • When it was first mentioned by one of our members
  • The number of times it was mentioned
  • Keywords found when visiting the front page with a web crawler, using a pre-selected list of seed keywords

The design of the member database (non-mandatory sign-up questions and…

Added by Vincent Granville on September 20, 2014 at 10:00am — No Comments

Embrace Relationships with Neo4J, R & Java

Graphs are everywhere, used by everyone, for everything. Neo4j is one of the most popular graph databases and can be used to make recommendations, get social, find paths, uncover fraud, manage networks, and so on. A graph database can store any kind of data using nodes (graph data records), relationships (which connect nodes), and properties (named data values).

A graph database can handle connected data in ways that are otherwise not possible with either relational or other NoSQL databases…

Added by Raghavan Madabusi on September 19, 2014 at 6:31pm — No Comments

Preliminary findings about Zipf's Law (the thick tail distribution)

In this blog post I summarize my current findings on the Zipf's law clustering process.

I have done some extensive simulations on the clustering of entities, which should lead to Zipf's law.

To summarize the simulated clustering process (which complies with the guidelines Dr. Granville provided in his post):…

Added by Tom De Smedt on September 19, 2014 at 5:00pm — No Comments

Big Data is Stupid Data

The "Big Data" marketing hype obscures the fact that more actionable, valuable insights are likely to be found in the right smaller "Smart Data" sets in contrast to large data sets.



While the term "Big Data" is properly defined as data sets whose size is beyond the ability of commonly used software tools to capture, manage, and process the data within a tolerable elapsed time - the…

Added by Michael Walker on September 19, 2014 at 11:10am — 2 Comments

Is BYOD Momentum Fading?

Not long ago, bring your own device (BYOD) was the trend to watch among enterprises. Companies of all shapes and sizes looked at BYOD as the next big thing in business, a way to make…

Added by Rick Delgado on September 18, 2014 at 10:00am — No Comments

Weekly Digest - September 22

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 

Sponsored Announcement…

Added by Vincent Granville on September 17, 2014 at 5:30pm — 1 Comment

What Are Analytic Marketplaces?

The Naked Analyst Episode 9 looks at the best places for analytic competitions.

Here are my top 4 sites. They range from the well known to some more specialist sites. There are 3 marketplaces I know of, and a 'world championship' currently running.

Kaggle

Kaggle is the world's largest community of data scientists. They compete with…

Added by Steve Bennett on September 16, 2014 at 7:17pm — No Comments

Defining Big Data

Defining big data is now a hot topic. Berkeley University posted 40 very short definitions by thought leaders (including me). Here our goal is to offer a very detailed, comprehensive definition that (hopefully) suits everyone.

First, there are three…

Added by Vincent Granville on September 16, 2014 at 6:00pm — 2 Comments

© 2019   Data Science Central ®