Subscribe to DSC Newsletter

September 2014 Blog Posts (52)

The 22 Skills of a Data Scientist

There has been a number of interesting articles recently, discussing the skills a data scientist should or might have. The one entitled The 22 Skills of a Data Scientist is a popular one (see 22 skills listed below, or click on the link to read the full article). Earlier this morning, I read another one on LinkedIn: …

Continue

Added by Vincent Granville on September 29, 2014 at 1:00pm — 7 Comments

Elements of machine learning

The official title of this free book available in PDF format is Machine Learning Cheat Sheet. But it's more about elements of machine learning, with a strong emphasis on classic statistical modeling, and rather theoretical - maybe something like a rather comprehensive, theoretical foundations (or handbook) of statistical science. Anyway, very interesting, and it's free. See table of content screenshot below. …

Continue

Added by Marcel Remon on September 29, 2014 at 9:30am — 1 Comment

More about Shifting Culture, Less about Investing in Potential

Data Science is often brought to companies as a potential game changer. An investment that may pay off if the company's data can be leveraged to provide insight and gain a competitive edge. But bringing analytical offerings to organizations as a "maybe solution" to their pain points misses the mark. Data science is today's answer to our most pressing enterprise and socially innovative challenges given the data-driven nature of our markets and society as a whole. If an investment in data…

Continue

Added by Sean McClure on September 29, 2014 at 9:03am — 1 Comment

New Beginnings in Facial Recognition

As humans, we navigate our lives largely by the recognition of patterns. These patterns include the sound of a mother’s voice, the appearance of a dangerous animal or poisonous food, the familiarity of kin, and the attraction to potential mates. Accurate pattern recognition is key to an animal’s survival and progress, and has allowed humans to become the socially complex and advanced species we are today. 

It should come as no surprise that…

Continue

Added by Sean McClure on September 29, 2014 at 9:01am — No Comments

Keeping Corporate Data Safe: 5 Ways Lax BYOD Policies Create Security Risks

The proliferation of smartphones, tablets, and other mobile devices — here come the “wearables” — has opened up new opportunities for businesses to leverage employee-owned technology for competitive advantage. That being said, the use of such devices in the workplace can compromise sensitive data, especially when comprehensive BYOD policies are not implemented and…

Continue

Added by Beau Winchester on September 28, 2014 at 11:00am — No Comments

Apache Spark: distributed data processing faster than Hadoop

This blog is extrapolated from DataScience Hacks by the author himself. 

Apache Spark, another apache licensed top-level project that could perform large scale data processing way faster than Hadoop (I am referring to MR1.0 here). It is possible due to Resilient Distributed Datasets concept that is behind this fast data processing. RDD is basically a collection of objects,…

Continue

Added by Pavan Kumar on September 28, 2014 at 7:00am — 1 Comment

Data Instrumentalism

Being the son of a mechanic, I have spent many years handling power tools. I'm especially fond of a couple of hammer-drills in my possession. They can effortlessly drill holes through concrete. At least, this is what my father once claimed. He handed down his most treasured tools to me. I'm big on pliers and screwdrivers. This might be due to my vocational training as a technician. Even today - long after I completed my diploma and continued to further my education - I still carry a licence…

Continue

Added by Don Philip Faithful on September 27, 2014 at 7:39am — No Comments

Hadoop is Dead. DataFlow is Alive!

We've given Hadoop almost 10 years to mature, invested billions, and very few companies are  seeing the return on investment.  Several companies have tried to make Hadoop a real-time analytical platform, incorporating SQL-like facades on top, but the latency is still not where it needs to be for interactive applications.  Even Google, a true big data user, has moved on and is using more dataflow / flow-based programming approaches.    Why?  It just makes sense...

  • Why should I…
Continue

Added by Lars Fiedler on September 27, 2014 at 7:30am — No Comments

Decipher Neo4J Cypher Query Language (CQL)

This blog post is a follow up post to Embrace Relationships with Neo4J, R & Java

Neo4j Cypher is a declarative graph query language that allows for expressive and efficient querying and updating of the graph store. Cypher is a relatively simple but still very powerful language. Very complicated database queries can easily be expressed through Cypher. This allows…

Continue

Added by Raghavan Madabusi on September 26, 2014 at 2:17am — No Comments

How Tracking Analytics Can Improve Content Marketing

Inbound and content marketing are not going anywhere anytime soon. The content marketing association reports that over 90% of both enterprise B2B and B2C companies are using the tactic. There are a million different ways to leverage content strategy, and here at TechnologyAdvice, we’ve experimented with plenty of them. It’s been a fun, albeit, educational experience to say the…

Continue

Added by Keith Cawley on September 25, 2014 at 4:59am — No Comments

Weekly Digest - September 29

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 

Featured

Continue

Added by Vincent Granville on September 24, 2014 at 5:30pm — No Comments

SKOS as a Key Element in Enterprise Linked Data Strategies

The challenges in implementing linked data technologies in enterprises are not limited to technical issues only. Projects like these deal also with organisational hurdles to be crossed, for instance the development of employee skills in the area of knowledge modelling and the implementation of a linked data strategy which foresees a cost-effective and sustainable infrastructure of high-quality and linked knowledge graphs. SKOS is able to play a key role in enterprise linked data strategies…

Continue

Added by Andreas Blumauer on September 21, 2014 at 10:22pm — No Comments

Top 2,500 Websites - not containing seed keywords

For explanations about the methodology, including source code and possible improvements, read our main article on this subject. It also provides links to our other three listings.

The field between parentheses represents the year when the website in question was first mentioned - it does not represent when the website was created, thought it's a…

Continue

Added by Vincent Granville on September 20, 2014 at 12:50pm — No Comments

Top 2,500 Websites - not crawlable

For explanations about the methodology, including source code and possible improvements, read our main article on this subject. It also provides links to our other three listings.

The field between parentheses represents the year when the website in question was first mentioned - it does not represent when the website was created, thought it's a…

Continue

Added by Vincent Granville on September 20, 2014 at 12:42pm — No Comments

Top 2,500 Websites - mentioned a few times (Page 2)

Click here for explanations.

  • macrorisk.com (2012) - analytics 
  • datamining.togaware.com (2012) - statistics, machine learning, analytics, data mining, database 
  • kavaii.com (2013) - statistics, big data, analytics 
  • data-mining-blog.com (2012) - text mining, analytics, data mining, business…
Continue

Added by Vincent Granville on September 20, 2014 at 12:35pm — No Comments

Top 2,500 Websites - mentioned a few times (Page 1)

Click here for explanations.

  • indiana.edu (2012) - analytics 
  • businessintelligence.ittoolbox.com (2011) - text mining, analytics, data mining, database, business intelligence 
  • plot.ly (2014) - analytics 
  • dssresources.com (2012) - business intelligence 
  • powerbi.com (2014) - analytics, business…
Continue

Added by Vincent Granville on September 20, 2014 at 12:32pm — 1 Comment

Top 2,500 Websites - top of the top

For explanations about the methodology, including source code and possible improvements, read our main article on this subject. It also provides links to our other three listings.

The field between parentheses represents the year when the website in question was first mentioned - it does not represent when the website was created, thought it's a…

Continue

Added by Vincent Granville on September 20, 2014 at 12:11pm — 2 Comments

Top 2,500 Websites - mentioned a few times

For explanations about the methodology, including source code and possible improvements, read our main article on this subject. It also provides links to our other three listings.

The field between parentheses represents the year when the website in question was first mentioned - it does not represent when the website was created, thought it's a…

Continue

Added by Vincent Granville on September 20, 2014 at 12:00pm — No Comments

Top 2,500 Data Science, Big Data and Analytics Websites

The following comprehensive listings were produced by analyzing our large member database, extracting websites that our members mentioned or liked, and for each web site, identifying

  • When it is first mentioned by one of our members
  • The number of times it was mentioned
  • Keywords found when visiting the front page with a web crawler, using a pre-selected list of seed keywords

The design of the member database (non-mandatory sign-up questions and…

Continue

Added by Vincent Granville on September 20, 2014 at 10:00am — No Comments

Monthly Archives

2017

2016

2015

2014

2013

2012

2011

1999

Follow Us

Videos

  • Add Videos
  • View All

Resources

© 2017   Data Science Central   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service