Free Livestream: LineageDB Architecture for Big Data Analytics & Data Quality - Wed. Dec. 3 @6pm MST

Event Details

Time: December 3, 2014 from 6pm to 9pm
Location: University of Colorado Boulder
Street: 1125 18th St Bldg 223 Room 100
City/Town: Boulder, CO
Website or Map: http://www.meetup.com/Data-Sc…
Event Type: free and open to all, livestream
Organized By: Michael Walker
Latest Activity: Nov 19, 2014

Event Description

Register @ http://bit.ly/1Elg3tJ

NOTE: If you are unable to attend in person, register anyway and we will email you a livestream link 2 hours before the event.

LineageDB Architecture for Big Data Analytics - Abstract

Traditional data analytics platforms are:

• tightly coupled to expensive relational data services;
• limited to star and snowflake schemas (notoriously difficult to maintain); and
• heavily dependent on brittle, expensive ETLs.

An RDBMS can be scaled vertically (at a steep price), but eventually you run out of runway because a B-tree does not scale linearly. The morphing of relational services into MPP appliances has resulted in platforms that are not flexible enough to support rapidly changing data analytics needs. These limitations can be overcome by adopting the LineageDB architecture, a polyglot composed of loosely coupled, open-source:

• key-value storage service;
• index service;
• graph service;
• SQL service; and
• in-memory data service.

Charles Clifford - Bio

Charles Clifford has been designing and developing both transactional and analytic business solutions since the early 90s. He has delivered distributed solutions to a variety of industries, from telecom to capital markets to health care to software powerhouses. His current focus is on the design and delivery of DaaS solutions.

Data Quality - the Dirty Underbelly of Data Science - Abstract

Data quality continues to be one of the chief challenges, costs, and reasons for project failure in data science. Problems in this space limit accuracy, destroy credibility, and can result in harmful solutions. And unlike challenges such as scalability and cost, it has seen no major breakthrough improvements. This presentation will cover the types of problems, as well as their impacts, causes, and various solutions.

Ken Farmer - Bio

Ken Farmer is the senior data architect/wrangler/librarian for ProtectWise, where he is developing their analytical data solution. Previously, he developed, maintained, managed, and consulted on analytical data architectures for IBM, MapQuest, Verizon, and others.
