Subscribe to DSC Newsletter

Michael Walker's Blog Posts Tagged 'Processing' (4)

Apache Beam - Create Data Processing Pipelines

At the Data Science Association our members often complain about the major data engineering problem of finding the right tools and programming models to build both robust data processing pipelines and efficient ETL processes for data transformation and integration.…



Continue

Added by Michael Walker on May 19, 2016 at 10:00pm — No Comments

The Haboob Clouds Hadoops Future

Hadoop is an open source framework for storing massive amounts of data on clusters of commodity hardware.



Haboob is a dense dust storm that moves fast…

Continue

Added by Michael Walker on March 23, 2014 at 9:03am — 3 Comments

Tool for Computing Continuous Distributed Representations of Words

Natural language processing (NLP) involves machine learning, artificial intelligence, algorithms and linguistics related to interactions between computers and human languages. One important goal…

Continue

Added by Michael Walker on August 20, 2013 at 7:27pm — No Comments

Batch vs. Real Time Data Processing

Batch data processing is an efficient way of processing high volumes of data is where a group of transactions is collected over a period of time. Data is collected, entered, processed and then the batch results are…

Continue

Added by Michael Walker on August 13, 2013 at 2:30pm — 2 Comments

Videos

  • Add Videos
  • View All

© 2019   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service