Subscribe to DSC Newsletter
Jonathan Symonds
  • Male
  • Menlo Park, CA
  • United States
Share on Facebook
Share

Gifts Received

Gift

Jonathan Symonds has not received any gifts yet

Give a Gift

 

Jonathan Symonds's Page

Latest Activity

Jonathan Symonds posted a blog post

Accelerating aggregate MD5 hashing up to 800% with AVX512

By Frank WesselsIntroductionWhile MD5 hashing is no longer a good choice when considering a hash function, it is still being used in a great variety of applications. As such any performance improvements that can be made to the MD5 hashing speed are worth considering.Due to recent improvements in SIMD processing (AVX2 and especially AVX512) we are providing a Go md5-simd package that accelerates MD5 hashing in aggregate by up to 400% on AVX2 and up…See More
May 3
Jonathan Symonds posted a blog post

Data Storage at the Edge: Typologies and Attributes

Edge computing is a hot topic and carries with it some confusion, particularly around storage. Handling data properly at the edge can ensure a scalable, cost-effective and secure infrastructure - but failing to set up the right architecture can lead to data loss, security vulnerabilities and sky-high costs related to the bandwidth needed to transfer data repeatedly to and from the public cloud. Bandwidth is a key consideration from an architecture perspective, and the reason why is clear: it is…See More
Mar 12
Jonathan Symonds posted a blog post

You can't containerize an appliance: why Kubernetes and high performance object storage are tech's new building blocks.

There are two forces that are fundamentally remaking the technology landscape today. One is Kubernetes and the other is high performance Object Storage. They are powering (or are shaped by, depending on your perspective) modern, data-rich applications that include AI/ML and application logs. Either way, modern applications need Kubernetes and Object Storage and Kubernetes and Object Storage owe their rise in part to these same modern applications.They are symbiotic and they are tech’s new…See More
Feb 14
Jonathan Symonds posted a blog post

simdjson-go: Parsing gigabytes of JSON per second in Go

Written by Frank WesselsIntroductionJSON has established itself as the "lingua franca" of the web. As such the parsing performance of JSON is hugely important for many applications. Despite the simple and human-friendly nature of JSON, it is not a technically trivial format to parse at high speeds.Recently some new designs have been presented one of which is simdjson by Daniel Lemire and Geoff Langdale. simdjson uses a novel, two stage approach…See More
Feb 13
Jonathan Symonds's blog post was featured

Kubernetes Storage Patterns

Via Nitish TiwariKubernetes has fundamentally altered  the traditional application development and deployment patterns. Application development teams can now develop, test and deploy their apps in days, across different environments, all within their Kubernetes clusters. Previous generations of technology typically took weeks if not months.This acceleration is possible due to the abstraction that Kubernetes brings to the table, i.e. it deals with underlying details of physical or virtual…See More
Feb 6
Jonathan Symonds posted a blog post

Breaking the HDFS Performance Barrier; An Object Storage First

By Siddartha ManiFew would argue with the statement that Hadoop HDFS is in decline. In fact, the HDFS part of the Hadoop ecosystem is in more than just decline - it is in freefall. At the time of its inception, it had a meaningful role to play as a high-throughput, fault-tolerant distributed file system. The secret sauce was data locality. By co-locating compute and data on the same nodes, HDFS overcame the limitations of slow network access to data. The implications, however, are well known at…See More
Aug 11, 2019
Jonathan Symonds's blog post was featured

Breaking the HDFS Performance Barrier; An Object Storage First

By Siddartha ManiFew would argue with the statement that Hadoop HDFS is in decline. In fact, the HDFS part of the Hadoop ecosystem is in more than just decline - it is in freefall. At the time of its inception, it had a meaningful role to play as a high-throughput, fault-tolerant distributed file system. The secret sauce was data locality. By co-locating compute and data on the same nodes, HDFS overcame the limitations of slow network access to data. The implications, however, are well known at…See More
Aug 11, 2019
Jonathan Symonds posted a blog post

Running Peta-Scale Spark Jobs on Object Storage Using S3 Select

When one looks at the amazing roster of talks for most data science conferences what you don’t see is a lot of discussion on how to leverage object storage. On some level you would expect to — ultimately if you want to run your Spark or Presto job on peta-scale data sets and have it be available to your applications in the public or private cloud — this would be the logical storage architecture.While logical, there has been a catch, at least historically, and that is object storage wasn’t…See More
Jun 25, 2019
Jonathan Symonds posted a blog post

Running Peta-Scale Spark Jobs on Object Storage Using S3 Select

When one looks at the amazing roster of talks for most data science conferences what you don’t see is a lot of discussion on how to leverage object storage. On some level you would expect to — ultimately if you want to run your Spark or Presto job on peta-scale data sets and have it be available to your applications in the public or private cloud — this would be the logical storage architecture.While logical, there has been a catch, at least historically, and that is object storage wasn’t…See More
Jun 23, 2019
Jonathan Symonds's blog post was featured

Running Peta-Scale Spark Jobs on Object Storage Using S3 Select

When one looks at the amazing roster of talks for most data science conferences what you don’t see is a lot of discussion on how to leverage object storage. On some level you would expect to — ultimately if you want to run your Spark or Presto job on peta-scale data sets and have it be available to your applications in the public or private cloud — this would be the logical storage architecture.While logical, there has been a catch, at least historically, and that is object storage wasn’t…See More
Jun 23, 2019

Profile Information

Seniority:
Manager
Short Bio:
Corporate marketing at Ayasdi
LinkedIn Profile:
http://ayasdi.com
Interests:
Other

Jonathan Symonds's Blog

Accelerating aggregate MD5 hashing up to 800% with AVX512

Posted on April 30, 2020 at 1:06pm 0 Comments

By Frank Wessels



Introduction

While MD5 hashing is no longer a good choice when considering a hash function, it is still being used in a great variety of applications. As such any performance improvements that can be made to the MD5 hashing speed are worth considering.

Due to recent improvements in SIMD processing (AVX2 and especially AVX512) we are providing a Go …

Continue

Data Storage at the Edge: Typologies and Attributes

Posted on March 9, 2020 at 8:32am 0 Comments

Edge computing is a hot topic and carries with it some confusion, particularly around storage. Handling data properly at the edge can ensure a scalable, cost-effective and secure infrastructure - but failing to set up the right architecture can lead to data loss, security vulnerabilities and sky-high costs related to the bandwidth needed to transfer data repeatedly to and from the public cloud. Bandwidth is a key consideration from an architecture perspective, and the reason why is clear: it…

Continue

You can't containerize an appliance: why Kubernetes and high performance object storage are tech's new building blocks.

Posted on February 14, 2020 at 10:16am 0 Comments

There are two forces that are fundamentally remaking the technology landscape today. One is Kubernetes and the other is high performance Object Storage. They are powering (or are shaped by, depending on your perspective) modern, data-rich applications that include AI/ML and application logs. Either way, modern applications need Kubernetes and Object Storage and Kubernetes and Object Storage owe their rise in part to these same modern applications.

They are symbiotic and they are…

Continue

simdjson-go: Parsing gigabytes of JSON per second in Go

Posted on February 11, 2020 at 8:30am 0 Comments

Written by Frank Wessels

Introduction

JSON has established itself as the "lingua franca" of the web. As such the parsing performance of JSON is hugely important for many applications. Despite the simple and human-friendly nature of JSON, it is not a technically trivial format to parse at high speeds.

Recently some new designs have been presented one of which is …

Continue

Comment Wall

You need to be a member of Data Science Central to add comments!

Join Data Science Central

  • No comments yet!
 
 
 

Videos

  • Add Videos
  • View All

© 2020   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service