Dogfooding with Druid, Samza, and Kafka: Metametrics at Metamarkets

June 3rd, 2015

“Another flaw in the human character is that everybody wants to build and nobody wants to do maintenance.” – Kurt Vonnegut

Every engineer loves the feeling of standing up a […]

Five Tips for a F’ing Great Logo

July 23rd, 2014

David Hertog and Fangjin Yang

This post originally appeared on Druid.io on July 23, 2014. Everyone wants a great logo, but it’s notoriously difficult work—prone to miscommunications, heated debates and countless revisions. Still, after three years […]

Building a Data Pipeline That Handles Billions of Events in Real-Time

July 21st, 2014

Fangjin Yang and Gian Merlino

At Metamarkets our goal is to help our clients make sense of large amounts of data in real-time. Our platform ingests tens of billions of new events every day, and […]

Open Source Leaders Sound Off on The Rise of the Real-Time Data Stack

May 7th, 2014

Fangjin Yang and Gian Merlino

In February we were honored to speak at the O’Reilly Strata conference about building a robust, flexible, and completely open source data analytics stack. If you couldn’t make it, you […]

How We Scaled HyperLogLog: Three Real-World Optimizations

February 18th, 2014

Nelson Ray and Fangjin Yang

At Metamarkets, we specialize in converting mountains of programmatic ad data into real-time, explorable views. Because these datasets are so large and complex, we’re always looking for ways to maximize […]

Maximum Performance with Minimum Storage: Data Compression in Druid

September 21st, 2012

Fangjin Yang

The Metamarkets solution allows for arbitrary exploration of massive data sets. Powered by Druid, our in-house distributed data store and processor, users can filter time series and top list queries […]

Fast, Cheap, and 98% Right: Cardinality Estimation for Big Data

May 4th, 2012

Fangjin Yang

The nascent era of big data brings new challenges, which in turn require new tools and algorithms. At Metamarkets, one such challenge focuses on cardinality estimation: efficiently determining the number […]
