Behind the Scenes of our Transition to a Multi-Cloud Environment

Filed in Algorithms, Company, Druid, Technology

Service uptime is the performance metric that determines operational success and when something fails, the impact can be far reaching, often affecting a business’s bottom line. One of the downsides of running infrastructure in a public cloud is that we are dependent on the SLAs provided by our Cloud Providers. As a startup, we have been upgrading our systems to become a lot more fault-tolerant, but since our cloud infrastructure footprint is restricted to one region, and the oldest region of AWS at that, we are vulnerable to be bitten by cloud service blackouts or brownouts. The most prominent solution […]

Behind the Scenes of our Transition to a Multi-Cloud Environment
Read Post Comments

Going Multi-Cloud with AWS and GCP: Lessons Learned at Scale

Filed in Algorithms, Druid, Industry, Technology

Metamarkets handles a lot of data. The torrent of data that clients send to us surpasses a petabyte a week. At this scale, the ability to failover gracefully, to detect and eliminate brownouts, and to efficiently operate huge quantities of byte-banging machines is necessary. We started and grew Metamarkets in AWS’s us-east region. And the majority of our footprint was in a single availability zone (AZ). As we grew, we started to see the side effects of being restricted to one AZ, then the side effects of being restricted to one region. It’s kind of like inflating a balloon in […]

Going Multi-Cloud with AWS and GCP: Lessons Learned at Scale
Read Post Comments