Unifying multiple data sources and repositories is a challenge that Etsy, Inc. is solving with the Apache Kafka messaging system. Chris “CB” Bohn, senior database engineer for the Etsy online ...
Yelp today open sourced a key piece of code that helped it migrate from a monolithic code base to a distributed services-based architecture. Called Data Pipeline, the Python-based product saved the ...
Ably Kafka Connector 3.0 continues to deliver efficient and reliable Kafka pipeline extension capabilities for developers, now with improvements to several features. Enhanced throughput and ...
Vivek Yadav, an engineering manager from ...
As a big Hadoop user, Pandora Media is no stranger to distributed processing technologies. But when the music streaming service decided to transition its ad tracking system from a batch-oriented ...
Confluent, founded by the creators of Apache™ Kafka™, announced the release of open source Confluent Platform 2.0, based on an updated Apache Kafka 0.9 core. Representing a big leap forward in the ...
In a connected world, real-time data pipelines power applications and insights, providing the digital infrastructure for active data. This helps data-driven companies understand how their customer ...
Streaming is hot. The demand for real-time data processing is rising, and streaming vendors are proliferating and competing. Apache Kafka is a key component in many data pipeline architectures, mostly ...
Imagine it’s 3 a.m. and your pager goes off. A downstream service is failing, and after an hour of debugging you trace the issue to a tiny, undocumented schema change made by an upstream team. The fix ...