PyData Global 2023

Arthur Andres

After graduating with an engineering degree in 2009, I’ve worked in all four corners of the City of London, for various financial institutions, big and small.
As a software engineer, I specialise in data intensive applications.
I've worked with both real-time systems, and batch jobs.
I have a keen interest in how we can get the two to interact seamlessly.

The speaker's profile picture

Sessions

12-08
16:00
30min
Unified batch and stream processing in python
Arthur Andres

Historically it's been difficult to reuse existing batch processing code in streaming application.
Because of this, ML engineers had to maintain two implementations of their jobs.
One for streaming and one for batch.

In this talk we'll introduce beavers, a stream processing library optimized for analytics.
It can be used to run both batch and streaming jobs with minimal code duplication, whilst still being good at both.

Data Track
Data Track