Mastering Real-Time Anomaly Detection & Data Engineering
- 5:30 to 6:00 PM: Doors open, check in, and networking
- 6:00 to 7:00 PM: Talks and Q&A
- 7:00 to 8:00 PM: Closing remarks and networking
We are thrilled to host the inaugural DET Warsaw Meetup! Join us at Netflix's Warsaw office (in the Wola district) for an evening of learning, networking, and fun. Refreshments will be provided.
With limited capacity, please RSVP to secure your spot.
Cheers!
The DET Warsaw Team
Speakers & Talks
Talk #1: Content Data Lifecycle at Netflix
Speakers: Inna Giguere (Director of Content Data Products at Netflix) and Vivek Pasari (Manager of Security and Platform Data Engineering at Netflix)
This talk focuses on content data lifecycle at Netflix, illustrating how data and analytics underpin every stage of content development, production, and distribution. Beginning with the ideation and pitch phase, we outline how data-driven insights inform script selection, talent acquisition, and deal negotiations. It then follows the journey through production planning, budget analysis, and post-production processes, demonstrating how predictive analytics and machine learning optimize resource allocation and scheduling.
The discussion further explores how content is prepared for launch—through localization, quality control, and promotional strategies—before shifting to post-launch analysis, where custom-built data products and visualizations measure audience engagement, performance, and member satisfaction. The presentation highlights the collaborative efforts of specialized teams in engineering, analytics, and product management, all working to build robust data foundations and tools that enable seamless decision-making throughout the content lifecycle.
By showcasing real-world examples such as budget analyzers, VFX cost predictions, and ratings automation, the presentation emphasizes Netflix’s holistic, data-driven approach to managing content from conception to audience impact, ultimately ensuring both economic value and member joy.
Talk #2: Mastering Real-Time Anomaly Detection
Speaker: Olena Kutsenko (Staff Developer Advocate at Confluent)
Detecting problems as they happen is essential in today’s fast-moving, data-driven world. In this talk, you’ll learn how to build a flexible, real-time anomaly detection pipeline using Apache Kafka and Apache Flink, backed by statistical and machine learning models.
We’ll start by demystifying what "anomaly" really means - exploring the different types (point, contextual, and collective anomalies) and the difference between unintentional issues and intentional outliers like fraud or abuse.
Then, we’ll look at how anomaly detection is solved in practice: from classical statistical models like ARIMA to deep learning models like LSTM. You’ll learn how ARIMA breaks time series into AutoRegressive, Integrated, and Moving Average components - no math degree required (just a Python library, hehe)! We’ll also uncover why forgetting is a feature, not a bug, when it comes to LSTMs, and how these models learn to detect complex patterns over time.
Throughout, we’ll show how Kafka handles high-throughput streaming data and how Flink enables low-latency, stateful processing to catch issues as they emerge. You’ll leave knowing not just how these systems work, but when to use each type of model depending on your data and goals.
Whether you're monitoring system health, tracking IoT devices, or looking for fraud in transactions, this talk will give you the foundations and tools to detect the unexpected - before it becomes a problem.
(Interested in speaking at our meetups? Submit talk proposals here.)
About Data Engineer Things
Data Engineer Things (DET) is a global community built by data engineers for data engineers. Subscribe to the newsletter and follow us on LinkedIn to gain access to exclusive learning resources and networking opportunities, including articles, webinars, meetups, conferences, mentorship, and more.