Back to Home

Data Science Posts

Data Science

Detecting Silent Timestamp Drift In Streaming Sensor Data With Sql And Row-Level Lineage

I got burned by a failure mode that’s weirdly hard to spot: my real-time model didn’t crash, but it started learning from “time-shifted” data. The sen...

Jun 26, 2026Read more
Data Science

Building A Backpressure-Aware Real-Time Feature Store Using Postgres Logical Replication And Kafka

The problem I ran into: “real-time” that wasn’t I built what I thought was a clean real-time analytics pipeline: events flowed into Kafka, a consume...

Jun 25, 2026Read more
Data Science

Optimizing Postgres With Generated Columns For Jsonb Key Existence Filters

The problem I kept running into I was building a real-time analytics pipeline on PostgreSQL, where events came in as `JSONB`. Each event had a paylo...

Jun 5, 2026Read more
Data Science

Predicting Same-Day Checkout Failures With Feature-Arrival Time Windows

I ran into a weird failure mode while building a predictive model for “will this checkout fail today?” The data looked fine in batch, but during real-...

May 5, 2026Read more
Data Science

Building A Scd Type 2 “Event-Time Merge” For Late Arriving Clickstream In Dbt And Snowflake

Building a SCD Type 2 “Event-Time Merge” for Late Arriving Clickstream in dbt and Snowflake The problem that pulled me into this niche is simple: cl...

Apr 22, 2026Read more
Data Science

Taming The “Late-Arriving Dimension” Spike With Sql Window Filters

The problem I chased: a sudden 10x query spike in real-time metrics I hit a performance cliff in a real-time analytics pipeline that joins event dat...

Apr 21, 2026Read more
Data Science

Event-Time Dedupe Drift In Kafka Streams With Late Arrivals

The bug I couldn’t explain: “Same event, different features” I ran a real-time feature pipeline for an AI model and watched accuracy slowly degrade o...

Apr 15, 2026Read more
Data Science

Building A Streaming “Monotonic Timestamp” Guardrail In Kafka Connect

Building a Streaming “Monotonic Timestamp” Guardrail in Kafka Connect A weird class of incidents kept biting me in real-time pipelines: event times ...

Apr 3, 2026Read more