Sitemap

Member-only story

I Almost Got Fired Because of Pandas on Databricks — Here’s What You Should Learn From My Mistake

4 min readMay 3, 2025

--

Just because your code works doesn’t mean it scales.

Photo by Igor Omilaev on Unsplash

Read for free here

The Mistake That Almost Ended My Data Engineering Career

I still remember the anxiety in my chest when the production job failed.

I was a few months into my first full-time data engineering role, working with a cloud-native stack on Databricks. Everything felt familiar enough: Python, notebooks, some new UI elements, but nothing I couldn’t figure out.

Then came the big moment.

A client needed a transformed dataset pushed into production — millions of rows with business-critical insights.

I knew exactly how to handle this. I opened Databricks, pasted in the same Pandas code I had relied on in Jupyter notebooks for years…

It worked flawlessly in dev.

So I scheduled the job, pushed it to production, and logged off feeling like a rockstar.

That night, I got an urgent call.
The production pipeline had failed.
The cluster had crashed.
SLAs were missed.
Our reporting dashboards were blank.
And I was this close to losing my job.

--

--

Nnaemezue Obi-Eyisi
Nnaemezue Obi-Eyisi

Written by Nnaemezue Obi-Eyisi

I am passionate about empowering, educating, and encouraging individuals pursuing a career in data engineering. Currently a Senior Data Engineer at Capgemini

Responses (29)