The Power of Data Extraction: Pull vs. Push Strategies

Nnaemezue Obi-Eyisi
3 min read · Oct 26, 2023

In the realm of Data Engineering, the methods we use to extract data are just as pivotal as the data itself. Two primary strategies, known as Pull and Push, stand at the forefront of shaping how we collect and process information. Let’s delve into the world of data extraction strategies to understand their merits, drawbacks, and real-world applications.

The Pull Strategy

Flexible and Controlled Data Retrieval

With the Pull strategy, the consuming system "pulls" data from its source when it needs it, rather than waiting for the source to send it. This approach gives data engineers the flexibility and control to decide when and how data is retrieved. It's an ideal choice when you need to work with large datasets or when the source systems are not capable of actively sending data.
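As a rough illustration of what "pulling when you need it" can look like, here is a minimal sketch in Python. It uses an in-memory SQLite table as a stand-in for a source system; the `orders` table, the `make_source` helper, and the `pull_orders` function are all hypothetical names made up for this example, not part of any particular tool.

```python
import sqlite3


def make_source() -> sqlite3.Connection:
    """Build a throwaway in-memory table standing in for a source system (assumption)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL)")
    conn.executemany(
        "INSERT INTO orders (id, amount) VALUES (?, ?)",
        [(1, 10.0), (2, 25.5), (3, 7.25)],
    )
    conn.commit()
    return conn


def pull_orders(conn: sqlite3.Connection, after_id: int = 0) -> list:
    """Pull rows with id greater than after_id.

    The source does nothing on its own; the caller decides when to run this
    and how far back to look.
    """
    cursor = conn.execute(
        "SELECT id, amount FROM orders WHERE id > ? ORDER BY id",
        (after_id,),
    )
    return cursor.fetchall()


source = make_source()

# First extraction: the consumer asks for everything available so far.
first_pull = pull_orders(source)

# The source receives a new row at some later point.
source.execute("INSERT INTO orders (id, amount) VALUES (4, 99.0)")
source.commit()

# Second extraction: the consumer asks only for rows it has not seen yet.
last_seen_id = first_pull[-1][0]
second_pull = pull_orders(source, after_id=last_seen_id)
```

The point of the sketch is that both the timing (when `pull_orders` is called) and the scope (the `after_id` argument) are chosen by the consumer, not by the source.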

Pros of the Pull Strategy:

  1. Flexibility in Scheduling Extractions: With the Pull strategy, you can set up extraction schedules that align with your needs, ensuring that data is available when you require it.
  2. Ideal for Large Datasets: When working with vast amounts of data, Pull is the go-to strategy, as it allows you to fetch data in manageable chunks.
  3. Minimal Impact on Source Systems: Unlike Push…
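To make the "manageable chunks" idea from point 2 concrete, here is a small sketch, again assuming an in-memory SQLite table as the source. The `events` table and the `pull_in_chunks` generator are hypothetical names used only for illustration; a real pipeline would point at its own source and pick a chunk size that suits it.

```python
import sqlite3
from typing import Iterator, List, Tuple


def pull_in_chunks(
    conn: sqlite3.Connection, chunk_size: int
) -> Iterator[List[Tuple[int, str]]]:
    """Yield rows from the (hypothetical) events table, chunk_size rows at a time."""
    cursor = conn.execute("SELECT id, payload FROM events ORDER BY id")
    while True:
        rows = cursor.fetchmany(chunk_size)
        if not rows:
            # An empty list means the cursor is exhausted.
            break
        yield rows


# Stand-in source with ten rows (assumption for the example).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (id INTEGER PRIMARY KEY, payload TEXT)")
conn.executemany(
    "INSERT INTO events (id, payload) VALUES (?, ?)",
    [(i, f"event-{i}") for i in range(1, 11)],
)
conn.commit()

# Process the data four rows at a time instead of loading it all at once.
chunk_sizes = [len(chunk) for chunk in pull_in_chunks(conn, chunk_size=4)]
```

Because the generator only holds one chunk in memory at a time, the consumer can process each batch before asking for the next one.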

Nnaemezue Obi-Eyisi

I am passionate about empowering, educating, and encouraging individuals pursuing a career in data engineering. Currently a Senior Data Engineer at Capgemini.