A work breakdown structure (WBS) for big data analytics projects — Part 1

Vijaya Phanindra
2 min readMar 1, 2020

Ever wondered what activities are involved in a big data project? A work breakdown structure (WBS) helps in selecting technology/tools, defining project scope and estimating the effort.

Big data Analytics projects have the following unique features compared with traditional data warehouse projects

* Storage and Compute is decoupled
* Schema on write is not required.
* Supports polyglot storage ex: raw, object store, NoSQL, columnar.
* Supports polyglot consumption of data products

As such a big data analytics project scope, tools, tasks, and estimation is completely different from data warehouse.

Breaking down tasks for big data projects is hard and so is estimating.

The easiest way to start identifying tasks is to look at solution architectures of big data projects and in almost all of the cases, the design pattern is

Source --> Ingest --> Store --> Process --> Analyze > Consume

though each of these individual components can/will have multiple loosely coupled technologies/systems which makes it so hard in implementing and operationalizing.

The first part of blog post covers activities in the Ingest component of a big data analytics project.

A list of activities related to ingestion in big data analytics projects
Big Data Analytics Ingest — WBS

Related

Work breakdown structure for big data analytics projects — part 2 store

Work breakdown structure for big data analytics projects — part 3 process

Disclaimer: All the opinions expressed are personal independent thoughts and not to be attributed to my current or previous employers.

--

--

Vijaya Phanindra

I am a Cloud and Data Architect and I write about tech (data analytics, data products, real time streaming analytics), career development and decision making