Consolidated list of papers on distributed database systems and parallel computing papers from Google Research

Vijaya Phanindra
2 min readMay 23, 2022

MapReduce programming model fundamentally changed the data processing world. The MapReduce pattern may be older, but Google’s MapReduce seminal paper and the Hadoop open-source distribution enabled programmers to run analysis on large data sets without having expertise in distributed servers and cluster management.

Google published many such papers over some time.
Some of the tools are run at planet scale internally, and some are available for external customers via the Google Cloud Platform.

The following is the list of papers in the area of distributed systems and parallel computing published by Google. These provide a wealth of information for someone working in the modern cloud, distributed databases, and systems.

Disclaimer: All the opinions expressed are personal independent thoughts and not to be attributed to my current or previous employers.

--

--

Vijaya Phanindra

I am a Cloud and Data Architect and I write about tech (data analytics, data products, real time streaming analytics), career development and decision making