Ivo Stratev
Feb 19, 2023

First, sorry for the late reply Arli!

Okay, I agree about the column-oriented storage but the RowGroups and Stats are so key for the Parquet format that without explaining them we are missing 2 out of 3 reasons why Parquet even exists :)

The third is the implementation of the Dremel paper but this is not so key for the initial grasp of the Parquet format and can be explained separately because one needs to explain definition and repetition levels which is far from trivial...

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

Ivo Stratev
Ivo Stratev

Written by Ivo Stratev

Passionate about Programming. Interested in Highly Distributed Systems and the Microservice Architecture. In love with Math and proving things.

No responses yet

Write a response