Partition

SQL

SQL - PARTITION BY Clause

The PARTITION BY clause defines logical boundaries within a result set for window functions. Unlike GROUP BY, which collapses rows into aggregate summaries, PARTITION BY maintains all original rows…

Read more →
Scala

Scala - partition, span, splitAt

Scala provides three distinct methods for dividing collections: partition, span, and splitAt. Each serves different use cases and has different performance characteristics. Choosing the wrong…

Read more →
Engineering

Apache Spark - Partition Pruning

Partition pruning is Spark’s mechanism for skipping irrelevant data partitions during query execution. Think of it like a library’s card catalog system: instead of walking through every aisle to find…

Read more →