Serialization

Jan 22, 2026 Engineering

Spark Scala - Encoders and Serialization

Serialization is the silent performance killer in distributed computing. Every time Spark shuffles data between executors, broadcasts variables, or caches RDDs, it serializes objects. Poor…

Read more →

Jan 16, 2026 Engineering

Serialization: JSON, Protocol Buffers, MessagePack

Serialization converts in-memory data structures into a format that can be transmitted over a network or stored on disk. Deserialization reverses the process. Every time you make an API call, write…

Read more →

Jan 02, 2026 Rust

Rust Serde: Serialization and Deserialization

Serde is Rust’s de facto serialization framework, providing a generic interface for converting data structures to and from various formats. The name combines ‘serialization’ and ‘deserialization,’…

Read more →

Oct 09, 2025 Engineering

Protocol Buffers: Schema-Based Serialization

JSON is convenient until it isn’t. At small scale, the flexibility of schema-less formats feels like freedom. At large scale, it becomes a liability. Every service parses JSON differently. Field…

Read more →

Jan 09, 2025 Data Engineering

Apache Spark - Serialization (Kryo vs Java)

Apache Spark serializes objects when shuffling data between executors, caching RDDs in serialized form, and broadcasting variables. The serialization mechanism directly impacts network I/O, memory…

Read more →