Files

Mar 03, 2026 TypeScript

TypeScript Declaration Files: Writing .d.ts Files

Declaration files are TypeScript’s mechanism for describing the shape of JavaScript code that exists elsewhere. When you use a JavaScript library in a TypeScript project, the compiler needs to know…

Read more →

Dec 16, 2025 R

R - Read/Write RDS and RData Files

R provides two native binary formats for persisting objects: RDS and RData. RDS files store a single R object, while RData files can store multiple objects from your workspace. Both formats preserve…

Read more →

Nov 25, 2025 Python

Python - Read/Write Binary Files

Binary files contain raw bytes without text encoding interpretation. Unlike text files, binary mode preserves exact byte sequences, making it critical for non-text data.

Read more →

Nov 25, 2025 Python

Python - Read/Write CSV Files

The csv module provides straightforward methods for reading CSV files. The csv.reader() function returns an iterator that yields each row as a list of strings.

Read more →

Nov 25, 2025 Python

Python - Read/Write Excel Files (openpyxl/xlsxwriter)

pip install openpyxl xlsxwriter pandas

Read more →

Nov 25, 2025 Python

Python - Read/Write JSON Files

• Python’s json module provides load()/loads() for reading and dump()/dumps() for writing JSON data with built-in type conversion between Python objects and JSON format

Read more →

Nov 18, 2025 Python

Python - List Files in Directory

The os.listdir() function returns a list of all entries in a directory as strings. This is the most straightforward approach for simple directory listings.

Read more →

Nov 07, 2025 Python

Python - Copy/Move/Rename Files (shutil)

The shutil module offers three primary copy functions, each with different metadata preservation guarantees.

Read more →

Sep 27, 2025 Pandas

Pandas: Reading and Writing Files

Every data project starts and ends with file operations. You pull data from CSVs, databases, or APIs, transform it, then export results for downstream consumers. Pandas makes this deceptively…

Read more →

Aug 16, 2025 Engineering

Memory-Mapped Files: Direct File Access

Traditional file I/O follows a predictable pattern: open a file, read bytes into a buffer, process them, write results back. Every read and write involves a syscall—a context switch into kernel mode…

Read more →

Jun 06, 2025 Pandas

How to Read Parquet Files in Pandas

Parquet is a columnar storage format that has become the de facto standard for analytical workloads. Unlike row-based formats like CSV where data is stored record by record, Parquet stores data…

Read more →

Jun 06, 2025 Python

How to Read Parquet Files in Polars

Parquet has become the de facto standard for analytical data storage. Its columnar format, efficient compression, and schema preservation make it ideal for data engineering workflows. But the tool…

Read more →

Jun 06, 2025 Engineering

How to Read Parquet Files in PySpark

Parquet has become the de facto standard for storing analytical data in big data ecosystems, and for good reason. Its columnar storage format means you only read the columns you need. Built-in…

Read more →

Jun 05, 2025 Pandas

How to Read CSV Files in Pandas

CSV files remain the lingua franca of data exchange. Despite the rise of Parquet, JSON, and database connections, you’ll encounter CSVs constantly—from client exports to API downloads to legacy…

Read more →

Jun 05, 2025 Python

How to Read CSV Files in Polars

Polars has rapidly become the go-to DataFrame library for Python developers who need speed without sacrificing usability. Built in Rust with a Python API, it consistently outperforms pandas on CSV…

Read more →

Jun 05, 2025 Engineering

How to Read CSV Files in PySpark

CSV files refuse to die. Despite better alternatives like Parquet, Avro, and ORC, you’ll encounter CSV data constantly in real-world data engineering. Vendors export it, analysts create it, legacy…

Read more →

Jun 05, 2025 Pandas

How to Read Excel Files in Pandas

Excel files remain stubbornly ubiquitous in data workflows. Whether you’re receiving sales reports from finance, customer data from marketing, or research datasets from academic partners, you’ll…

Read more →

Jun 05, 2025 Pandas

How to Read JSON Files in Pandas

JSON has become the lingua franca of web APIs and configuration files. It’s human-readable, flexible, and ubiquitous. But flexibility comes at a cost—JSON’s nested, hierarchical structure doesn’t map…

Read more →

Jun 05, 2025 Python

How to Read JSON Files in Polars

Polars has become the go-to DataFrame library for performance-conscious Python developers. While pandas remains ubiquitous, Polars consistently benchmarks 5-20x faster for most operations, and JSON…

Read more →

Jun 05, 2025 Engineering

How to Read JSON Files in PySpark

JSON has become the lingua franca of data interchange. Whether you’re processing API responses, application logs, configuration dumps, or event streams, you’ll inevitably encounter JSON files that…

Read more →