ShankarChavan / README.md
Created January 2, 2026 11:05 — forked from rjurney/README.md
Graphlet AI Claude Code PySpark Guide - customized Palantir-PySpark-Guide for effective PySpark in Claude Code

Note: this style guide is an edited version of the Palantir PySpark Style Guide, for which I am very grateful! You may use this one, or edit theirs, as a starting point for your own agent-based PySpark code.

Palantir PySpark Style Guide

PySpark Style Guide

PySpark is the Python API for Apache Spark: it lets users drive a Spark backend from Python to process data quickly. Spark can operate on massive datasets across a distributed network of servers, providing major performance and reliability benefits when used correctly. PySpark still presents challenges, even for experienced Python developers, because its syntax draws on Spark's JVM heritage and therefore uses code patterns that may be unfamiliar.

This opinionated guide to PySpark code style presents common situations we've encountered and the associated best practices, based on the topics that recur most frequently across PySpark repos.