Sayantan Das sayantan-manulife

LLM Wiki

A pattern for building personal knowledge bases using LLMs.

This is an idea file, it is designed to be copy pasted to your own LLM Agent (e.g. OpenAI Codex, Claude Code, OpenCode / Pi, or etc.). Its goal is to communicate the high level idea, but your agent will build out the specifics in collaboration with you.

The core idea

Most people's experience with LLMs and documents looks like RAG: you upload a collection of files, the LLM retrieves relevant chunks at query time, and generates an answer. This works, but the LLM is rediscovering knowledge from scratch on every question. There's no accumulation. Ask a subtle question that requires synthesizing five documents, and the LLM has to find and piece together the relevant fragments every time. Nothing is built up. NotebookLM, ChatGPT file uploads, and most RAG systems work this way.

Slack System Design: A Grounded Teardown

A reverse-engineered system design of Slack's web application, built from live network traffic analysis of the authenticated Enterprise Grid experience. 200+ API calls captured across boot, search, messaging, reactions, and navigation. Every backend service named.

Architecture Overview

┌─────────────────────────────────────────────────────────────────────────────┐
│  BROWSER (Gantry v2 SPA)                                                    │
│                                                                             │

Netflix System Design: A Grounded Teardown

A reverse-engineered system design of Netflix's web application, built entirely from live network traffic analysis of the authenticated browse experience. 177 requests captured, every API contract inspected, every subsystem named.

Architecture Overview

┌─────────────────────────────────────────────────────────────────────────────┐
│  BROWSER (Akira SPA)                                                        │
│                                                                             │

GitHub Copilot CLI SKILL

Use models like Gemini 3 Pro, GPT-5.1, and GPT-5.1-Codex from within Claude by invoking GitHub Coplit CLI.

Installation

Create ~/.claude/skills/github-copilot
Save SKILL.md to ~/.claude/skills/github-copilot/SKILL.md

Configuring OpenAI Codex CLI with Azure OpenAI: A Working Solution

If you've been trying to get OpenAI's Codex CLI working with Azure OpenAI Service, you're not alone in facing configuration headaches. The ongoing transition at Azure, combined with inconsistent documentation and API version differences, can make this setup feel like navigating a maze blindfolded.

After countless hours of debugging 404 errors, stream failures, and authentication issues, here's a working configuration that actually works with Azure AI Foundry and GPT-4o.

The Challenge

The Codex CLI documentation provides a basic Azure configuration example, but the reality is more complex:

🧠 How to Save Context Tokens When Using Claude Code

This is a personal reference workflow for minimizing token usage while maintaining project continuity across Claude Code (Sonnet 4 with file access).

✅ Setup: Populate `CLAUDE.md`

Claude loads CLAUDE.md automatically at session start.

Export pull request

GitHub

Save a pull request as a pr.diff

curl -H "Accept: application/vnd.github.v3.diff" -u [username]:[personal_access_token] https://api.github.com/repos/[organization]/[repo]/pulls/[pull id] > pr.diff

Replace

username your Github username

Sayantan Das sayantan-manulife

LLM Wiki

The core idea

Slack System Design: A Grounded Teardown

Architecture Overview

Netflix System Design: A Grounded Teardown

Architecture Overview

GitHub Copilot CLI SKILL

Installation

Configuring OpenAI Codex CLI with Azure OpenAI: A Working Solution

The Challenge

🧠 How to Save Context Tokens When Using Claude Code

✅ Setup: Populate CLAUDE.md

Export pull request

GitHub

Save a pull request as a pr.diff

✅ Setup: Populate `CLAUDE.md`