prompt-compression

Here are 49 public repositories matching this topic...

open-compress / claw-compactor

14-stage Fusion Pipeline for LLM token compression — reversible compression, AST-aware code analysis, intelligent content routing. Zero LLM inference cost. MIT licensed.

Updated Apr 1, 2026
Python

jia-gao / leanctx

Star

Drop-in prompt compression for production LLM apps. Cut your token bill 40-60% without changing your code. Python SDK, LLMLingua-2, MIT.

python gemini openai cost-optimization rag llm langchain anthropic llm-inference prompt-compression langgraph llmlingua

Updated May 4, 2026
Python

atjsh / llmlingua-2-js

Star

JavaScript/TypeScript implementation of LLMLingua-2 (Experimental)

nodejs javascript typescript web tensorflow transformers webgpu hf tensorflowjs prompt-engineering transformer-js prompt-compression llmlingua

Updated Sep 14, 2025
TypeScript

chappyasel / meta-kb

Star

A self-improving knowledge base about LLM agent infrastructure

markdown machine-learning ai artificial-intelligence multi-agent knowledge-graph knowledge-base self-learning ai-agents rag autonomous-research llm anthropic prompt-compression agent-skills agent-memory claude-code context-engineering openclaw

Updated Apr 9, 2026
TypeScript

centminmod / or-cli

Sponsor

Star

Python command-line tool for interacting with AI models through the OpenRouter API/Cloudflare AI Gateway, or local self-hosted Ollama. Optionally support Microsoft LLMLingua prompt token compression

openai linkup opik rag openai-api txtai llms llm-inference openrouter ollama cloudflare-ai ollama-api prompt-compression structured-outputs openai-api-client openrouter-api cloudflare-ai-gateway ai-rag llmlingua

Updated Dec 28, 2025

sriinnu / clipforge-PAKT

Star

Lossless-first prompt compression for JSON, YAML, CSV, and Markdown. Library, CLI, MCP server, desktop app, and browser extension.

markdown cli yaml json csv mcp developer-tools lossless-compression llm pakt prompt-compression token-compression coding-agent

Updated May 11, 2026
TypeScript

NodeNestor / claude-rolling-context

Star

Rolling context compression for Claude Code — never hit the context wall. Auto-compresses old messages while keeping recent context verbatim. Zero config, zero latency. Works as a Claude Code plugin.

claude ai-agent anthropic context-window context-management prompt-compression context-compression llm-context ai-coding claude-code claude-code-plugin claude-code-extension rolling-context

Updated Mar 10, 2026
Python

bladysh / exprompt

Star

Reverse T9 for LLMs. Free, open-source prompt compressor for your AI prompts and agents.

cli golang openai developer-tools agents codex text-compression claude llm prompt-engineering llms chatgpt anth prompt-compression

Updated May 17, 2026
Go

napmany / cutia

Star

CUTIA: compress prompts while preserving quality

dspy prompt-engineering prompt-compression

Updated Feb 2, 2026
Python

g-akshay / ClaudeShrink

Sponsor

Star

A Claude Code skill that shrinks massive prompts and files using LLMLingua to save tokens.

skills developer-tools claude ai-tools context-window prompt-compression llmlingua claude-code token-optimization claude-skills

Updated Apr 25, 2026
Python

pleasedodisturb / awesome-llm-token-optimization

Star

A curated list of strategies, tools, papers, and resources for reducing LLM token costs and improving efficiency in production.

Updated May 17, 2026

AybarsBarut / Nexus-APCP

Star

AI-assisted context management and prompt compression toolkit for developer productivity, ADR workflows, and LLM token optimization.

Updated May 21, 2026
Python

therohanparmar / t3-toon

Star

TOON for TYPO3 — a compact, human-readable, and token-efficient data format for AI prompts & LLM contexts. Perfect for ChatGPT, Gemini, Claude, Mistral, and OpenAI integrations (JSON ⇄ TOON).

Updated Mar 2, 2026
PHP

kaistAI / GenPI

Star

This repository is the official implementation of Generative Context Distillation.

agent distillation prompt-injection prompt-compression prompt-internalization context-distillation

Updated May 10, 2025
Python

gladehq / claude-shorthand

Star

LLMLingua-2 prompt compression hook for Claude Code — cut token usage by ~55%

macos linux cli developer-tools token claude prompt-tuning llm prompt-engineering prompt-compression llmlingua token-optimization claudecode claudecode-hooks claudecode-plugin

Updated Mar 16, 2026
Python

VDADev2022 / token-diet

Star

Advanced token reduction and prompt optimization framework for LLMs, featuring linguistic, algorithmic, and architectural patterns.

ai nlp-resources ai-development llm prompt-engineering generative-ai llm-tools token-reduction token-usage llm-optimization context-management prompt-compression agentic-ai llm-efficiency claude-skills claude-skill ai-cost-savings

Updated Apr 25, 2026

contextcrunch-ai / contextcrunch-python

Star

Compress LLM Prompts and save 80%+ on GPT-4 in Python

python api llm prompt-compression

Updated Jan 17, 2024
Python

ksm26 / Prompt-Compression-and-Query-Optimization

Star

Enhance the performance and cost-efficiency of large-scale Retrieval Augmented Generation (RAG) applications. Learn to integrate vector search with traditional database operations and apply techniques like prefiltering, postfiltering, projection, and prompt compression.

Updated Jul 23, 2024
Jupyter Notebook

d4551 / piratebao

Star

PirateBao is a TypeScript/Bun agent-skill package for terse pirate-speak AI coding replies that preserve technical detail while cutting filler, with hooks, compressor CLI, OpenCode/Codex/Claude/Gemini cargo, .bao validation, npmjs gates, and token eval checks.

cli typescript ai opencode npm-package codex ai-agents bun bao prompt-compression gemini-cli agentic-ai ai-skills claude-code token-efficiency coding-agent

Updated Apr 13, 2026
TypeScript

Kelpejol / prompt-compression-gateway

Star

API gateway for LLM prompt compression with policy enforcement built on LLMLingua. Demonstrates cost control, prompt safety, and LLM execution boundaries.

python api-gateway fastapi llm prompt-compression

Updated Dec 26, 2025
Python

Improve this page

Add a description, image, and links to the prompt-compression topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the prompt-compression topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

prompt-compression

Here are 49 public repositories matching this topic...

open-compress / claw-compactor

jia-gao / leanctx

atjsh / llmlingua-2-js

chappyasel / meta-kb

centminmod / or-cli

sriinnu / clipforge-PAKT

NodeNestor / claude-rolling-context

bladysh / exprompt

napmany / cutia

g-akshay / ClaudeShrink

pleasedodisturb / awesome-llm-token-optimization

AybarsBarut / Nexus-APCP

therohanparmar / t3-toon

kaistAI / GenPI

gladehq / claude-shorthand

VDADev2022 / token-diet

contextcrunch-ai / contextcrunch-python

ksm26 / Prompt-Compression-and-Query-Optimization

d4551 / piratebao

Kelpejol / prompt-compression-gateway

Improve this page

Add this topic to your repo