GitHub - TheProxyCompany/proxy-structuring-engine: Guaranteed Structured Output from any Language Model via Hierarchical State Machines

Guaranteed structured outputs from any language model.

Eliminate 100% of schema violations and state tracking failures in your LLM applications.

The Problem: Unreliable LLM Outputs

Raw LLM outputs are structurally unpredictable, breaking systems that require specific formats (API calls, tool use, structured data). Prompting for structure is inconsistent; post-processing is inefficient and often fails on complex or nested formats.

The Solution: Runtime Structural Enforcement

The Proxy Structuring Engine (PSE) provides guaranteed structural validity by enforcing constraints during LLM generation.

Define your required output structure using Pydantic, JSON Schema, function signatures, or PSE's composable types. PSE compiles this into a high-performance Hierarchical State Machine (HSM). Integrated into the generation loop, the HSM engine guides the LLM, ensuring every generated token conforms to the defined structure.

Result: Structurally perfect output, generated correctly the first time.

Core Capabilities

100% Structural Guarantee: Eliminate schema violations, parsing errors, and malformed outputs. Enables reliable downstream processing and state management.
Handles Complexity & Recursion: Reliably generate deeply nested JSON, valid code, or custom recursive formats via the core HSM engine.
Flexible Schema Definition: Configure instantly using Pydantic models, JSON Schema, Python function signatures, or compose custom structures with pse.types.
Robust & Resilient: Built-in Token Healing recovers from minor tokenization artifacts. Principled path selection resolves ambiguity deterministically.
High-Performance C++ Core: Optimized HSM engine delivers guaranteed structure with minimal latency (~20ms/token overhead). (Benchmarks)
Model & Framework Agnostic: Integrates with any local LLM stack via standard logits processing (process_logits) and sampling (sample) hooks. Optional mixins simplify transformers integration (PyTorch, TF, JAX).
Advanced Grammar Composition: Use pse.types (Chain, Loop, Any, etc.) to build custom HSMs for bespoke structural requirements beyond standard schemas.

Installation

pip install pse

or

uv add pse

(Installs the pse Python library and its required pse-core dependency. See Installation Docs for framework extras and setup)

Quickstart: Pydantic to Guaranteed JSON

This example demonstrates generating JSON output matching a Pydantic schema using transformers.

import torch
from transformers import AutoTokenizer, LlamaForCausalLM
from pydantic import BaseModel
from pse import StructuringEngine
from pse.util.torch_mixin import PSETorchMixin # Optional: Mixin for easy HF integration

# 1. Define structure
class UserProfile(BaseModel):
    user_id: int
    username: str
    is_active: bool
    roles: list[str]

# 2. (Optional) Apply PSE mixin to model class
class PSE_Llama(PSETorchMixin, LlamaForCausalLM): pass

# 3. Load model & tokenizer
model_path = "meta-llama/Llama-3.2-1B-Instruct" # Example
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = PSE_Llama.from_pretrained( # Or your base model class
    model_path, torch_dtype=torch.bfloat16, device_map="auto"
)
if tokenizer.pad_token is None: tokenizer.pad_token = tokenizer.eos_token
model.config.pad_token_id = tokenizer.pad_token_id

# 4. Init & Configure Engine (Mixin attaches as model.engine)
model.engine = StructuringEngine(tokenizer)
model.engine.configure(UserProfile) # Compile structure to HSM

# 5. Create prompt & input IDs
prompt = f"Generate a user profile: ID 999, username 'tester', active true, roles ['qa', 'dev']. Output ONLY the JSON."
messages = [{"role": "user", "content": prompt}]
input_ids = tokenizer.apply_chat_template(
    messages, return_tensors="pt", add_generation_prompt=True
).to(model.device)

# 6. Generate (PSE hooks applied via mixin or manually)
output_ids = model.generate(
    input_ids, max_new_tokens=150, do_sample=True,
    # Manual hook example (if not using mixin):
    # logits_processor=[model.engine.process_logits], sampler=model.engine.sample
)

# 7. Decode & Parse (Guaranteed by PSE)
output_text = tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True)
structured_output: UserProfile | None = model.engine.get_structured_output(UserProfile)

print("Raw Output (Guided by PSE):\n", output_text)
print("\nParsed Pydantic Object:\n", structured_output)
# Expected: UserProfile(user_id=999, username='tester', is_active=True, roles=['qa', 'dev'])

See the examples/ directory for more use cases, including JSON Schema and custom StateMachine composition.

Why PSE?

PSE's runtime HSM enforcement offers fundamental advantages over other methods:

Capability	PSE (Runtime HSM)	Prompting / Retries	Regex / Post-Proc.	Simple Masking
Guaranteed Structure	100%	Probabilistic	Fixes Errors	Flat Only
Complex/Nested	Handles Natively	Brittle / Fails	Impractical	Cannot Handle
Recursion	Handles Natively	No	No	No
Reliability	Production Grade	Low	Error-prone	Brittle
Efficiency	Optimized C++	Retries Cost	Slow	Fast (Simple)
Token Healing	Built-in	N/A	N/A	Breaks

Integration

Integrate PSE into your generation loop via two hooks:

logits_processor=[engine.process_logits]
sampler=engine.sample (wraps your base sampler)

Works with PyTorch, TensorFlow, JAX, MLX, etc. Optional transformers mixins simplify integration. (See Docs for details)

Foundation for Reliable Agents

PSE provides the structural guarantees required for reliable agentic systems, powering the Proxy Base Agent (PBA) for dependable state transitions and tool use.

License & Source Availability

pse (Python Library): Open source under Apache 2.0 (LICENSE). Provides the Python interface, schema parsing, and integration logic.
pse-core (C++ Engine): Required dependency. Distributed as a pre-compiled binary. Contains the high-performance HSM execution core. Source code is proprietary (The Proxy Company, Patent Pending).

Contact & Support

Issues/Bugs: GitHub Issues
Commercial Services: The Proxy Company Business Services (Integration, Custom Development, Support)

Citation

@software{Wind_Proxy_Structuring_Engine_2025,
  author    = {Wind, Jack},
  title     = {{Proxy Structuring Engine: Guaranteed Structured Output from Language Models via Runtime Hierarchical State Machine Enforcement}},
  year      = {2025}, # Adjust year if needed
  publisher = {The Proxy Company},
  version   = {2025.06.1}, # Update version as needed
  date      = {2025-04-15}, # Update release date
  url       = {https://github.com/TheProxyCompany/proxy-structuring-engine}
}

Name		Name	Last commit message	Last commit date
Latest commit History 371 Commits
.github		.github
examples		examples
pse		pse
tests		tests
.coverage		.coverage
.coveragerc		.coveragerc
.gitignore		.gitignore
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
logo.png		logo.png
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Repository files navigation

Guaranteed structured outputs from any language model.

The Problem: Unreliable LLM Outputs

The Solution: Runtime Structural Enforcement

Core Capabilities

Installation

Quickstart: Pydantic to Guaranteed JSON

Why PSE?

Integration

Foundation for Reliable Agents

License & Source Availability

Contact & Support

Citation

About

Uh oh!

Releases 28

Sponsor this project

Uh oh!

Uh oh!

Contributors 2

Uh oh!

Languages

Uh oh!

License

TheProxyCompany/proxy-structuring-engine

Folders and files

Latest commit

History

Repository files navigation

Guaranteed structured outputs from any language model.

The Problem: Unreliable LLM Outputs

The Solution: Runtime Structural Enforcement

Core Capabilities

Installation

Quickstart: Pydantic to Guaranteed JSON

Why PSE?

Integration

Foundation for Reliable Agents

License & Source Availability

Contact & Support

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 28

Sponsor this project

Uh oh!

Uh oh!

Contributors 2

Uh oh!

Languages