Documentation
CodexPDF docs
Generated from the sibling codex-pdf repository docs and README.
Getting started
Overview
Authoritative read-only PDF facts engine for Think Neverland tools. Versioned contract, schema-validated output, and consumer-agnostic extraction.
Architecture
codexPDF boundaries, extraction pipeline shape, and the contract-first model used by downstream tools.
CLI
Command reference for extract, schema, validate, probe, and parity workflows.
Reference
Deploy
Run codex-pdf as a Railway service (shared and per-consumer sidecars), or any container host.
Parity
Projection-based parity checks used to compare codex output with external baselines.
Preflight Ingest
Adapters that normalize external preflight reports into codex issue payloads.
codex-pdf 1.0.0 release notes
- Publishes the first stable codex-pdf major release. - Promotes the parity corpus from cross-engine-only evidence to dual-corpus coverage. - Captures machine-readable parity reports under reports/parity/.
codex-pdf contract
codex-pdf is the read-only PDF facts + render service for the Think Neverland tooling family (lint-pdf, loupe-pdf, the marketing demos, and the upcoming Forge producers). This document is the canonical pointer for…
Service Ownership Contract
This contract defines ownership boundaries across the three OSS services:
Project
Discovery Audit
Initial cross-repo parsing inventory used to design codexPDF migration boundaries and ownership.
Migration Plan
Phased rollout plan for moving PDF fact extraction from downstream engines into codexPDF.
Backward Compatibility
Consumer payload compatibility expectations during codexPDF rollout and cutover.
Cleanup Stop Gates
Release gates required before downstream parser deletion and hard cutover enforcement.