CodexPDF CodexPDF

Open source

Four repos. One ecosystem.

Mix and match extraction, preflight, review, and assay layers without vendor lock-in.

codex-pdf

beta

Authoritative, versioned PDF facts contract for Print with Synergy tools.

codex-pdf is a read-only extraction engine that publishes a stable CodexDocument contract. Downstream systems consume canonical PDF facts without re-parsing files in multiple codepaths.

  • Contract-first JSON output rooted at CodexDocument
  • Schema-validated payloads with SemVer compatibility policy
  • CLI workflows for extract, probe, validate, and parity
  • Consumer-agnostic output for adapters and downstream engines
  • AGPL open source with typed Python models

loupe-pdf

beta

Embeddable PDF viewer with separations, TAC, layers, and annotation overlays.

loupe-pdf is a browser-native review surface for print workflows. It consumes PDF data and service adapters to give reviewers high-fidelity visual tooling in web apps.

  • Embeddable React viewer architecture
  • Separations, TAC, layers, and review tools
  • Plugin and theming support for host products

lint-pdf

beta

Detection-only PDF preflight engine with deep standards coverage.

lint-pdf checks document conformance and quality against preflight rulesets while preserving original files.

  • 500+ checks across common production concerns
  • PDF/X-focused conformance workflows
  • CLI and API service integration options

assay-pdf

beta

PDF assay and metadata reporting for structural inventory.

assay-pdf inventories what's in a PDF so downstream engines can validate, compare, and reason over document structure reliably.

  • Per-page and document-level inventories
  • Font, color, image, and structure reporting
  • Schema-friendly output for downstream tools