codex-pdf
Structured PDF extraction API that turns complex files into consistent JSON.
Open source · self-host or managed
One home for the whole stack — extraction, preflight, browser review, conformance reporting, a content-addressed asset plane, and an integration hub. Focused, standalone tools that plug into the workflow you already run. Self-host the open source, or let us host any of it for you.
Every tool ships two ways · self-host from GitHub · or managed on host.withsynergy.io
The stack · open source
A toolkit of focused, standalone PDF utilities — extraction, preflight, viewing, conformance reporting, an asset plane, and an integration hub. Each one is an independent open-source service that plugs into the prepress workflow you already run. Self-host any of them, or let us host them for you.
codex-pdf: structured PDF extraction API that turns complex files into consistent JSON.
Detection-only PDF preflight engine — 500+ checks plus the PDF/X-4 conformance suite.
Embeddable PDF viewer with separations, TAC, layers, and annotation overlays.
GWG 2022 conformance assay — benchmark a preflight engine against the spec.
Content-addressed digital-asset plane — versioned blobs, a presigned data plane, and on-prem agent recall.
The print-data integration hub — canonical jobs, orders, and customers kept in sync across your MIS, ERP, and prepress tools.
Every tool is independent — nothing here ties them into a single platform. Browse the source on GitHub, or have us run any of it for you on host.withsynergy.io.
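Content addressing, as the asset plane uses the term, means a blob's key is derived from its bytes, so identical files always resolve to the same key. Here is a minimal Python sketch of the idea, assuming SHA-256 as the digest and an in-memory dict as the store. Both are illustrative choices, not the asset plane's documented design, and `BlobStore` is a hypothetical name.

```python
import hashlib


class BlobStore:
    """Toy content-addressed store: the key is the SHA-256 of the bytes."""

    def __init__(self) -> None:
        self._blobs: dict[str, bytes] = {}

    def put(self, data: bytes) -> str:
        # Assumption: the SHA-256 hex digest serves as the address.
        key = hashlib.sha256(data).hexdigest()
        # Storing the same bytes twice is a no-op, so duplicates cost nothing.
        self._blobs.setdefault(key, data)
        return key

    def get(self, key: str) -> bytes:
        return self._blobs[key]

    def __len__(self) -> int:
        return len(self._blobs)


store = BlobStore()
first = store.put(b"%PDF-1.7 example bytes")
second = store.put(b"%PDF-1.7 example bytes")  # identical content, same key
third = store.put(b"%PDF-1.7 revised bytes")   # a new version gets a new key
```

Because the key depends only on the content, a cache keyed this way never serves stale data: a changed file is simply a different key.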
Two ways to run it
Same tools, your choice of operating model. Start on your own infrastructure, or hand the running of it to us — move between the two whenever you like.
Clone the repos and run them yourself. Open source under AGPL — every tool is a standalone service with a public REST contract and its own docs. No platform lock-in: adopt one tool or the whole stack.
Don't want to run servers? We host the toolkit for you on host.withsynergy.io: secured, updated, scaled, and isolated per tenant. Sign up and connect the tools to your existing prepress automation in minutes.
How it fits together
The stack isn't a monolith. It's a set of focused services that share conventions — so they slot into the prepress workflow you already run, one piece at a time.
Every tool is a standalone service with its own REST contract. Adopt just preflight, or just the viewer — each earns its place on its own.
codex-pdf turns any PDF into normalized facts — fonts, color, images, geometry. The other tools read from it, so they agree on what's in the file.
Wire tools together over plain HTTP — extraction feeds preflight, the viewer proves the result, the asset plane stores it. Use as many or as few as you need.
Open source under AGPL, deterministic contracts, content-addressed caching. Run it on your own boxes forever — or hand the running of it to us.
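Wiring two tools together over plain HTTP can be sketched as follows. The URLs, paths, and JSON field names here are hypothetical placeholders, not the tools' published REST contracts, so check each tool's own docs for the real ones. The transport is passed in as a function so the flow can be exercised without a network.

```python
import json
import urllib.request
from typing import Callable

# Hypothetical endpoints: placeholders for illustration only.
EXTRACT_URL = "http://localhost:8001/extract"
PREFLIGHT_URL = "http://localhost:8002/preflight"

Post = Callable[[str, bytes, str], dict]


def http_post(url: str, body: bytes, content_type: str) -> dict:
    """POST bytes and decode a JSON response using only the standard library."""
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def run_pipeline(pdf: bytes, post: Post = http_post) -> dict:
    # Step 1: extraction turns the PDF into normalized facts (JSON).
    facts = post(EXTRACT_URL, pdf, "application/pdf")
    # Step 2: preflight reads those facts instead of re-parsing the file.
    payload = json.dumps(facts).encode("utf-8")
    report = post(PREFLIGHT_URL, payload, "application/json")
    return {"facts": facts, "report": report}


def fake_post(url: str, body: bytes, content_type: str) -> dict:
    # Stand-in for real services so the sketch runs offline.
    if url == EXTRACT_URL:
        return {"fonts": ["Helvetica"], "pages": 1}
    return {"checked": json.loads(body), "findings": []}


result = run_pipeline(b"%PDF-1.7", post=fake_post)
```

Swapping `fake_post` for the default `http_post` would send the same two requests to running services, assuming they accept these placeholder routes.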
Self-host the open source from GitHub, or let us run it for you on the managed platform. Either way, you only adopt the pieces that fit the workflow you already have.