SDKs and tooling

Python SDK for document extraction.

Use Python to submit PDFs, poll jobs, receive webhooks, inspect confidence scores, and push extracted JSON into data pipelines or review tools.

When to use Python

Python is a good fit for data pipelines, batch backfills, evaluation sets, reconciliation jobs, internal tools, and workflow workers that process documents outside the request path.

Production notes

Use async jobs for long packets, verify webhook signatures before trusting callbacks, persist job IDs, and route low-confidence fields to review instead of overwriting downstream records.

from cogneris_ai import Cogneris

client = Cogneris(api_key=os.environ["COGNERIS_API_KEY"])
result = client.extract(
    file="invoice.pdf",
    template="invoice",
    include_bounding_boxes=True,
)

print(result.data["invoice_number"].value)

Extract

Turn PDFs, scans, photos, and document packets into structured records.

Validate

Apply confidence thresholds, business rules, and cross-document checks.

Route

Send clean results downstream and uncertain fields to review queues.

Related pages