Use case · Receipts

Receipt extraction. From any photo.

Expense reports break on the same edge cases every time: faded thermal print, glare on a smartphone photo, foreign currencies, and itemized restaurant tabs that nobody wants to type in. Cogneris extracts the merchant, line items, taxes, and totals from any receipt format in under 2 seconds — and pushes the result straight to Concur, Brex, or Ramp.

Why receipts are hard

Receipts are the worst-quality input in the document-AI pipeline. Thermal paper fades within months. Smartphone photos arrive crumpled, rotated, or with a finger in the frame. Restaurant receipts split lines across tax, tip, and total in unpredictable orders. International receipts add currencies, languages, and tax regimes most parsers don't handle. OCR-only systems sit at 75–80% field accuracy on receipts and collapse on damaged images — which means employees spend their post-trip evenings re-typing data into expense forms.

How Cogneris does it

Cogneris's classifier identifies the receipt type (retail, restaurant, taxi, hotel folio, fuel, parking, freight) and routes to a specialist extractor. Image preprocessing handles glare, rotation, perspective skew, and partial folds. The extractor parses merchant identity, line items, tax breakdown by rate, payment method, and timestamps with per-field confidence. The validator runs arithmetic checks (line items + tax + tip = total) and flags receipts that don't reconcile before they hit your expense system.

Sample extraction output

doc_typeReceipt — Restaurant
merchantTrattoria del Borgo
merchant_addressVia Roma 42, 50125 Firenze, IT
vat_idIT 03152470485
transaction_date2026-04-18 20:42
line_items6 items parsed
subtotalEUR 84.00
taxEUR 8.40 (10% IVA)
tipEUR 7.60
totalEUR 100.00
currencyEUR
payment_methodVisa ****4421
arithmetic_check✓ subtotal + tax + tip = total
duplicate_checkno duplicate found
policy_flagsnone — within meal limit
confidence0.96 → auto-categorized

What you get out of the box

Every receipt type

Retail, restaurant, taxi, ride-share, hotel folio, fuel, parking, freight. Thermal print, smartphone photo, email PDF, screenshot.

Damaged-image resilience

Faded thermal paper, glare, rotation, perspective skew, partial folds. Field-level confidence drops on damage so AP can route for review.

Multi-language & multi-currency

30+ languages with the original text preserved alongside an English-normalized field set. Currency stays original — your policy engine converts.

Duplicate detection

Merchant + amount + timestamp + last-4 of card. Catches resubmissions and the same-meal-two-attendees pattern with the differences surfaced.

Receipt fields and JSON shape

Receipt OCR API comparisons usually focus on field coverage and mobile-photo resilience. Cogneris returns merchant details, transaction date/time, line items, tax breakdown, tip, total, payment method, duplicate status, policy flags, confidence, and citations.

{
  "document_type": "receipt",
  "fields": {
    "merchant_name": { "value": "Trattoria del Borgo", "confidence": 0.98 },
    "transaction_date": { "value": "2026-04-18T20:42:00", "confidence": 0.96 },
    "total": { "value": 100.00, "currency": "EUR", "confidence": 0.99 },
    "payment_last4": { "value": "4421", "confidence": 0.94 }
  },
  "line_items": [{ "description": "Dinner", "amount": 84.00 }],
  "policy_flags": []
}

SDK links for developers

Use the Node.js SDK, Python SDK, or REST API reference to push receipt data into expense apps, card transaction matching, or a custom reimbursement portal.

Integration patterns

Cogneris pushes structured receipt data into the expense systems T&E teams already run. SAP Concur — direct expense-line creation with attachments. Brex, Ramp, Navan — receipt match against the card transaction with policy-flag passthrough. Expensify, Spendesk — native connectors with field-mapping templates. NetSuite Expense Reports — for organizations running T&E inside the ERP. Custom integrations — drop normalized JSON into the REST API or use webhooks for event-driven flows.

Compliance & trust

Receipts contain payment-card data and personal expense patterns. Cogneris masks card numbers in audit metadata by default (only the last 4 digits are returned), retains documents encrypted at rest with per-tenant keys, and offers configurable retention from 0 to 7 years to meet local tax-record obligations. See our trust page for the full posture: encryption, tenant isolation, sub-processors, GDPR DPA, CCPA, SOC 2 Type II in progress, and HIPAA BAA on Enterprise.

Get started

Pay-per-page pricing means you can start an evaluation today without an annual commit. Most teams ship their first receipt extraction into production within a week and reach steady-state accuracy on their employee-receipt mix in under 30 days.

Related extractors

Cogneris extracts dozens of structured document types. The closest neighbors to receipt extraction:

  • Invoice extraction — vendor, line items, GL-code hints, and totals for AP automation.
  • Bank statement extraction — transaction-level parsing used to reconcile expense reports against card and account statements.
  • Payroll extraction — pay stubs and W-2s often grouped with receipts in employee reimbursement and onboarding workflows.

For broader context, see the IDP buyer's guide, the 2026 State of Document AI report, or estimate ROI at your volume.