indentia.ai

Release notes

What's new — by capability.

Each release is organised by product area, not by engineering task. If you're looking for a specific feature, use your browser's find function (⌘ F / Ctrl F).

Nederlandse versie →

June 2026

Summer 2026

Enterprise Search

Government Q&A inline

Parliamentary questions and official decisions are now rendered directly inside search results — question and answer visible without leaving the interface. Linked to the responsible person's profile, the originating dossier and the source document.

Person profiles with linked documents

Searching for a minister or official now surfaces a profile panel with all associated parliamentary contributions, publications and decisions — grouped by date and type.

Shareable deep-links for searches

Filter state is now stored in the URL. A search with active source and date filters produces a link that opens the exact same result set for anyone with access.

Term explanation panel

Recognised terms in a search query trigger an inline explanation panel drawn from the organisation's own terminology graph — without leaving the search results.

Configurable per-source timeouts

Each data source can now have its own query timeout, preventing a slow connector from blocking results from faster sources.

Response cache without external dependencies

Search results are now cached in-process. Repeated queries return in milliseconds without requiring Redis or any other external caching service.

Document Intelligence

Intelligent multi-parser routing

The platform now selects the best document extraction engine automatically for each file type, using a self-learning routing model that improves with every conversion. Multiple engines race in parallel; the highest-quality result wins.

LLM quality judge

Extraction results are evaluated by a language model that scores layout fidelity, text completeness and structure accuracy — and triggers a re-extraction via a different engine if the threshold is not met.

Expanded file type support

The document conversion pipeline now accepts all major office formats, scanned PDFs, images, spreadsheets and presentations through a single upload interface.

Bilingual conversion wizard

The document conversion interface is now fully available in both Dutch and English, with a shared language switcher in the header.

Goal-aware extraction modes

Extraction can now be configured per job: RAG mode prioritises clean, chunked text output; extraction mode preserves tables, structure and layout metadata.

Lexicon — Terminology Management

New product: Lexicon

Lexicon is a dedicated product for managing the organisation's business vocabulary. Terms are proposed by users, reviewed by stewards and formally approved — or deprecated — by domain owners.

Terminology lifecycle

Each term has a full lifecycle: draft → under review → approved → deprecated. Every status change is logged with actor, timestamp and rationale.

SHACL validation

Approved terms are validated against the organisation's SHACL shapes before publication, ensuring structural consistency with the business ontology.

Gap detection

Lexicon analyses incoming documents and queries against the approved vocabulary and surfaces terms that appear frequently but have no approved definition yet.

DataHub integration

The approved vocabulary is automatically synchronised to DataHub as a governed glossary, making terminology available across the data catalogue without manual duplication.

Usage analytics

Each term tracks where it is used — in which documents, queries and agent responses — giving stewards a clear picture of vocabulary reach and adoption.

IndentiaDB — Knowledge Graph

SPARQL 1.2 federated queries

IndentiaDB now supports the W3C SPARQL 1.2 specification, including the SERVICE directive for cross-graph federation. A single query can now span multiple named graphs across tenant boundaries under controlled access.

RDF 1.2 compliance

Support for RDF 1.2 triple terms (reification), native directional language strings and a full W3C conformance harness. Indentia is now compliant with the latest W3C graph standards.

Production-grade durability

The write-ahead log now uses fsync on every commit, with CRC32 checksums per record and automatic torn-tail repair on startup. A committed write is guaranteed to survive a power loss, kernel panic or unclean shutdown — no data is ever partially written.

Full-text search with facet aggregations

The integrated full-text engine now supports field-level facet aggregations with year histograms alongside the existing keyword and fuzzy search — enabling drill-down filtering without a separate search engine.

Native fuzzy matching

Fuzzy full-text queries now use a native Damerau-Levenshtein engine that correctly handles transpositions and adjacent character swaps — the most common typo patterns in Dutch.

Hard query memory limits

Each query now runs under a configurable memory ceiling with a background watchdog. Queries that exceed the limit are terminated cleanly rather than causing OOM pressure on the host.

Agent Orchestrator

Quest board

The Agent Orchestrator now includes a quest board — a structured task management interface where agents receive assignments, track progress and report completion, with full audit history per quest.

WebSocket terminal proxy

Operators can now connect a live terminal session to any running agent sandbox over WebSocket, enabling real-time inspection and debugging without breaking sandbox isolation.

Multi-tenant identity bootstrapper

Each tenant now gets a dedicated ServiceAccount with scoped credentials, provisioned automatically at tenant creation. OPA policies govern what each agent identity is permitted to do within its tenant boundary.

MCP server configuration per session

Each agent session can now load a specific set of MCP (Model Context Protocol) server connections from configuration, making the set of available tools explicit and auditable per run.

Data Sources & Connectors

Rijksoverheid.nl connector

A new connector ingests publications, news items and policy documents from Rijksoverheid.nl with bitemporal semantics — meaning you can query the platform's view of the government website at any historical point in time.

Tweede Kamer documents

Parliamentary papers and Kamerstukken are now ingested directly in binary format, parsed by the document intelligence pipeline and indexed with full provenance metadata.

Source specification registry

All data source connectors now declare a machine-readable specification: schema, capabilities, supported extraction modes and quality contract. This registry is used by the platform to route data and by teams to onboard new sources consistently.

Cooperative backpressure

Connectors can now signal the ingestion pipeline to pause or slow down when upstream systems are under load, preventing connectors from overloading source APIs during peak hours.

Native Dutch Language Support

RobBERT-2022 embedding service

The platform now ships a dedicated Dutch embedding model based on RobBERT-2022 (768-dimensional). This model understands Dutch morphology, compound words and domain vocabulary significantly better than translated or multilingual alternatives.

Stay up to date

Want to see these features on your own data?

Book a demo or talk to us about a 2–4 week pilot on two or three of your own sources.

Book a demo Explore the platform →