Abstract
Policy and political work is knowledge work under pressure. It runs on cables, reports, transcripts, statutes, and briefs that an institution accumulates faster than any one analyst can read. The work breaks at three places: when context fragments across tools, when reasoning fails to inherit from the analyst who leaves to the desk that stays, and when the AI on top hallucinates the structure rather than reading it. We propose an AI knowledge layer as the fix: a typed, source-bound, time-ordered graph that turns institutional reading into reasoning the next analyst, the next desk, and the next model can inherit. The layer is built on a kernel ontology with dynamic extensions — eight universal primitives that do not change, plus per-case subclasses learned from the work and validated against the kernel. Inference runs through Ontology-Augmented Generation (OAG): typed, grounded in source spans, contested by design, temporally honest. Conflict reasoning is the first specialization we measure (TCGC) because it stresses every property of the layer at once. Policy options memos, stakeholder analysis, regulatory contestation, and mediation work are next. We invite the field to fork and improve everything.
1. The problem
Ask any policy desk officer, mission analyst, regulatory lawyer, mediator, or party staffer what they actually do all day, and the answer rhymes: read a lot of text, figure out who said what to whom and when, reconcile conflicting accounts, track commitments across time, and produce a structured artifact — a memo, a briefing, a SITREP, an options paper, a compliance review — that an institution can act on.
The structure is the product. The structure is also invisible. It lives entirely in the practitioner’s head, and when she leaves the institution it leaves with her. Almost everything she uses to do this work — email, spreadsheets, word processors, generic AI chat — was built for a different job.
The asymmetry shows up at three scales simultaneously: between the parties to a policy or political question, inside the institution that has to decide, and across the AI tools the institution uses to think. Generic chat-on-top-of-PDFs collapses all three into the same fluent surface, which is why senior practitioners read it once, find it impressive, and then return to their notes.
2. Inheritance as architecture
Institutional knowledge work has a specific failure mode that consumer software does not have to think about: handoff. The analyst who built the file rotates out. The desk officer who held the dossier moves on. The mediator who ran the last round retires. What gets inherited, today, is a folder of PDFs and a few decks. What needed to be inherited is the reasoning — which actor owns which interest, which commitment is still active, which event triggered which policy shift, which narrative stopped being credible last March.
Generic AI assistants do not solve this. They store text and generate text. They do not maintain typed structure across sessions. The 2025 work on context engineering [24]and Letta’s Context-Bench [25] document the same failure mode formally: as agents accumulate self-generated context, accuracy decays in ways that look fluent and read confident. An institution that stores its reasoning in a chat transcript is paying that bill twice — once on the way in, once on the way back out.
The knowledge layer treats inheritance as a first-class architectural property. Every claim is a typed object with a source span. Every commitment is bi-temporal. Every extension a particular case taught the system is logged, named, and reviewable. The graph survives the analyst. The next desk reads the graph, not the rumor of it. Adjacent work in temporally evolving knowledge graphs (MedKGent 2025 [18], Graphiti / Neo4j 2025 [26]) builds the same shape in medicine and agentic memory; the policy and political domain has not had its version of this yet. We are building it.
This is the move that distinguishes a knowledge layer from a chatbot. A chatbot is a query interface. A knowledge layer is the institution’s typed reasoning, persisted, contestable, and inheritable.
3. What the literature already knows
Three bodies of literature converge on the same finding: institutional reasoning is structured, not noise. Conflict theory — Fisher and Ury [1], Glasl [2], Galtung [3], Axelrod [4] — names actors, claims, interests, positions, escalation, and cooperation as recurring components of any dispute. Moral-foundations work from Haidt [5] and value-conflict research from Sunstein [6] give us the frames people argue inside.
Public administration and policy-process research adds the institutional layer: the same primitives that organize a dispute also organize a policy file — stakeholders, claims of evidence, regulatory constraints, leverage, commitments to deadlines, events on the calendar, contested narratives. The Kingdon multiple-streams view, the advocacy-coalition framework, and the punctuated-equilibrium tradition all rely on these objects without typing them.
The literature converges on a finding that nobody has yet operationalized: policy and political reasoning is a typed structure. The same primitives appear in workplace mediation, commercial arbitration, multilateral negotiation, legislative analysis, regulatory contestation, and intergenerational family disputes. The grammar travels. Until now, it has lived in the analyst’s head and left the institution when she did.
4. Kernel + dynamic extensions
The closed-ontology failure mode. SNOMED, FIBO, LKIF, the Gene Ontology — extraordinary in their domains, rigid by design, unable to extend without committee. Policy and political work is a dirtier domain. A Commitment in HR mediation, a ceasefire in a peace process, a vendor SLA, a regulatory pledge, and a campaign promise are all Commitments — and they have entirely different shapes. A fixed ontology that flattens those into one schema either over-specializes (loses domain transfer) or under-specializes (loses the case). Bian (Oct 2025) [19] names schema rigidity as one of the three enduring challenges in classical knowledge-graph construction.
The schema-free failure mode. Pure agentic graph builders without a backbone — see the agentic-deep-graph-reasoning literature [17] — produce locally coherent but globally inconsistent graphs. Two analysts looking at the same file end up with two ontologies. Provenance survives, interoperability dies. Inheritance becomes impossible.
The kernel-with-extensions third way.Eight primitives at the core (actor, claim, interest, constraint, leverage, commitment, event, narrative). Universal across cases. Versioned. Forkable. Below the kernel: domain-specific subclasses learned per case (HR-Commitment, Ceasefire-Commitment, SLA-Commitment, RegulatoryCommitment, CampaignCommitment), validated against the kernel’s invariants, logged with provenance, reviewable by the practitioner. The LLMs4OL 2025 challenge on automated ontology induction [20], OntoEKG (Feb 2026) [22], and MedKGent (2025) [18] all point in this direction.
Why this is the right bet for policy and political work specifically. Policy and political work is the highest-stakes knowledge domain that has no operationalized ontology. Medicine has SNOMED. Law has LKIF. Finance has FIBO. Policy and political work — the domain that runs on prose, memory, and hand-written briefings — has none. Building this kernel openly, in 2026, is the most useful thing TACITUS Research can do.
THE EIGHT PRIMITIVES
Actor (who participates), Claim (what is asserted), Interest (what lies beneath), Constraint (what limits outcomes), Leverage (what holds the power), Commitment (what was agreed), Event (what happened when), Narrative (what frames it). Read the live grammar with worked examples — including a single Commitment rendered in three domains — at /research/grammar.
5. Bi-temporal grounding
Valid time vs. transaction time.A commitment was made on Monday (valid time). The desk learned about it on Friday (transaction time). On Saturday, one party retroactively claims the commitment was actually a “tentative idea” — a new claim, made now, about the past. A static graph cannot represent this. A bi-temporal graph can — and must — for the data to be honest.
Why this is structurally different from event sourcing. Event sourcing tracks system state. Bi-temporal grounding tracks truth state across time. MedKGent (2025) [18] on temporally evolving medical knowledge graphs and the Graphiti / Neo4j 2025 architecture [26] articulate the same shape in adjacent domains. Policy and political work is, if anything, more demanding: positions shift, commitments are renegotiated, events are reinterpreted, narratives drift. A knowledge layer that pretends Monday was Tuesday is not legible.
The minimum viable bi-temporal claim. Every claim in the TACITUS graph carries: source span, asserter, valid-time interval, transaction-time interval, contestation status. A claim can be true-then-revised, true-then-contested, asserted-then-disclaimed. None of those is an error; all of those are the data the next analyst needs to inherit.
6. Ontology-Augmented Generation
Definition. Ontology-Augmented Generation (OAG) is a generation pattern in which a language model produces output that is (i) typed — every produced object is an instance of a kernel primitive or a validated extension; (ii) grounded — every claim cites the source span it was extracted from; (iii) contested — counter-claims and contradictions are first-class objects in the graph the LLM consults, not noise to be averaged away; (iv) temporally honest — every claim carries valid-time and transaction-time stamps the LLM is required to respect.
| Pattern | Schema | Provenance | Contestation | Temporality |
|---|---|---|---|---|
| RAG | None | Document-level | Averaged | Flattened |
| GraphRAG | Auto-induced, flat | Entity-level | Averaged | Mostly flattened |
| OG-RAG | Domain ontology, fixed | Entity-level | Not modeled | Not modeled |
| Think-on-Graph | Domain KG, fixed | Path-level | Not modeled | Not modeled |
| OAG (TACITUS) | Kernel + dynamic extensions | Span-level, bi-temporal | First-class objects | Bi-temporal |
In order: Lewis et al. 2020 on RAG, Edge et al. 2024 on GraphRAG [12], Sharma et al. 2025 on OG-RAG [15], Sun et al. 2024 on Think-on-Graph [16], and Hou et al. 2024 on WikiContradict [13].
Worked example.A regulatory consultation produces a Commitment from the agency on Monday: “the rule will be redrafted before Q3.” A trade body acknowledges ambiguously the same morning. On Thursday, the agency spokesperson denies scope: “the redraft is exploratory.” OAG types each speech act against the kernel, stamps the bi-temporal intervals, and creates a contestation edge between the original Commitment and the later denial — instead of averaging the three turns into a smoothed paraphrase. The structure is what the desk inherits.
APPENDIX · PRODUCTS AS DEMONSTRATIONS
PRAXIS is the AI analyst workbench that surfaces the knowledge layer to professionals — desks, missions, mediation teams, policy units. DIALECTICA is the engine. Wind Tunnel and CONCORDIA are side projects that reuse the same backbone for reception modeling and live deliberation. Every product reads and writes the same graph. The point of the products is to show the layer works in production, not to be the research artifact.
7. Contestability as architecture
Most ontologies assume an authoritative voice.Medical: there is one truth about a diagnosis. Legal: a contract has one binding interpretation (eventually). Telecom: a network configuration is or isn’t valid.
Policy and political work has no authoritative voice by design. The domain is the disagreement. Forcing a resolution at the schema layer is a category error.
Holding disagreement structurally. The TACITUS graph carries: claim and counter-claim as paired objects; evidence and counter-evidence as typed edges; commitment and broken-commitment as bi-temporal events; narrative drift as a primitive in its own right. WikiContradict (Hou et al., 2024) [13] and ConflictBank [14] are the benchmarks that started naming this gap.
Why this matters for the practitioner. The system does not tell the desk officer, mediator, or analyst what is true. It shows them the disagreement shaped. The decision is human. The structure is the product.
8. Why LLMs alone cannot do this
Three structural failures, all architectural rather than training-data problems. Temporality is flattened: transformers see sequences, not timelines, and cannot reliably reconstruct when a commitment was made, broken, or renegotiated (Chen et al. 2023 [9]). Causality collapses to co-occurrence: multi-hop inferences of the form “A caused B via mechanism M under condition C” exceed what attention can reliably construct (Kıcıman et al. 2023 [10]). Provenance is absent by construction: every claim a generic LLM produces is, architecturally, an uncited assertion (Ji et al. 2023 [11]); RAG reduces but does not eliminate the problem because the generation step is still the same transformer.
Two newer failure modes round out the diagnosis. WikiContradict (Hou et al. 2024) [13]shows LLMs systematically mishandle conflicting evidence unless contradiction is represented explicitly. Letta’s Context-Bench (2025) [25] documents cascading agent failure under self-generated context: as agents accumulate their own outputs, accuracy decays in ways that look fluent and read confident. This is the failure mode that punishes institutions hardest — because handoff, the moment when one analyst hands the file to the next, is exactly when self-generated context is highest and ground truth is least available.
By 2026 these failures are not contested. The interesting question is no longer whether to add structure, but what kind, who maintains it, and how it inherits.
9. Specializations
The knowledge layer is a single architecture. Specializations are the per-domain extension stacks that ride on top of it. We work on each specialization in public, with its own benchmark and its own surface in PRAXIS.
- Conflict reasoning. The first specialization, because it stresses everything: time, causality, provenance, commitment tracking, interest/position separation, narrative drift, cross-actor contradiction. Measured by TCGC. v0.2 introduces dynamic-ontology task types: schema-extension induction, kernel-invariant validation, cross-domain primitive transfer.
- Policy options analysis. Stakeholder map, interest landscape, constraint inventory, leverage assessment, options memo. Same kernel, different extensions: PolicyOption, RegulatoryConstraint, JurisdictionalLeverage. Specialization benchmark in design.
- Mediation and ADR. Live deliberation, commitment tracking, narrative drift, agreement drafting. Surfaces in CONCORDIA. Builds on the same kernel; extension stack is HR-Commitment, AgreementClause, FacilitationEvent, RoomNarrative.
- Regulatory contestation. Consultations, rulemakings, legal challenges. Kernel + extensions for RegulatoryAct, StakeholderResponse, JudicialReviewEvent.
Each specialization is a public artifact: extension schema, worked examples, benchmark methodology where it exists, an open issue tracker. We do not promise every specialization at once. We promise the kernel underneath them is the same one, and that the work shows in public.
10. Risks
Misuse for adversarial analysis.The same engine that helps a mediator structure a workplace dispute helps a corporate adversary map a counterparty’s leverage; the same engine that helps a policy desk assemble an options memo helps a campaign assemble an opposition file. We limit early deployments to partners with stated, reviewable use cases. We do not provide the engine to actors whose declared purpose is one-sided coercion.
Institutional capture. The map can become a mechanism for whoever controls the platform to win. We mitigate by publishing the kernel, by logging every dynamic extension with provenance, and by keeping the graph forkable. Capture survives openness only if nobody else picks up the fork.
Schema drift. Per-case extensions, by design, can drift. A Commitment in HR mediation should not silently become a Commitment in a regulatory consultation. We address this with kernel invariants the dynamic extensions must validate against, and with a public review surface for proposed extensions. We do not pretend this problem is solved.
Training-data leakage. Policy and political data is, by definition, sensitive. We do not train on partner-provided graphs. The reference engine is run on-prem or in a partner-controlled tenant. We publish methodology, not cases.
11. Open science as architecture
Four commitments keep the scientific layer honest: the kernel is public and forkable; the dynamic-extension log is reviewable; the pipeline is MIT-licensed; the benchmark is a shared artifact, not a competitive moat. A company that makes claims about institutional reasoning and keeps them behind closed doors does not deserve the benefit of the doubt.
12. Invitations
This paper is an invitation. Four invitations, specifically:
- Read the kernel. Disagree in public.
- Open PRAXIS or DIALECTICA on a file you know well — a peace process, a rulemaking, a mediation, an HR investigation — and tell us where the dynamic extensions break.
- Propose a new TCGC task type, or propose a new specialization benchmark (policy options, regulatory contestation, ADR).
- If you run a pilot, write up what worked and what did not, openly.
Institutional reasoning is structured. Until now, the structure was invisible and uninheritable. We are making it visible — and inheritable — in the open.
7. DIALECTICA engine — seven graphs
DIALECTICA implements OAG as a neurosymbolic Option-2 architecture (Garcez & Lamb, 2023): a neural system performs pattern recognition over noisy text, a symbolic reasoning system maintains typed structure that does not drift under self-generated context. The engine partitions state across seven logical graphs, each with its own update discipline.
| Graph | Content | Update discipline | Mutability |
|---|---|---|---|
| Base | Raw ingested facts as extracted from source corpora. | Append-only at ingest. | Immutable |
| Evidence | Source spans, citations, dataset references. | Immutable on write; cryptographic-style binding to Base. | Immutable |
| Analytical | Typed primitive instances and edges — the working ACO subgraph. | Bi-temporal; revisions logged as new edges with new t_tx; invalidation, not deletion. | Append-only at t_tx |
| Narrative | Framings, drift chains, competing stories. | Drift-tolerant by design; all framings preserved. | Append-only |
| Commitment | Bi-temporal commitment register with status transitions. | StatusTransition records; revision chains preserved. | Append-only at t_tx |
| Case | Practitioner-facing subgraph for a particular dossier. | Forkable per case; case-scoped extensions. | Mutable within case |
| Scenario | Counterfactual or prospective subgraphs for options analysis. | Forked from Case; isolated; never merged back without explicit promotion. | Mutable within scenario |
Base
Raw ingested facts as extracted from source corpora.
Append-only at ingest.
Immutable
Evidence
Source spans, citations, dataset references.
Immutable on write; cryptographic-style binding to Base.
Immutable
Analytical
Typed primitive instances and edges — the working ACO subgraph.
Bi-temporal; revisions logged as new edges with new t_tx; invalidation, not deletion.
Append-only at t_tx
Narrative
Framings, drift chains, competing stories.
Drift-tolerant by design; all framings preserved.
Append-only
Commitment
Bi-temporal commitment register with status transitions.
StatusTransition records; revision chains preserved.
Append-only at t_tx
Case
Practitioner-facing subgraph for a particular dossier.
Forkable per case; case-scoped extensions.
Mutable within case
Scenario
Counterfactual or prospective subgraphs for options analysis.
Forked from Case; isolated; never merged back without explicit promotion.
Mutable within scenario
Above the seven graphs DIALECTICA exposes four reasoning layers: GND (grounded, point-in-time over Base+Evidence), CTX (typed queries over Analytical / Narrative / Commitment), EVD (provenance traversal), and RZN (open-question layer where the LLM is brought back in, constrained by the typed graph as a hard scaffold rather than a soft retrieval source). The engine is documented at /research/dialectica.
Comparison with related patterns
Each pattern in this table is the right architecture for a different question. RAG is optimal for “find me relevant text.” GraphRAG is optimal for query-focused summarisation over corpora too large to read. OG-RAG is optimal when a stable domain ontology exists. Think-on-Graph dominates when reasoning structure dominates corpus structure. Buehler’s agentic deep graph reasoning is optimal for open-ended discovery. OAG is optimal for typed institutional analysis with contested provenance and bi-temporal honesty.
| Pattern | Schema | Provenance | Contestation | Temporality |
|---|---|---|---|---|
| RAGLewis et al., 2020 | None | Document-level | Averaged | Flattened |
| GraphRAGEdge et al., 2024 (arXiv:2404.16130) | Auto-induced, flat | Entity-level | Averaged | Mostly flattened |
| OG-RAGSharma et al., EMNLP 2025 | Domain ontology, fixed | Entity-level | Not modelled | Not modelled |
| Think-on-GraphSun et al., ICLR 2024 | Domain KG, fixed | Path-level | Not modelled | Not modelled |
| Agentic deep graph reasoningBuehler, 2025 (arXiv:2502.13025) | Schema-free, self-organising | Path-level | Not modelled | Not modelled |
| OAG (TACITUS)TACITUS Research, 2026 | Kernel + dynamic extensions | Span-level | First-class objects | Bi-temporal |
RAG
Lewis et al., 2020
- Schema
- None
- Provenance
- Document-level
- Contestation
- Averaged
- Temporality
- Flattened
GraphRAG
Edge et al., 2024 (arXiv:2404.16130)
- Schema
- Auto-induced, flat
- Provenance
- Entity-level
- Contestation
- Averaged
- Temporality
- Mostly flattened
OG-RAG
Sharma et al., EMNLP 2025
- Schema
- Domain ontology, fixed
- Provenance
- Entity-level
- Contestation
- Not modelled
- Temporality
- Not modelled
Think-on-Graph
Sun et al., ICLR 2024
- Schema
- Domain KG, fixed
- Provenance
- Path-level
- Contestation
- Not modelled
- Temporality
- Not modelled
Agentic deep graph reasoning
Buehler, 2025 (arXiv:2502.13025)
- Schema
- Schema-free, self-organising
- Provenance
- Path-level
- Contestation
- Not modelled
- Temporality
- Not modelled
OAG (TACITUS)
TACITUS Research, 2026
- Schema
- Kernel + dynamic extensions
- Provenance
- Span-level
- Contestation
- First-class objects
- Temporality
- Bi-temporal
Bi-temporal worked example
A regulatory agency issues a written statement at 09:00 Monday: “The rule will be redrafted before Q3.” The OAG pipeline emits:
Commitment(id=c1, undertaker=Agency, content="redraft rule before Q3", status=Active, t_valid=[Mon 09:00, ∞), t_tx=[Mon 09:30, ∞), source_span=PressRelease.pdf:chars[1024:1198])
A trade body posts an ambiguous acknowledgement at 11:00 Monday. OAG emits a Claim from the trade body with relation ACKNOWLEDGES_AMBIGUOUSLY to c1. On Thursday at 14:00, the agency’s spokesperson on a public panel says: “The redraft is exploratory; nothing has been promised.” OAG emits:
Claim(id=cl2, asserter=AgencySpokesperson, content="redraft exploratory; nothing promised", modality=Denied, type=Factual, t_valid=[Thu 14:00, ∞), t_tx=[Thu 14:30, ∞), source_span=PanelTranscript.txt:chars[8421:8503]) Edge(type=CONTRADICTS, from=cl2, to=c1, t_tx=[Thu 14:30, ∞)) StatusTransition(commitment_id=c1, from=Active, to=Contested, t_valid=Thu 14:00, t_tx=Thu 14:30, triggering_claim_id=cl2, source_span=PanelTranscript.txt:chars[8421:8503])
A query at Friday 08:00 asking “what did the agency commit to?” returns the Commitment c1 with status Contested, the original Monday statement as the basis, the Thursday denial as the contradicting claim, and the full revision chain. A generic RAG system asked the same question would return a smoothed paraphrase. The structure is the output.
References
Foundations
- [1]Fisher, R. et al. (1981). Getting to Yes. Houghton Mifflin.foundationsnegotiation
- [2]Glasl, F. (1999). Confronting Conflict. Hawthorn Press.foundationsconflict
- [3]Galtung, J. (1990). Cultural Violence. Journal of Peace Research, 27(3), 291–305.foundationsconflict
- [4]Axelrod, R. (1984). The Evolution of Cooperation. Basic Books.foundationscooperation
- [5]Sunstein, C. R. (1994). Incommensurability and Valuation in Law. Michigan Law Review, 92(4), 779–861.foundationslaw
- [6]Kingdon, J. W. (2003). Agendas, Alternatives and Public Policies (2nd ed.). Longman.foundationspolicy-process
- [7]Sabatier, P. A. and Weible, C. M. (2014). Theories of the Policy Process (3rd ed.). Westview.foundationspolicy-process
- [8]Baumgartner, F. R. and Jones, B. D. (1993). Agendas and Instability in American Politics. University of Chicago Press.foundationspolicy-process
- [9]Hogan, A. and et al. (2021). Knowledge Graphs. ACM Computing Surveys, 54(4).foundationsknowledge-graphs
- [10]Garcez, A. d'A. and Lamb, L. C. (2023). Neurosymbolic AI: The 3rd Wave. Artificial Intelligence Review, 56(11), 12387–12406.foundationsneurosymbolic
LLM failure-mode literature
- [11]Chen, W. and et al. (2023). Benchmarking LLMs on Temporal Reasoning. arXiv:2306.08952.arXiv:2306.08952llm-failurestemporal
- [12]Kıcıman, E. and et al. (2023). Causal Reasoning and Large Language Models. arXiv:2305.00050.arXiv:2305.00050llm-failurescausality
- [13]Ji, Z. and et al. (2023). Survey of Hallucination in Natural Language Generation. ACM Computing Surveys, 55(12).llm-failureshallucination
- [14]Hou, Y. and et al. (2024). WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts. NeurIPS 2024 D&B Track. arXiv:2406.13805.arXiv:2406.13805llm-failurescontradictionbenchmark
- [15]Su, Z. and et al. (2024). ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLMs. NeurIPS 2024 D&B Track. arXiv:2408.12076.arXiv:2408.12076llm-failurescontradictionbenchmark
Retrieval, ontology, and generation patterns
- [16]Lewis, P. and et al. (2020). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. NeurIPS 2020.retrievalrag
- [17]Edge, D. and et al. (2024). From Local to Global: A Graph RAG Approach to Query-Focused Summarization. arXiv:2404.16130.arXiv:2404.16130retrievalgraphrag
- [18]Sharma, K. et al. (2025). OG-RAG: Ontology-Grounded RAG for Large Language Models. EMNLP 2025 (ACL Anthology 2025.emnlp-main.1674). arXiv:2412.15235.arXiv:2412.15235retrievalontologyog-rag
- [19]Sun, J. and et al. (2024). Think-on-Graph. ICLR 2024.retrievalgraph-reasoning
- [20]Buehler, M. J. (2025). Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks. J. Mater. Res. 40, 2204. arXiv:2502.13025.arXiv:2502.13025agenticgraph-reasoning
- [21]Babaei Giglou, H. et al. (2025). LLMs4OL 2025: The 2nd LLMs for Ontology Learning Challenge at the 24th ISWC. Open Conference Proceedings 6.ontology-learningbenchmark
- [22]Beliaeva, A. and Rahmatullaev, T. (2025). Heterogeneous LLM Methods for Ontology Learning. LLMs4OL 2025. arXiv:2508.19428.arXiv:2508.19428ontology-learning
- [23]Bian, H. (2025). LLM-Empowered Knowledge Graph Construction: A Survey. arXiv:2510.20345.arXiv:2510.20345surveyknowledge-graphs
- [24]Zhang, D. and et al. (2025). MedKGent: A Large Language Model Agent Framework for Constructing Temporally Evolving Medical Knowledge Graphs. arXiv:2508.12393.arXiv:2508.12393temporalagenticmedical
- [25]Rasmussen, P. and et al. (2025). Zep: A Temporal Knowledge Graph Architecture for Agent Memory. arXiv:2501.13956.arXiv:2501.13956temporalagenticmemory
- [26]Neo4j and Zep (2025). Graphiti: Build Real-Time Knowledge Graphs for AI Agents.Linktemporalagenticmemory
- [27]Anthropic (2025). Effective Context Engineering for AI Agents. Anthropic engineering blog.Linkagenticcontext
- [28]Anthropic (2025). Effective Harnesses for Long-Running Agents. Anthropic engineering blog.Linkagenticcontext
- [29]Letta (2025). Context-Bench: Benchmarking LLMs on Agentic Context Engineering.Linkagenticbenchmarkcontext
Ontology engineering
- [30]Héja, G. et al. (2008). Ontological Analysis of SNOMED CT. BMC Medical Informatics and Decision Making, 8(S1), S8.ontology-engineering
- [31]Schulz, S. and et al. (2023). SNOMED CT and Basic Formal Ontology — Convergence or Contradiction Between Standards?. Applied Ontology. doi:10.3233/AO-230018.DOIontology-engineering
- [32]Keet, C. M. and Grütter, R. (2021). Toward a Systematic Conflict Resolution Framework for Ontologies. Journal of Biomedical Semantics. doi:10.1186/s13326-021-00246-0.DOIontology-engineering
Institutional context
- [33]NIST (2023). Artificial Intelligence Risk Management Framework (AI RMF 1.0).Linkinstitutionalgovernance
- [34]OECD (2025). Governing with Artificial Intelligence: Are governments ready?. OECD Publishing.institutionalgovernance
- [35]ACLED (2024). Conflict categories and codebook methodology.Linkdatasetconflict
Cite this
@techreport{tacitus2026knowledgelayerv2,
title = {The AI Knowledge Layer for Policy and Political Work:
Kernel Ontology, Dynamic Extensions, and
Ontology-Augmented Generation},
author = {{TACITUS Research}},
year = {2026},
note = {Version 2.0},
url = {https://www.tacitus.me/research/vision}
}