A Fact-Producing Compiler

April 09, 2026 6 min read Keywords: concrete, compilers, AI, llm, Formal Verification, Programming Languages

Figures lean in around a brass orrery lit from within, studying the clockwork model of the planets — *A Philosopher Lecturing on the Orrery*, Joseph Wright of Derby, c. 1766. Derby Museum and Art Gallery (public domain).

Series note: this article is part of the Concrete series and responds to Dmitri Sotnikov’s Giving LLMs a Formal Reasoning Engine for Code Analysis. Related: When the Compiler Is the Oracle and Why Concrete Exists.

When an AI agent explores a codebase, it usually greps for names, reads a few matches, searches for callers, reads those, and tries to piece together a mental model of the program from text fragments. This works about as well as you would expect. The agent is asking structural questions about a program, things like “can user input reach this SQL query?” or “what changes if I touch this function?”, but the only tool it has is text search.

Yesterday I read Dmitri Sotnikov’s article about giving LLMs a symbolic reasoning engine for code analysis. His tool, Chiasmus, parses source code with tree-sitter (a syntax parser), turns definitions and calls into logic facts, and lets an LLM run graph queries instead of grepping through files. That is a much better interface: the agent asks a structural question and gets a structural answer.

Reading the post gave me a better phrase for part of what we are building with Concrete: a fact-producing compiler.

Concrete is the systems programming language we are building for programs that need auditability. It compiles code into an executable and into checked statements about what that executable can do.

#From syntax to semantics

Chiasmus works by recovering structure from source code after the fact: it parses files, extracts which functions exist and what calls what, and turns that into queryable facts. For existing languages that were never designed to expose this information, that is the practical approach.

Concrete can go further because the compiler already knows more than syntax. Tree-sitter can see that foo calls bar, but the Concrete compiler also knows that bar requires Network authority, that foo carries that authority in its signature, and that the call chain crosses a trusted FFI boundary.

By authority I mean what code is allowed to do: allocate memory, read files, touch the network, call unsafe code, cross into foreign code. In Concrete, those permissions are part of the program the compiler checks. They are not comments and they are not recovered later by a scanner.

Today the compiler exposes this as a human report. An authority report on our JSON parser shows where allocation comes from:

capability Alloc (13 functions):
  pub store  <- store -> vec_push
      parse_string  <- parse_string -> store
  pub parse_value  <- parse_value -> parse_string
      parse_array  <- parse_array -> vec_push

Read one line like this: parse_value needs allocation because it calls parse_string, which eventually stores bytes in a vector.

The compiler tracks capabilities (File, Network, Alloc, etc.) in function signatures, enforces them transitively, and can report the path that explains why a function needs some authority. It also tracks execution shape: direct recursion, mutual call cycles, and loop boundedness. The predictable profile is the stricter direction: today Concrete reports and checks parts of bounded execution, while the fully enforced profile is still being tightened.

For proofs, the same rule applies. A report can say whether a claim was merely reported by the compiler, enforced by a compiler check, proved in Lean (a proof assistant), or accepted because of a trusted assumption. Proof evidence is attached to the function name and body fingerprint, so changed code cannot silently keep stale proofs.

All of this started as human-readable reports. Since then, the project has moved closer to the thing this article is arguing for: one artifact that review tools, CI, and agents can all read. The reports now have proof bundles, source-contract obligations, VC ledgers, traceability output, audit summaries, and better drift detection. The remaining work is not to invent the facts. It is to make the whole surface pleasant to query.

#What querying looks like

Consider a routine scenario. A reviewer opens a PR that bumps a dependency. The new version reads environment variables three layers deep in a helper. In Rust, nothing in the function signatures changes and the reviewer has to diff the dependency source or hope the changelog mentions it. In Concrete, the function that calls into that dependency must have declared Env authority or the build breaks.

I want the same kind of interface for every compiler fact, capabilities included. Here is what that could look like. An agent or tool asks a question, and the compiler returns a checked answer:

“Can the packet parser core touch the network?”

{
  "reachable": false,
  "from": "main.decode_header",
  "to_capability": "Network",
  "evidence": "compiler-checked call graph and capability facts"
}

“Why is main not predictable?”

{
  "violations": [
    {
      "gate": "no_blocking",
      "capability": "File",
      "path": ["main.main", "std.fs.read_to_string"]
    }
  ]
}

“Which dependency widened authority since yesterday?” Because the compiler tracks the authority chain for every function, it can return the path.

The model can explain the result. The compiler should supply it.

#What we are making queryable

First, we should expose the facts Concrete is already built around: which functions can allocate, read files, touch the network, call unsafe code, or cross FFI; why each authority is required; which functions recurse or enter call cycles; which loops are bounded; which runtime-safety obligations exist; which proof claims are current, proved, stale, missing, assumed, enforced, reported, or trusted.

Then it should answer the questions people ask in review. Did this dependency add a path to File, Network, or Env? Did this module become less predictable? Did a proof go stale? Did a trusted boundary move? Did authority widen between yesterday’s build and today’s?

The fact artifact comes first. A query CLI, MCP, CI checks, review tools, and agent integration should all read the same checked facts.

#The compiler should say what is true

A compiler should say what is true about the program, beyond “type check passed.” What can this function touch? Is it recursive? Are its loops bounded? Does it cross FFI? Is the proof current? Which path explains the authority?

Those are program facts. Concrete produces some, enforces some, proves some, and labels the rest. The direction is not just better reports. It is a compiler artifact that other tools can safely build on.

Written with an LLM in the loop, like everything here. The ideas and the mistakes are mine. More on how I write.