Spex

Spex is a tool to check the correctness of implementations against specifications. Concretely, it provides ways to (1) define specifications, (2) derive implementation models, and (3) check these against each other w.r.t. behavioural equivalence.

In this way, it is an attempt to bring a practical slice of formal verification theory into Elixir practice. It lets you be more confident in the correctness of your implementations and helps you catch bugs and prevent unintended behaviours, especially in complex, multi-step workflows and protocols, possibly with asynchronous steps and concurrency. You can think of it as similar to a type system (or a layer above one): the annotations Spex requires can make your code more readable and help you catch bugs early on. Instead of checking for type correctness, however, Spex checks for behavioural correctness.

The three primary concepts of Spex are specifications, instances, and implementation models: you define specifications, initialise instances of these specifications and record their state transitions, and from this the Spex engine derives an implementation model for the given specification.

More details on the formal concepts follow in the Theoretical background section below.

How to use Spex (Quick start)

This section is a shortened version of the moduledoc of Spex. See there for details.

Install, configure, and start Spex

Add Spex to your dependencies:

def deps do
  [
    {:spex, "~> 0.1.1", hex: :lts_spex}
  ]
end

Please note the hex: :lts_spex.¹

Also note that Spex uses a Rust NIF under the hood, which is not yet available pre-compiled, so you need to have Rust installed on your system.

Optionally, add :spex to your .formatter.exs file under :import_deps.

Configuration

The derived implementation models are stored in a custom format in a given folder. Ideally, this should be within the priv directory of your application (:code.priv_dir(:your_app)).

config :spex, impl_models_dir: Path.join(:code.priv_dir(:your_app), "spex_impl_models")

Define specifications

Specifications represent protocols or workflows and are modelled as graphs / state machines. The focus lies on the transitions (edges) rather than the states (nodes): when we compare an implementation model against its specification, their states can be entirely distinct, since behavioural equivalence is judged entirely based on their transition behaviours (basically: action sequences).

In the simplest case, you can create a specification as follows:

defmodule YourApp.Specifications.Tree do
  @moduledoc """
  A specification that forms a small, simple tree structure:

               s0
              /  |
            a/    |a
            /      |
          s1       s2
          / |
        b/   |c
        /     |
      s3      s4

  """
  use Spex.Specification

  def_transition :s0, :a, :s1
  def_transition :s0, :a, :s2

  def_transition :s1, :b, :s3
  def_transition :s1, :c, :s4
end

Each transition consists of a from_state, an action, and a to_state.

Specifications can have custom error handlers, to which any reported errors are passed for processing, e.g. logging and deciding which errors are okay.

For details on configuring and customising specifications, see Spex.Specification; in particular regarding custom error handlers, see c:Spex.Specification.error_handler/2.

Observe transitions in your implementation

In your actual implementation code, you need to initialise instances of your specification and record transitions.

As a useless toy example, say we manage a pine forest where each pine has a number. We offer a function for planting a seedling:

alias YourApp.Specifications.Tree

def plant_seedling(pine_id) do
  specification = Tree
  instance_identifier = {Tree, pine_id}
  meta = %{planting_datetime: DateTime.utc_now()}
  initial_state = :seedling

  Spex.init_instance(specification, instance_identifier, meta, initial_state)

  # Actual domain logic then follows
end

Note that the instance identifier is an arbitrary term that uniquely identifies the instance across all specifications (hence it can make sense to include the specification module). The meta is optional and can make any error reports about the instance more useful. The initial state is also optional and defaults to the specification's initial state.

After an instance is initialised, we can record its transitions with Spex.transition(instance_identifier, action, new_state):

def pour_onto(pine_id, :water), do: Spex.transition({Tree, pine_id}, :a, :sapling)
def pour_onto(pine_id, :acid), do: Spex.transition({Tree, pine_id}, :a, :withered)
def observe_growth(pine_id), do: Spex.transition({Tree, pine_id}, :__internal__, :mature_tree)
def cut_down(pine_id), do: Spex.transition({Tree, pine_id}, :b, :lumber)
def burn(pine_id), do: Spex.transition({Tree, pine_id}, :c, :ash)

As mentioned above, the states do not have to match the specification states. The actions, however, do need to match, as these are what the behavioural equivalence is judged on.

An exception to this and a useful addition is the :__internal__ action, which can be used to record internal state transitions that are not part of the specification but that you still want to be recorded in the implementation model. These are ignored (or rather: treated specially) for the behavioural equivalence checks.

Derive implementation models: in tests

Your tests should cover all possible transitions of your implementation, so we can use them to derive an implementation model. To do this, add the following to your test_helper.exs:

Spex.Testing.prepare_for_test_suite()

This prepares the Spex engine to record all transitions from tests and to write the derived implementation models to the directory configured above.

In unit tests, we often construct our parameters to be in a certain state. When using Spex.init_instance/4, though, the given initial state is added as an initial state of the implementation model, which we don't want in these cases. To construct an instance in a certain state without affecting the implementation model, there is Spex.Testing.mock_instance!/4. In a test for our pine forest above, you might do something like this:

alias YourApp.Specifications.Tree

Spex.Testing.mock_instance!(Tree, {Tree, 1}, :mature_tree)

assert YourApp.Pine.cut_down(1) == :ok

Since cut_down/1 calls Spex.transition({Tree, pine_id}, :b, :lumber), this test would add a transition from state :mature_tree with action :b to state :lumber to the implementation model.
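To make the idea of deriving a model from recorded transitions concrete, here is a minimal sketch in plain Elixir. It is not how Spex is implemented: the module RecorderSketch, the function derive_model/1, and the event tuples are hypothetical names chosen for illustration. The sketch only shows that tracking each instance's current state is enough to turn a recorded (action, new state) pair into a full edge.

```elixir
defmodule RecorderSketch do
  @moduledoc "Illustration only; not part of Spex."

  # Folds a list of events into a MapSet of {from_state, action, to_state}
  # edges. While folding, `current` maps each instance identifier to the
  # state that instance is currently in.
  def derive_model(events) do
    {_current, model} =
      Enum.reduce(events, {%{}, MapSet.new()}, fn
        # Mocking places an instance in a state without adding an edge.
        {:mock, id, state}, {current, model} ->
          {Map.put(current, id, state), model}

        # A recorded transition adds an edge from the current state.
        {:transition, id, action, new_state}, {current, model} ->
          from_state = Map.fetch!(current, id)
          edge = {from_state, action, new_state}
          {Map.put(current, id, new_state), MapSet.put(model, edge)}
      end)

    model
  end
end

# Hypothetical event log mirroring the test above.
events = [
  {:mock, {Tree, 1}, :mature_tree},
  {:transition, {Tree, 1}, :b, :lumber}
]

model = RecorderSketch.derive_model(events)
```

Under these assumptions, model contains the single edge {:mature_tree, :b, :lumber}, which matches the transition described above.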

Check behavioural equivalence: offline

After implementation models have been derived and stored, you can run mix spex (see Mix.Tasks.Spex) to check whether they are behaviourally equivalent to their specifications. Currently, the output only tells you whether an implementation model is not behaviourally equivalent to its specification; you have to analyse the model yourself to understand why. If one or more implementation models are not behaviourally equivalent to their specifications, the command exits with exit code 2. You might consider adding this command to your CI pipeline.

Check behavioural equivalence: online

Once you have derived implementation models that were deemed behaviourally equivalent to their specifications, you are ready to use Spex in production. Every time you initialise an instance or record a transition, Spex checks whether that initialisation or transition is part of the implementation model. If it is, all is well. If it isn't, Spex runs a behavioural equivalence check of the resulting model against the specification. In either case, a %Spex.Errors.InstanceError{} is reported, with :reason being either :deviation_still_equivalent or :deviation_not_equivalent.

It is up to your specification's error handler to handle these cases accordingly (e.g. log the occurrence but still return :ok in the :deviation_still_equivalent case; this is what the default error handler does). Deviations are reported even when they don't break the equivalence because they hint at scenarios your tests don't cover. You can resolve such reports by adding a test that covers the transition or by adding the transition manually to your implementation model.
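The decision flow described above can be sketched in plain Elixir as follows. This is only an illustration under stated assumptions, not Spex's actual code: the module OnlineCheckSketch, the function classify/3, the return shapes, and the caller-supplied equivalence_check function are all hypothetical. Only the two reason atoms are taken from the text above.

```elixir
defmodule OnlineCheckSketch do
  @moduledoc "Illustration only; not part of Spex."

  # `model` is a MapSet of {from_state, action, to_state} edges.
  # `equivalence_check` is a one-argument function standing in for the
  # behavioural equivalence check; it receives the model extended by the
  # observed transition and returns true or false.
  def classify(model, transition, equivalence_check) do
    cond do
      # Known transition: nothing to report.
      MapSet.member?(model, transition) ->
        :ok

      # Unknown transition that keeps the model equivalent.
      equivalence_check.(MapSet.put(model, transition)) ->
        {:deviation, :deviation_still_equivalent}

      # Unknown transition that breaks the equivalence.
      true ->
        {:deviation, :deviation_not_equivalent}
    end
  end
end

# Hypothetical stored model and stand-in checks.
known_model = MapSet.new([{:seedling, :a, :sapling}])
always_equivalent = fn _extended_model -> true end
never_equivalent = fn _extended_model -> false end

known = OnlineCheckSketch.classify(known_model, {:seedling, :a, :sapling}, never_equivalent)
harmless = OnlineCheckSketch.classify(known_model, {:seedling, :a, :withered}, always_equivalent)
breaking = OnlineCheckSketch.classify(known_model, {:seedling, :z, :withered}, never_equivalent)
```

Note that the equivalence check only runs for transitions that are not already in the model, so the common case stays a cheap set lookup in this sketch.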

Theoretical background

Spex is motivated by ideas from concurrency theory and behavioural semantics. It is intended to be a light form of a refinement checker that does not require complex, static code analysis or manual proofs.

The primary theoretical model in Spex is that of labelled transition systems (LTSs): an LTS consists of a set of states and a set of transitions between those states, each transition being labelled by an action. Roughly speaking, the possible sequences of actions an LTS allows describe the behaviour of the system it models. The idea behind Spex is that both specifications and implementation models can be described as LTSs.

Bisimilarity is a way to compare two LTSs and to deem them behaviourally equivalent based on these action sequences. The concrete definition of behavioural equivalence used by Spex is called branching bisimilarity. This is a relation between two LTSs that considers them equivalent if they can simulate each other's behaviour, while allowing for some flexibility in terms of internal transitions. The idea is that two systems are considered equivalent if they can perform the same sequences of observable (i.e. non-internal) actions, even if they may differ in their internal workings or states.
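As a rough illustration of comparing two LTSs by their observable action sequences, the following plain Elixir sketch represents an LTS as a list of {from_state, action, to_state} tuples and collects its observable traces up to a bounded depth. It is not Spex's algorithm: the module LtsSketch and the function observable_traces/3 are hypothetical, and equal traces are a strictly weaker property than branching bisimilarity, which also takes the branching structure into account. The implementation model below is an assumed one for the pine example, written out by hand.

```elixir
defmodule LtsSketch do
  @moduledoc "Illustration only; not part of Spex and not a bisimilarity check."

  @internal :__internal__

  # Returns a MapSet of all action sequences that use at most `depth`
  # transitions starting in `state`, with internal actions dropped.
  def observable_traces(_transitions, _state, 0), do: MapSet.new([[]])

  def observable_traces(transitions, state, depth) do
    for {^state, action, to} <- transitions,
        rest <- observable_traces(transitions, to, depth - 1),
        into: MapSet.new([[]]) do
      if action == @internal, do: rest, else: [action | rest]
    end
  end
end

# The Tree specification from above, written as plain data.
spec = [
  {:s0, :a, :s1},
  {:s0, :a, :s2},
  {:s1, :b, :s3},
  {:s1, :c, :s4}
]

# An assumed implementation model for the pine example.
impl = [
  {:seedling, :a, :sapling},
  {:seedling, :a, :withered},
  {:sapling, :__internal__, :mature_tree},
  {:mature_tree, :b, :lumber},
  {:mature_tree, :c, :ash}
]

spec_traces = LtsSketch.observable_traces(spec, :s0, 4)
impl_traces = LtsSketch.observable_traces(impl, :seedling, 4)

same_traces? = MapSet.equal?(spec_traces, impl_traces)
```

Here same_traces? is true even though the two systems share no state names and the implementation has an extra internal step, which is the flexibility described above.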

To learn more about the theory, you might start with the Wikipedia page for any of the terms above. You might also read sections 2.1 and 2.2 of my Bachelor's thesis (skipping the Isabelle sections, which formalise the notions in the Isabelle theorem prover). (The main part of the thesis is not related to Spex.)

To learn more about branching bisimilarity in particular, you can read e.g. the following paper:

Van Glabbeek, R.J., Weijland, W.P.: Branching time and abstraction in bisimulation semantics. JACM 43(3), 555–600 (1996). https://doi.org/10.1145/233551.233556

Here is the paper describing the algorithm Spex uses under the hood for determining branching bisimilarity:

Martens, J., Laveaux, M. (2026). Faster Signature Refinement for Branching Bisimilarity Minimization. In: Junges, S., Katz, G. (eds) Tools and Algorithms for the Construction and Analysis of Systems. TACAS 2026. Lecture Notes in Computer Science, vol 16505. Springer, Cham. https://doi.org/10.1007/978-3-032-22752-2_23

The algorithm was implemented in Rust in the crate merc_reduction and used in Spex through a NIF. I want to thank Maurice Laveaux for implementing this tool and making it available!

This readme file has been written without the use of AI. The code and other documentation were written partially AI-assisted.

¹ The package name :spex itself was already taken (which I realised only after becoming attached to the name). However, since that package is unmaintained and not widely used, I decided to keep the namespace simply as Spex. The lts prefix stands for labelled transition system (see the Theoretical background section).
