feat(Query): query complexity framework with sorting examples by kim-em · Pull Request #401 · leanprover/cslib

kim-em · 2026-03-05T05:57:57Z

This PR implements Sebastian Graf's unified approach to query complexity (discussed in the CSLib Algorithm frameworks thread), combining the strengths of #372 (explicit Prog/FreeM query types) and #376 (monad-parametric approach).

Programs are Prog Q α (free monad over query type Q), and the oracle is supplied after the program produces its query plan — giving anti-cheating guarantees for both upper and lower bounds. No WP/Hoare triple machinery is needed: correctness is just equations about Prog.eval oracle, and cost is just equations about Prog.queriesOn oracle.

This provides an alternative to the TimeM-based cost analysis already in the repo: here query counting is structural (derived from the Prog tree) rather than annotation-based.

New files

File	Contents
`Query/Prog.lean`	Core `Prog` type, `eval`, `queriesOn`, simp lemmas
`Query/Bounds.lean`	`UpperBound` and `LowerBound` definitions
`Query/QueryTree.lean`	Decision trees with fixed response type, for lower bound proofs
`Query/Sort/LEQuery.lean`	Comparison query type for sorting
`Query/Sort/IsSort.lean`	`IsSort` correctness specification
`Query/Sort/Insertion/{Defs,Lemmas}.lean`	Insertion sort: correctness + O(n²) upper bound
`Query/Sort/Merge/{Defs,Lemmas}.lean`	Merge sort: correctness + n·⌈log₂ n⌉ upper bound
`Query/Sort/QueryTree.lean`	`Prog`-to-`QueryTree` bridge + pigeonhole depth lemma
`Query/Sort/LowerBound.lean`	Any correct comparison sort needs ≥ ⌈log₂(n!)⌉ queries

Results

Insertion sort: correctness (permutation + sortedness), n² upper bound, IsSort instance
Merge sort: correctness, n·⌈log₂ n⌉ upper bound, IsSort instance
Lower bound: any IsSort on an infinite type makes ≥ ⌈log₂(n!)⌉ queries. The proof constructs n! distinct total orders via permutations of embedded elements, shows they force distinct sorted outputs, then applies an adversarial pigeonhole argument on QueryTree depth.

🤖 Prepared with Claude Code

Author : Shreyas Srinivas Co-Author : Eric Wieser Co-Author : Tanner Duve

Co-authored-by: Eric Wieser <wieser.eric@gmail.com>

…o query-final-squash

Co-authored-by: Shreyas Srinivas <Shreyas4991@users.noreply.github.com> Co-authored-by: Eric Wieser <eric-wieser@users.noreply.github.com> Co-authored-by: Tanner Duve <tannerduve@gmail.com>

…quash

Shreyas4991 · 2026-04-22T03:41:53Z

I wish to point out that the authorship and copyright headers must also be changed. This is my code. The FRO can't claim copyright.
Same for author comments. The order of authorship matters.
This PR still doesn't address the technical deficiencies w.r.t. mine, because it entirely overwrites my content.
Query Trees are still redundant. They are trivially derivable from Progs.
The Arith query should use my query and build on top of it.
Per Fabrizio's message to me there was no mention of "superseding" any of my work. He said the pr would build on top of mine.
The overall technical merits of adding extra monadic polymorphism becomes even less apparent. Technically it doesn't even make sense to add another monad parameter since FreeM is already parametrising a functor (more general than a monad). That is it is already monad parametric.
All the features claimed to be derived from model hiding can already be achieved in the model of 372. In fact this is as simple as quantifying over arbitrary models in theorem statements. The extra monad parameter is redundant. It can actually be substituted by any type constructor (which is essentially what the F in FreeM F a really is).
The examples I added should be preserved.

Add a framework for proving upper and lower bounds on query complexity of comparison-based algorithms, using `Prog` (free monad over query types) with oracle-parametric evaluation and structural query counting. Results: - Insertion sort: correctness + O(n²) upper bound - Merge sort: correctness + n·⌈log₂ n⌉ upper bound - Lower bound: any correct comparison sort on an infinite type needs ≥ ⌈log₂(n!)⌉ queries (via adversarial pigeonhole on QueryTree depth) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Shrys <shreyasss94@gmail.com>

Add `Prog.cost`, a weighted generalization of `Prog.queriesOn` where each query type can have a different cost. Demonstrate this with complex multiplication: naive (4 muls + 2 adds) vs Gauss's trick (3 muls + 5 adds), proving correctness, exact parametric costs, and the crossover condition. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Shrys <shreyasss94@gmail.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Shrys <shreyasss94@gmail.com>

sgraf812

I think 3 of my 4 items in #401 (review) haven't been addressed yet. I think the doX stuff has been addressed.

sgraf812 · 2026-04-22T16:18:30Z

+/-- Evaluate a program by answering each query using `oracle`. -/
+@[expose] def eval (oracle : {ι : Type} → Q ι → ι) : Prog Q α → α
+  | .pure a => a
+  | .liftBind op cont => eval oracle (cont (oracle op))


Still relevant

sgraf812 · 2026-04-22T16:19:12Z

+
+This is the free monad specialized to a single fixed-type operation, used to reify
+algorithms as explicit trees for query complexity lower bounds. -/
+inductive QueryTree (Q : Type) (R : Type) (α : Type) where


Delete this file. Use ProgM throughout.

Shreyas4991 · 2026-04-27T11:03:09Z

Programs are Prog Q α (free monad over query type Q), and the oracle is supplied after the program produces its query plan — giving anti-cheating guarantees for both upper and lower bounds. No WP/Hoare triple machinery is needed: correctness is just equations about Prog.eval oracle, and cost is just equations about Prog.queriesOn oracle.

One small nitpick, in the counting of time complexity, the complexity of an oracle call can depend on size/parameters of inputs supplied to it. For example the cost of calling a vertex connectivity query to a subgraph depends on the edge and vertex size of a subgraph. The current definition simply assumes each oracle call costs 1.

Prog Q α was already a definitional `abbrev` for FreeM Q α; this commit deletes the Prog namespace and moves the eval/queriesOn/cost interpreters to a new Cslib/Algorithms/Lean/Query/FreeM.lean. All call sites in the query subtree (sorting, arith examples, bounds) now refer to FreeM directly. One-step query constructors (LEQuery.ask, ArithQuery.doAdd/ doSub/doMul) now use FreeM.lift rather than raw .liftBind … .pure. QueryTree and the Prog→QueryTree bridge (now FreeM.toQueryTree) remain in place; deleting QueryTree requires generalising the lower-bound lemma and lands separately. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

The QueryTree decision-tree datatype was a single-response-type specialisation of FreeM, kept around because the existing combinatorial lower-bound lemma was easier to state with a fixed response type. This commit ports the lemma directly to FreeM: FreeM.exists_queriesOn_ge_clog : if every response type has cardinality ≤ r, n distinct injective oracles force some oracle to make ≥ ⌈log_r n⌉ queries. The proof mostly mirrors the QueryTree version, using @liftBind to bind the existential response type, and one extra ceiling-division step (Nat.div_le_div_left) to relate the per-node branching factor to the global bound r. Sort/LowerBound.lean now applies the FreeM lemma directly, with LEQuery.fintypeResponse / cardResponse_le_two witnessing that LEQuery responses are always Bool. The Prog→QueryTree bridge (toQTOracle / fromQTOracle / toQueryTree / *_eval / *_queriesOn) is gone; only LEQuery.oracleOf survives, renamed and moved into Sort/LEQuery.lean. Both QueryTree.lean and Sort/QueryTree.lean are deleted. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…ment The threshold theorem `gauss_le_naive` uses `3 * c_add ≤ c_mul` (inclusive), so the section header should say "at least 3×", not "more than 3×". Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…liftM Addresses Eric Wieser's review (Mar 5 2026, on the original Prog.lean): "pattern matching on the free monad is exploiting an implementation detail, and that everything should really go through the universal property, FreeM.liftM." All three interpreters are now defined as `liftM` into a target monad: eval : liftM (m := Id) oracle cost : liftM (m := Tally) (fun op => ⟨weight op, oracle op⟩) |>.cost queriesOn : cost oracle (fun _ => 1) where `Tally` is a tiny accumulator monad (a value paired with a `Nat`-valued running cost) introduced in this file with `Monad` and `LawfulMonad` instances. The right primitive turned out to be `def`, not `abbrev`. With `def`, the constructor-form simp lemmas (eval_pure, eval_liftBind, cost_pure, cost_liftBind, queriesOn_pure, queriesOn_liftBind) all reduce by rfl, so downstream proof ergonomics are unchanged from the original pattern-match definitions. simp normal form is determined by the explicit @[simp] theorems rather than opportunistic abbrev unfolding (which would otherwise mix `queriesOn` and `cost _ (fun _ => 1)` forms in goals and confuse omega). Net effect: the universal property is the actual definition, not a post-hoc characterisation. queriesOn_eq_cost_one is rfl. No downstream proof needed updating. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

kim-em · 2026-04-28T04:25:53Z

Define queriesOn as an abbreviation of cost, kill lemmas for the former

@sgraf812 could you take a look again and see if you'd still like me to remove things here?

eric-wieser · 2026-04-28T04:47:40Z

+/-- Evaluate a program by answering each query using `oracle`.
+Defined as `liftM` to `Id`, the canonical interpreter into pure values. -/
+@[expose] def eval (oracle : {ι : Type} → F ι → ι) (p : FreeM F α) : α :=
+  p.liftM (m := Id) oracle


This is missing pure and Id.run

…linter Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

eric-wieser · 2026-04-28T04:52:14Z

+    of its input, and produces a sorted list when the oracle implements a total order. -/
+structure IsSort (sort : List α → FreeM (LEQuery α) (List α)) : Prop where
+  /-- The sort produces a permutation of its input, for any oracle. -/
+  perm : ∀ (xs : List α) (oracle : {ι : Type} → LEQuery α ι → ι),


Requiring this for even bad oracles is an interesting but reasonable choice.

Although is it really reasonable? I think it could unnecessarily rule out some high performance implementations. I don't have a concrete example, though...

The name isSort is misleading because you also include perm inside. More descriptive/explicit name is better e.g., isSortPerm.

eric-wieser · 2026-04-28T04:52:53Z

+  | le (a b : α) : LEQuery α Bool
+
+/-- Lift `LEQuery.le a b` into a `FreeM` that returns the comparison result. -/
+@[expose] def LEQuery.ask (a b : α) : FreeM (LEQuery α) Bool :=


I'd suggest making all these wrappers for FreeM.lift abbrevs

(as I do in #525)

sgraf812

Much better 👍

sgraf812 · 2026-04-28T05:10:40Z

+    of its input, and produces a sorted list when the oracle implements a total order. -/
+structure IsSort (sort : List α → FreeM (LEQuery α) (List α)) : Prop where
+  /-- The sort produces a permutation of its input, for any oracle. -/
+  perm : ∀ (xs : List α) (oracle : {ι : Type} → LEQuery α ι → ι),


Although is it really reasonable? I think it could unnecessarily rule out some high performance implementations. I don't have a concrete example, though...

sgraf812 · 2026-04-28T05:40:55Z

+  (p.liftM (m := Tally) (fun op => ⟨weight op, oracle op⟩)).cost
+
+/-- Count the number of queries along the path determined by `oracle`. -/
+@[expose] def queriesOn (oracle : {ι : Type} → F ι → ι) (p : FreeM F α) : Nat :=


Minor: I tend to read queriesOn and think "this gives back a trace of the queries done by p" even if it doesn't make sense typically. Maybe cost1? Or countQueries (which nicely matches existing use of count in Std)?

sgraf812 · 2026-04-28T05:49:29Z

+    (orderedInsert x (y :: ys)).eval oracle =
+      if oracle (.le x y) then x :: y :: ys
+      else y :: (orderedInsert x ys).eval oracle := by
+  simp [orderedInsert, LEQuery.ask]


LEQuery.ask here hints at a missing simp lemma? Shouldn't eval_ask have fired? Maybe it is fixed by making LEQuery.ask an abbrev, as Eric suggests...

sgraf812 · 2026-04-28T05:50:06Z

+      if oracle (.le x y)
+      then x :: (merge xs' (y :: ys')).eval oracle
+      else y :: (merge (x :: xs') ys').eval oracle := by
+  simp [merge, LEQuery.ask]


Similarly here. Is there a reusable simp lemma missing?

sgraf812 · 2026-04-28T05:50:22Z

+      1 + if oracle (.le x y)
+      then (merge xs' (y :: ys')).queriesOn oracle
+      else (merge (x :: xs') ys').queriesOn oracle := by
+  simp [merge, LEQuery.ask]


Shreyas4991 · 2026-04-28T07:42:12Z

+
+/-- Count the number of queries along the path determined by `oracle`. -/
+@[expose] def queriesOn (oracle : {ι : Type} → F ι → ι) (p : FreeM F α) : Nat :=
+  cost oracle (fun _ => 1) p


Queries can have more generic, input-size dependent costs.

eric-wieser · 2026-04-29T00:37:07Z

+/-- Weighted query cost: each query has a cost given by `weight`, accumulated along the
+oracle-determined path. Defined as `liftM` into the accumulator monad `TimeM`. -/
+@[expose] def cost (oracle : {ι : Type} → F ι → ι)
+    (weight : {ι : Type} → F ι → Nat) (p : FreeM F α) : Nat :=
+  TimeM.time <| p.liftM fun op => ⟨oracle op, weight op⟩


I don't think there's a good reason to restrict to Nat here.

eric-wieser · 2026-04-29T00:39:01Z

+@[simp] theorem eval_orderedInsert_nil (oracle : {ι : Type} → LEQuery α ι → ι) (x : α) :
+    (orderedInsert x ([] : List α)).eval oracle = [x] := by


I think we should just prove that (orderedInsert x l).eval oracle = l.orderedInsert x (fun x y => oracle (.le x y)), and then we can delete most of this file.

sorrachai · 2026-05-06T09:38:23Z

+    {ix : Type} (p : FreeM F α) (S : Finset ix) (hS : S.Nonempty)
+    (oracles : ix → ({ρ : Type} → F ρ → ρ))
+    (h_inj : Set.InjOn (fun i => p.eval (oracles i)) ↑S) :
+    ∃ i ∈ S, p.queriesOn (oracles i) ≥ Nat.clog r S.card := by


sorrachai · 2026-05-06T09:44:44Z

+
+Because the oracle is supplied *after* the program produces its query plan (the `FreeM` tree),
+a sound implementation has no way to "guess" what the oracle would respond. This is the
+foundation of the anti-cheating guarantee for both upper and lower bounds.


FreeM is more robust than TimeM, but I understand that FreeM is not immune to cost cheating, right? If so, we should explicitly write a warning/caveat to the users so they aware some weaknesses. In particular, we should write how much trust do we require in this model so that the complexity is counted correctly.

This is fairly trivial. The scope for cheating is essentially "sneak in operations via pure". This is different from TimeM, where additionally the location and individual cost annotations can vary from place to place which can also cause problems. One must also ideally avoiding adding extraneous typeclass instances and choose Boolean propositions.

This is however one form of "cheating" that I am not too worried about anymore.

On the one hand, we do want pure operations when we don't care too much about their implementation details. This came up in my MPI talk.

I have two ideas on how to eliminate pure operations based issues with model-level lower bound proofs entirely, in addition to model/input hiding. Can elaborate on Zulip if asked.

sorrachai · 2026-05-06T09:48:45Z

+/-! # Merge Sort as a Query Program
+
+Merge sort implemented as a `FreeM (LEQuery α)`, making all comparison queries explicit.
+Uses an alternating split (odds/evens) to avoid needing `List.length` in the termination


Can you elaborate on why using List.length is bad in the termination argument?

The odd even split uses structural recursion instead of well founded recursion.

sorrachai · 2026-05-06T09:50:46Z

+/-- Merge two sorted lists using comparison queries. -/
+@[expose] def merge (xs ys : List α) : FreeM (LEQuery α) (List α) :=
+  match xs, ys with
+  | [], ys => pure ys


Suggested change

| [], ys => pure ys

| [], ys => pure ys

return ys has better stylistic reading. This applies to other similar lines.

sorrachai · 2026-05-06T09:58:59Z

+@[expose] def orderedInsert (x : α) : List α → FreeM (LEQuery α) (List α)
+  | [] => pure [x]
+  | y :: ys => do
+    let le ← LEQuery.ask x y
+    if le then
+      pure (x :: y :: ys)
+    else do
+      let rest ← orderedInsert x ys
+      pure (y :: rest)


Suggested change

@[expose] def orderedInsert (x : α) : List α → FreeM (LEQuery α) (List α)

| [] => pure [x]

| y :: ys => do

let le ← LEQuery.ask x y

if le then

pure (x :: y :: ys)

else do

let rest ← orderedInsert x ys

pure (y :: rest)

@[expose] def orderedInsert (x : α) : List α → FreeM (LEQuery α) (List α)

| [] => pure [x]

| y :: ys => do

let le ← LEQuery.ask x y

if le then

pure (x :: y :: ys)

else do

let rest ← orderedInsert x ys

pure (y :: rest)

Stylistic change suggestion:

@[expose] def orderedInsert (x : α) : List α → FreeM (LEQuery α) (List α)
| [] => return [x]
| y :: ys => do
let le ← LEQuery.ask x y
if le then
return (x :: y :: ys)
else do
let rest ← orderedInsert x ys
return (y :: rest)

Shreyas4991 and others added 30 commits February 26, 2026 06:28

Big PR

8be8d07

Author : Shreyas Srinivas Co-Author : Eric Wieser Co-Author : Tanner Duve

Fixed worst case statement

e50c8b0

Linarith

79d77de

More review fixes

c1e3323

More review fixes

ddab6f0

More review fixes

08decfa

More review fixes

a2b4782

More review fixes

54bb351

More review fixes

7ee16a0

More review fixes

cc806f0

Fix test file imports

a732ed8

More review fixes

2cee489

small golfs

3e7edf2

Update CslibTests/QueryModel/ProgExamples.lean

4789639

Co-authored-by: Eric Wieser <wieser.eric@gmail.com>

Add docstrings for test files

afcfd57

Merge branch 'query-final-squash' of github.com:Shreyas4991/cslib int…

30a7905

…o query-final-squash

simps in a tutorial example

4e3d80c

Suggested name change. Additionally add co-author list:

9f3df4d

Co-authored-by: Shreyas Srinivas <Shreyas4991@users.noreply.github.com> Co-authored-by: Eric Wieser <eric-wieser@users.noreply.github.com> Co-authored-by: Tanner Duve <tannerduve@gmail.com>

Merge branch 'main' of github.com:leanprover/cslib into query-final-s…

6db1fde

…quash

Fix lake shake issues

a9485da

Done

6fef51f

Switch to bool

f479c93

Lower bound

5e2a2f6

GPT generated lower bound

6b78316

Added module

4fab097

exe mk_all

8097c61

Minimize imports

87e7ded

Done

3f71048

GPT finished the proof for lists with nodup

57856b7

Got it for infinite types as well

53c2ef3

kim-em requested a review from chenson2018 as a code owner April 22, 2026 03:40

kim-em force-pushed the combined-query-complexity branch from 12a4d9f to b331f1b Compare April 22, 2026 03:40

kim-em force-pushed the combined-query-complexity branch from b331f1b to 0f75466 Compare April 22, 2026 03:49

kim-em and others added 3 commits April 22, 2026 03:55

docs(Query/Arith): clarify these are toy examples of parametrized costs

7327006

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Shrys <shreyasss94@gmail.com>

kim-em force-pushed the combined-query-complexity branch from 0f75466 to 7327006 Compare April 22, 2026 03:55

sgraf812 reviewed Apr 22, 2026

View reviewed changes

kim-em and others added 4 commits April 28, 2026 02:46

eric-wieser reviewed Apr 28, 2026

View reviewed changes

Comment thread Cslib/Algorithms/Lean/Query/FreeM.lean Outdated

eric-wieser reviewed Apr 28, 2026

View reviewed changes

docs(Query/FreeM): add field docstrings to Tally to satisfy docBlame …

2dc7d9f

…linter Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

eric-wieser reviewed Apr 28, 2026

View reviewed changes

sgraf812 reviewed Apr 28, 2026

View reviewed changes

Shreyas4991 reviewed Apr 28, 2026

View reviewed changes

drop Tally

391cab0

eric-wieser reviewed Apr 29, 2026

View reviewed changes

sorrachai reviewed May 6, 2026

View reviewed changes

		@[simp] theorem eval_orderedInsert_nil (oracle : {ι : Type} → LEQuery α ι → ι) (x : α) :
		(orderedInsert x ([] : List α)).eval oracle = [x] := by

Conversation

kim-em commented Mar 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

New files

Results

Uh oh!

Shreyas4991 commented Apr 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sgraf812 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Shreyas4991 commented Apr 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kim-em commented Apr 28, 2026

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sgraf812 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Shreyas4991 May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

kim-em commented Mar 5, 2026 •

edited

Loading

Shreyas4991 commented Apr 22, 2026 •

edited

Loading

Shreyas4991 commented Apr 27, 2026 •

edited

Loading

Shreyas4991 May 6, 2026 •

edited

Loading