Pin Layout V5 default MLX pair for device evidence by RNT56 · Pull Request #14 · RNT56/pines

RNT56 · 2026-05-25T20:10:34Z

Summary

Pins Pines to the Layout V5 default MLX pair:
- mlx-swift 260c8fb16df772b8c20295529fde958fffb66369
- mlx-swift-lm 13d3b35a9f6207fbf342c40ff7ff77cd6f0b9b5e
Updates the runtime compatibility pair ID, Xcode package lock, generated project, and TurboQuant docs/metadata.
Records that Layout V5 is the default device-test layout while Layout V4 remains available for legacy/comparison runs.

Validation

bash scripts/ci/check-mlx-package-pins.sh
swift build --disable-automatic-resolution
swift test --disable-automatic-resolution (189 Swift Testing tests)
swift run --disable-automatic-resolution PinesCoreTestRunner
bash scripts/ci/run-xcode-validation.sh all
- iOS app build without signing passed
- iOS test build passed
- Pines unit smoke tests passed: 29 tests, 3 device-only skips
- Pines UI smoke tests passed: 7 tests
git diff --check

Hardware Gate

devicectl sees GBU-12, iPhone 15 Pro Max (iPhone16,2), but it is unavailable; no online iPhone-class hardware was available in this workspace. This PR intentionally does not activate Verified, Certified, Fast mode, snapshot restore, or adaptive precision claims.

RNT56 · 2026-05-25T20:29:27Z

Real-device TurboQuant smoke run completed on the physical iPhone target.

Device:

Name: GBU-12
Model: iPhone 15 Pro Max (iPhone16,2)
Device ID: 00008130-00041C6E2EB8001C
iOS: 26.5 (23F77)
Architecture: arm64

Command:

xcodebuild -project Pines.xcodeproj \
  -scheme Pines \
  -destination 'platform=iOS,id=00008130-00041C6E2EB8001C' \
  -derivedDataPath build/DerivedDataDevice \
  -skipMacroValidation \
  -skipPackagePluginValidation \
  -onlyUsePackageVersionsFromResolvedFile \
  -disableAutomaticPackageResolution \
  -scmProvider system \
  -allowProvisioningUpdates \
  ONLY_ACTIVE_ARCH=YES \
  '-only-testing:PinesTests/MLXTurboQuantRuntimeSmokeTests' \
  '-skip-testing:PinesUITests' \
  test

Result:

MLXTurboQuantRuntimeSmokeTests: 10 tests passed, 0 failed
Physical-device Metal codec path covered by testHighBitSeedMetalCodecRoundTripWhenAvailable
Fixed high-bit seed device path covered by testTurboQuantCacheUsesFixedHighBitSeedOnDevice
Result bundle: build/DerivedDataDevice/Logs/Test/Test-Pines-2026.05.25_22-24-29-+0200.xcresult

Follow-up committed in 32e9ca9: hosted PinesTests on the app target because physical iOS devices cannot run tool-hosted XCTest bundles.

Scope note: this is a targeted real-device smoke pass, not a full BenchmarkReport.v1 acceptance tuple. It should not promote any model to Verified/Certified by itself.

Use plain FP16 KV (faster, higher quality) whenever its uncompressed cache fits the live memory budget; fall to TurboQuant only to reach contexts that otherwise would not fit RAM — replacing the static min(ctx, 8192) plain-KV cap with a memory-feasibility decision. - MLXRuntimeBridge.kvCacheAdmission honors admission.recommendsPlainKVCache: returns plain FP16 at full admitted length (no 8K cap); conversion carries the flag through coreTurboQuantAdmission. - LocalRuntimeAdmissionService.admit: FP16-first ladder (fp16KVBytesPerToken; FP16-full → [.fastest: shorter FP16] → TurboQuant) + recommendsPlainKVCache on the PinesCore type. +7 ladder tests (206 PinesCoreTests pass). - Bump mlx-swift (aa4a071: cooperative coalesced QK decode, opt-in) and mlx-swift-lm (002ec99: recommendsPlainKVCache planner) pins. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

In-place KV-cache donation fix on append (was reallocating full-capacity buffers per token — audit 1.3 OOM suspect). 69 cache tests pass. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

4.1: fallback decode uses fp16 scratch + an OOM guard (recoverable instead of crash). 2A: mid-generation FP16->compressed spill under memory pressure (the missing dynamic half) — GenerateParameters.spillMemoryWatermarkBytes, default off (on-device tuning). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

Picks up the mlx-swift-lm fork tip carrying the turbo3 bit-metadata fix (5.5) and the new TurboQuantBench on-device A-series benchmark harness. Both are behavior-neutral / test-only for the app (pines uses its own scheme enum without turbo3; the app does not depend on the TurboQuantBench product), so this is a sync + manifest-resolution bump. Verified: full Pines app resolves mlx-swift-lm @ 3118d5b and builds green (xcodebuild, iOS Simulator, -skipPackagePluginValidation -skipMacroValidation). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

Captures the working-tree TurboQuant work (control-plane + evidence types, tests, and the turboquant-implementation docs/baselines) as a green checkpoint at the current MLX pin pair (mlx-swift 609e833 + mlx-swift-lm 725add5). PinesCore builds and all 227 PinesCore tests pass, including TurboQuantPinDriftTests. Note: the mlx-swift-lm pin bump to pick up the N2 self-speculation API (makeGenerationIterator + GenerateParameters.selfSpeculationMode) is intentionally NOT included here — it requires regenerating compatibility-pair.json via the validation harness (the evidence artifact must not be hand-edited), which needs the deferred A-series device run for full evidence. See the mlx-swift-lm overhaul handoff (N2 Pines section) for the exact pin-coordination sequence. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Documents that mlx-swift-lm 295e66b now exposes the N2 self-speculation product API (GenerateParameters.selfSpeculationMode + makeGenerationIterator, bit-exact, default-off) and mlx-swift adds the data-free Gaussian payload codec, and that adopting them requires advancing the MLX pin pair + regenerating compatibility- pair.json via the wave0 harness (not hand-edited) + the deferred A-series device run. Self-speculation ships default-off (inert until enabled + device-validated). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

RNT56 added 2 commits May 25, 2026 22:09

Pin Layout V5 default MLX pair

9204465

Host Pines unit tests for device runs

32e9ca9

RNT56 added 27 commits May 25, 2026 23:20

Enable TurboQuant runtime support for profile families

4cac4b0

Broaden TurboQuant model support in Pines

9d99174

Consume MLX TurboQuant capability registry

f9d6bcd

Consolidate artifacts UI components

ea6f700

Update artifacts surface contract

f2f3663

Fix TurboQuant model admission regressions

00bb2fa

Stabilize TurboQuant dense Qwen runtime support

c686b15

Stabilize quality-sensitive TurboQuant profiles

7f04583

Pin Pines to corrected TurboQuant profiles

73c5c76

Pin Pines to guarded TurboQuant pair

17d9b4a

Make iOS stress builds noninteractive

17fff9c

Pin Pines to guarded TurboQuant lower-bit runtime

a9a532e

Document latest TurboQuant device evidence

0a3883e

Route short local runs away from TurboQuant

4e2394b

Pin Qwen TurboQuant proof runtime

eca4cc3

Pin hardened Qwen TurboQuant runtime

fc40178

Pin runtime layout TurboQuant kernels

2a7f3c2

Pin TurboQuant Qwen production path

eebe884

Pin TurboQuant fused bit path proof

b317846

Promote fused TurboQuant proof pins

518a4f3

Pin grouped-query TurboQuant proof pair

4f50d0e

Pin optimized TurboQuant Mac proof pair

9448b11

Pin optimized TurboQuant Qwen proof pair

e05d855

Pin optimized TurboQuant tuning pair

fe470cd

Pin Layout V6 TurboQuant proof pair

82390e4

Pin fused-partial TurboQuant proof pair

f130935

Pin adaptive raw-first TurboQuant routing

bac0428

RNT56 and others added 14 commits May 30, 2026 09:43

Bump mlx-swift-lm pin to a6e9448 (3A-a in-place cache donation)

e7beefb

In-place KV-cache donation fix on append (was reallocating full-capacity buffers per token — audit 1.3 OOM suspect). 69 cache tests pass. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

Wire Pines TurboQuant evidence gates

1f3cbc4

Pin hybrid TurboQuant native diagnostics pair

679473b

Require real model TurboQuant evidence

54df79b

Pin affine K8 V4 MLX pair

7184d0e

Record affine K8 V4 compatibility app commit

c95ce29

Pin TurboQuant pipeline cleanup pair

d336b81

Record TurboQuant cleanup compatibility app commit

e712530

Document TurboQuant speed memory baseline

4722f57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pin Layout V5 default MLX pair for device evidence#14

Pin Layout V5 default MLX pair for device evidence#14
RNT56 wants to merge 43 commits into
tq/integration-pin-mlx-productionfrom
tq/real-device-evidence-acceptance

RNT56 commented May 25, 2026

Uh oh!

RNT56 commented May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

RNT56 commented May 25, 2026

Summary

Validation

Hardware Gate

Uh oh!

RNT56 commented May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant