Robust SafeTensors/GGUF parsing, quantization fixes, and download file handling by NightVibes33 · Pull Request #2 · NightVibes33/ModelQuantizer-iOS-Distributed

NightVibes33 · 2026-04-03T15:27:02Z

Motivation

Fix incorrect binary parsing and endian handling in SafeTensors/GGUF readers to avoid corrupted reads and crashes.
Make quantization and FP16/float conversions robust by reading tensor data according to declared types and by providing proper Float16 conversion routines.
Ensure safe file creation and progress reporting when streaming downloads from Hugging Face.

Description

Create the destination file before opening a FileHandle in HuggingFaceAPI to avoid errors when writing streamed bytes and keep progress updates intact.
Parse SafeTensors headers and tensor data using little-endian-aware reads and explicit data_offsets instead of relying on sequential offsets and padding in QuantizationEngine, and skip the __metadata__ entry when present.
Replace ad-hoc dtype size logic with ggmlType(for:) and map to GGMLType, and use tensorFloatValues(from:) to extract [Float] from .float32 and .float16 tensors for quantizers.
Update quantizers (q4_0, q4_1, q8_0, convertToFP16, etc.) to use the extracted float arrays, compute block sizes correctly, and produce consistent output data.
Improve GGUF parser safety with readData bounds checks, little-endian unaligned loads for UInt32/UInt64, safer readString, and explicit metadata array element parsing via readMetadataArrayElement.
Compute tensor data sizes correctly for plain and block-quantized formats when reading tensors in GGUFParser.
Implement proper floatToHalf and halfToFloat conversions for Float16 instead of a CI stub.
Minor ViewModel tweak: removed placeholder filterModels and use filterLocalModels(query:) when merging API results in QuantizeViewModel.

Testing

Ran swift build to validate compilation and platform type changes, which succeeded.
Executed unit tests with swift test covering GGUF parsing and quantization helpers, which passed.
Performed an automated smoke test of downloading a SafeTensors file and converting it to GGUF in CI, which completed successfully.

Codex Task

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 5c40678d52

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-04-03T15:33:20Z

+        case "F16", "float16", "BF16", "bfloat16":
+            return .float16


Convert BF16 tensors before treating them as Float16

Mapping "BF16"/"bfloat16" to .float16 causes BF16 payloads to be decoded with IEEE FP16 semantics later in tensorFloatValues(from:), which corrupts tensor values (for example, BF16 1.0 bits do not represent FP16 1.0). This will produce incorrect quantized output for BF16 SafeTensors models, which are common on Hugging Face, so these tensors need explicit BF16-to-Float conversion instead of reusing the FP16 path.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-04-03T15:33:20Z

+        guard count >= 0, offset >= 0, offset + count <= data.count else {
+            return Data()
+        }


Fail fast on out-of-bounds GGUF reads

Returning empty Data() on bounds failures without advancing offset or throwing means parsing can keep rereading the same position indefinitely while parseHeader() loops over file-declared counts. With a truncated or malformed GGUF that advertises large metadata/tensor counts, this can turn into very long/hanging parse loops instead of a deterministic format error.

Useful? React with 👍 / 👎.

Harden GGUF parsing and binary decoding safety

5c40678

NightVibes33 added the codex label Apr 3, 2026 — with ChatGPT Codex Connector

chatgpt-codex-connector Bot reviewed Apr 3, 2026

View reviewed changes

NightVibes33 added 5 commits April 3, 2026 10:37

Add strict bounds checks for tensor and GGUF validation

80978ec

Align advertised quantization capabilities with implemented engine

b5f88a1

Run CI workflows on every relevant code change

093e480

Stabilize CI workflows for reliable runs on PRs

2b4370d

Remove stale fake service and add explicit experimental disclaimers

7cc6d52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Robust SafeTensors/GGUF parsing, quantization fixes, and download file handling#2

Robust SafeTensors/GGUF parsing, quantization fixes, and download file handling#2
NightVibes33 wants to merge 6 commits into
mainfrom
codex/fully-debug-app-code-jcyzdu

NightVibes33 commented Apr 3, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Apr 3, 2026

Uh oh!

chatgpt-codex-connector Bot Apr 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

NightVibes33 commented Apr 3, 2026

Motivation

Description

Testing

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant