This repository was archived by the owner on Aug 28, 2023. It is now read-only.

This repository was archived by the owner on Aug 28, 2023. It is now read-only.

How to quantize specific layers? #114

Open

Labels

opened

I have a trained onnx model that needs to be quantized to INT8. But I want my last fully connected layers are still in FP32 or FP16. So how can I choose specific layers to quantize (or not to quantize)?

PS when I was working with NNCF, I just use parametr ignored_scopes. Maybe is there something similar here at Workbench?

Metadata

Assignees

No one assigned

Labels

Type

No type

Fields

No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests