Skip to content
This repository was archived by the owner on Aug 28, 2023. It is now read-only.
This repository was archived by the owner on Aug 28, 2023. It is now read-only.

How to quantize specific layers? #114

@anvarganiev

Description

@anvarganiev

I have a trained onnx model that needs to be quantized to INT8. But I want my last fully connected layers are still in FP32 or FP16. So how can I choose specific layers to quantize (or not to quantize)?

PS when I was working with NNCF, I just use parametr ignored_scopes. Maybe is there something similar here at Workbench?

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions