Investigate replacing Mistral with the Gemma model #47

@disconsented

Description

Why?

  • Smaller memory footprint
  • Possibly faster inference
  • Could allow either a smaller model on disk or a more accurate one

https://deepmind.google/models/gemma/gemma-4/

https://github.com/huggingface/candle/blob/main/candle-examples/examples/gemma4/main.rs
