This repository was archived by the owner on Apr 19, 2026. It is now read-only.

Regarding NER finetuning #103

Description

@malduwais

Dear Authors,

I am trying to reproduce the NER results on WikiANN. However, I do not get the same results, even though I am using the same hyperparameters for the XLM-R and mBERT models.

Is there anything I need to set other than the hyperparameters?

I am using the Trainer API as follows:

    from transformers import TrainingArguments

    # model_path and seed are defined elsewhere in the script.
    args = TrainingArguments(
        model_path,  # first positional argument is the output directory
        overwrite_output_dir=True,
        num_train_epochs=10,
        load_best_model_at_end=True,
        evaluation_strategy="epoch",
        save_strategy="epoch",
        report_to="wandb",
        seed=seed,
        data_seed=seed,
        per_device_train_batch_size=32,
        per_device_eval_batch_size=32,
        learning_rate=2e-5,
        weight_decay=0.01,
        warmup_ratio=0.1,
    )
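
For context, here is a rough sketch of how the settings above translate into an effective batch size and a warmup length. It is only an approximation of what the Trainer computes internally, not its actual code, and exact rounding may differ between library versions. The function name `schedule_summary`, the example count of 20,000, and the device counts are hypothetical and chosen for illustration only.

```python
import math


def schedule_summary(num_examples, per_device_batch_size=32, num_devices=1,
                     grad_accum_steps=1, num_epochs=10, warmup_ratio=0.1):
    """Approximate the optimizer schedule implied by the arguments above.

    Assumption: the last partial batch is kept (not dropped), and one
    optimizer step is taken every `grad_accum_steps` batches.
    """
    # Examples consumed per optimizer step across all devices.
    effective_batch = per_device_batch_size * num_devices * grad_accum_steps

    # Batches each device sees per epoch.
    batches_per_epoch = math.ceil(
        num_examples / (per_device_batch_size * num_devices)
    )
    steps_per_epoch = max(batches_per_epoch // grad_accum_steps, 1)

    total_steps = steps_per_epoch * num_epochs
    warmup_steps = math.ceil(total_steps * warmup_ratio)

    return {
        "effective_batch_size": effective_batch,
        "total_steps": total_steps,
        "warmup_steps": warmup_steps,
    }


# Hypothetical training-set size; substitute the real number of examples.
print(schedule_summary(20_000, num_devices=1))
print(schedule_summary(20_000, num_devices=4))
```

Because `per_device_train_batch_size` is applied per device, the same arguments give a different effective batch size, and so a different number of optimizer steps, on a different number of GPUs.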

Thank you,
