Skip to content

Reproducibility issues #9

Description

@carlavintila

Hello! I have been trying to benchmark RIPOR on another dataset, however I have encountered multiple issues. First of all, the pre-trained model cannot be used, to my knowledge, on other datasets than MSMARCO due to the fact that many of the model's scripts are hardcoded to use the MSMARCO document collection and queries. Training the model from scratch on another dataset has also proved infeasible, due to many of the bash scripts having hardcoded file paths that were not provided in the RIPOR_data Google drive. The README.md file does not contain anything other than surface-level instructions, and without a proper understanding of the internal code/file structure, it appears impossible to reproduce the experiment.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions