Reproducibility issues

Hello! I have been trying to benchmark RIPOR on another dataset, however I have encountered multiple issues. First of all, the pre-trained model cannot be used, to my knowledge, on other datasets than MSMARCO due to the fact that many of the model's scripts are hardcoded to use the MSMARCO document collection and queries. Training the model from scratch on another dataset has also proved infeasible, due to many of the bash scripts having hardcoded file paths that were not provided in the RIPOR_data Google drive. The README.md file does not contain anything other than surface-level instructions, and without a proper understanding of the internal code/file structure, it appears impossible to reproduce the experiment. 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reproducibility issues #9

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Reproducibility issues #9

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions