Skip to content
This repository was archived by the owner on Oct 31, 2023. It is now read-only.
This repository was archived by the owner on Oct 31, 2023. It is now read-only.

dumping embeddings for closed-set experiments #4

Description

@joocjun

Hello!

I've been trying to dump the embeddings for the closed-set experiments. Unfortunately except for enwiki-0 corpus, which has a 0_valid file, other corpora seem to have a problem with the dumping process. The problem seems to occur while collating the 'is_valid' attribute in the datamodule part(dimension errors). If I'm not mistaken this particular line seems to be the source of trouble.

is_valid = [i for i, _id in enumerate(input_ids) if _id not in [0, 2]]

The non-uniform lengths generated for each instance seem to be causing a dimension error.

At your earliest convenience, could check whether this is a valid error to be fixed??

Thank you

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions