Skip to content

We select a new dataset for segmentation based on high image variety #10

Description

@andandandand

We check for near duplicates between train, test, and validation using the leaky splits functionality of FiftyOne

https://docs.voxel51.com/brain.html#leaky-splits

We use FiftyOne's uniqueness score to pick images with low chance of being near duplicates.

https://docs.voxel51.com/api/fiftyone.brain.internal.core.representativeness.html

The representativeness score can be used to select from these images.

https://docs.voxel51.com/api/fiftyone.brain.internal.core.representativeness.html

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions