FEA Make `LedoitWolf` estimator array API compatible by betatim · Pull Request #33573 · scikit-learn/scikit-learn

betatim · 2026-03-18T14:53:58Z

Reference Issues/PRs

Came up as part of pondering #33564

What does this implement/fix? Explain your changes.

This makes LedoitWolf array API compatible. After this it might be easier to convert other covariance estimators as it also updates some of the basic infra shared by them

Todo

it might be worth doing ledoit_wolf (the function) in this PR as well

AI usage disclosure

I used AI assistance for:

Code generation (e.g., when writing an implementation or fixing a bug)
Test/benchmark generation
Documentation (including examples)
Research and understanding

Any other comments?

The tests were written as part of a TDD approach during development, they cover things that the common tests also cover. So removing them to reduce the amount of duplication.

In some cases it returns flaot64 for float32 input. There are users who rely on this behaviour.

bruAristimunha · 2026-03-19T15:01:20Z

Hey @betatim,

Sorry for dropping into the PR, but what you are working on is something super nice! At Pyriemann, with @agramfort and @qbarthelemy, we are using many of the covariance matrix implementations from scikit-learn and are also moving towards array API compatibility.

I just would like to offer some assistance with the covariance part api compatibility, if necessary.

bruAristimunha · 2026-03-19T15:02:24Z

I tried to find a macro issue to offer help, but I couldn't find a place that seemed appropriate.

Please feel free to delete my comment, given that it's off-topic for your PR.

betatim · 2026-03-19T15:30:09Z

I tried to find a macro issue to offer help, but I couldn't find a place that seemed appropriate.

Please feel free to delete my comment, given that it's off-topic for your PR.

No worries and welcome! It is fantastic to hear from someone using scikit-learn that says "this looks useful". I don't think there is a macro/mega issue related to this area of scikit-learn + array API support. I picked it more or less at random/as an exercise to see how much infrastructure would be missing.

I think if you are interested in contributing to converting estimators/tools from https://scikit-learn.org/stable/api/sklearn.covariance.html go for it! It could be that the first step towards that is helping review this PR or picking another estimator from the list and converting it. I'd have to think about it, it probably depends on how much interdependency there is.

Another thing that could be worth doing is creating an example for the gallery that shows off the array API support in LedoitWolf based on a real-world use-case.

I will create a mega-issue for sklearn.covariance so we can keep track of what is done, what needs doing, etc.

bruAristimunha

if you do this, should unlock too the empiral covariance

Co-authored-by: Bru <b.aristimunha@gmail.com>

Convert _oas() and OAS.fit() to use array API compatible operations, following the same pattern as LedoitWolf from PR scikit-learn#33573.

If we flag it as supporting array API all the classes that inherit from it also get marked as supporting array API. This doesn't work, so we need to do this at the end when all estimators support array API.

betatim · 2026-03-23T10:48:48Z

I undid the changes to EmpiricalCovariance again. By marking this class as supporting array API classes that inherit from it also get marked with support. So we need to tackle this at the end when everyone else has array API support.

bruAristimunha · 2026-03-23T11:04:19Z

I can create a separate PR for this, if you want, @betatim. I fixed in my draft PR, small tricks are necessary for empirical covariance.

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

betatim · 2026-05-18T14:21:27Z

I've addressed all the comments, let me know what y'all think. Maybe we could get this wrapped up in time for the release?

ogrisel

LGTM besides the following:

ogrisel · 2026-05-19T16:21:34Z

I ran the tests of this PR with DPNP on an Intel GPU with float32-only support and there are two failures barely above the rtol threshold for float64 inputs:

https://github.com/probabl-ai/scikit-learn-intel-workflow/actions/runs/26148891699/job/76911036653

I think we should bump it up a bit (e.g. 1e-6 instead of 1e-7).

ogrisel · 2026-05-20T14:13:23Z

@cakedev0 Actually there is something I don't understand in the above. Does your float32-only GPU:

How is the case the following does not fail if the GPU is float32 only xp.asarray(np.ones(3, dtype=np.float64, device="gpu").

Also: I would have expected _array_api_for_tests to have skipped that particular test case (dpnp-gpu-float64 on a float32-only GPU host) entirely.

cakedev0 · 2026-05-20T14:33:30Z

Not sure to understand exactly your message but, on my laptop (float32-only GPU):

>>> dpnp.asarray(np.ones(3, dtype=np.float64), device="gpu")
array([1., 1., 1.], dtype=float32)

And this test should definitely have been skipped, let's not bump the rtol but make sure it's skipped instead (I'm looking into that)

cakedev0

Let's update the usage of _array_api_for_tests to skip tests for which the device doesn't support the dtype.

Co-authored-by: Arthur Lacote <arthur.lcte@gmail.com>

ogrisel · 2026-05-28T13:28:10Z

New Intel GPU workflow: https://github.com/probabl-ai/scikit-learn-intel-workflow/actions/runs/26577559267

EDIT: It's green!

Let me fix the linter.

ogrisel · 2026-05-28T14:19:30Z

@betatim this PR should be ready to merge once the following is addressed: #33573 (review)

betatim · 2026-05-29T15:34:44Z

I fixed the use of _convert_to_numpy. GitHub says that @cakedev0 requested changes, but I can't find what/where? Has that been resolved already or is there a change you'd like to see?

cakedev0

This should discard the "request changes" review. (I'll avoid that next time)

betatim · 2026-06-01T08:15:16Z

Yeah, my impression is that the "request changes" feature isn't super useful for a project like scikit-learn. It ends up making it more complicated to merge things and I don't think I've ever experienced it where someone's feedback was ignored/not worked on because they "only" left it as a comment/normal review feedback. The social fabric is strong here :D

betatim · 2026-06-01T08:16:04Z

@OmarManzoor and @ogrisel you both left 👍 , do you want to take a last look and then press merge? Or should we attract some more reviews?

ogrisel · 2026-06-01T08:27:38Z

Merged! Thanks all!

betatim added 3 commits March 17, 2026 18:08

Make LedoitWolf estimator array API compatible

419248e

Reduce duplication in the tests

7058c80

The tests were written as part of a TDD approach during development, they cover things that the common tests also cover. So removing them to reduce the amount of duplication.

Add to docs

f8f65ff

betatim added the CUDA CI label Mar 18, 2026

github-actions Bot added module:covariance and removed CUDA CI labels Mar 18, 2026

betatim added 2 commits March 18, 2026 15:55

Whats new

bd405d8

Preserve old dtype behaviour in empirical_covariance

33aa780

In some cases it returns flaot64 for float32 input. There are users who rely on this behaviour.

betatim mentioned this pull request Mar 19, 2026

Adding array API support to sklearn.covariance #33584

Open

14 tasks

betatim added the CUDA CI label Mar 19, 2026

github-actions Bot removed the CUDA CI label Mar 19, 2026

bruAristimunha reviewed Mar 20, 2026

View reviewed changes

Comment thread sklearn/covariance/_empirical_covariance.py

Comment thread sklearn/covariance/_empirical_covariance.py

betatim and others added 4 commits March 20, 2026 14:14

Add test cases for edge cases

f98ca92

Merge remote-tracking branch 'upstream/main' into array-api-ledoitwolf

906dee3

Update tests to use new convention

74b02eb

Enable array API support for EmpiricalCovariance

9c985ae

Co-authored-by: Bru <b.aristimunha@gmail.com>

github-actions Bot added the CI:Linter failure The linter CI is failing on this PR label Mar 20, 2026

Fix lint

0dbe455

github-actions Bot removed the CI:Linter failure The linter CI is failing on this PR label Mar 20, 2026

bruAristimunha mentioned this pull request Mar 22, 2026

[WIP] Allowing the array api to work with covariance OAS #33600

Open

4 tasks

Undo array API support flag for EmpiricalCovariance

fe345dd

If we flag it as supporting array API all the classes that inherit from it also get marked as supporting array API. This doesn't work, so we need to do this at the end when all estimators support array API.

betatim marked this pull request as ready for review March 23, 2026 12:14

betatim and others added 4 commits May 11, 2026 15:30

Apply suggestion from @ogrisel

bcb6b95

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

Merge remote-tracking branch 'upstream/main' into array-api-ledoitwolf

39fa891

Document float32 behaviour

0ceccf0

Cross reference issue discussing automatic chunking

00a06b3

ogrisel added the CUDA CI label May 19, 2026

github-actions Bot removed the CUDA CI label May 19, 2026

ogrisel approved these changes May 19, 2026

View reviewed changes

Comment thread sklearn/covariance/tests/test_covariance.py Outdated

cakedev0 suggested changes May 20, 2026

View reviewed changes

Apply suggestions from code review

6bcfe73

Co-authored-by: Arthur Lacote <arthur.lcte@gmail.com>

github-actions Bot added the CI:Linter failure The linter CI is failing on this PR label May 28, 2026

Fix ruff linting

a37e60d

github-actions Bot removed the CI:Linter failure The linter CI is failing on this PR label May 28, 2026

Use move_to not _convert_to_numpy

3240cc5

betatim added the CUDA CI label May 29, 2026

github-actions Bot removed the CUDA CI label May 29, 2026

cakedev0 approved these changes May 29, 2026

View reviewed changes

ogrisel merged commit 3b37225 into scikit-learn:main Jun 1, 2026
43 checks passed

github-project-automation Bot moved this from In Progress to Done in Array API Jun 1, 2026

betatim deleted the array-api-ledoitwolf branch June 1, 2026 08:30

jeremiedbb mentioned this pull request Jun 1, 2026

Release 1.9.0 #34173

Merged

16 tasks

Uh oh!

Conversation

betatim commented Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Todo

AI usage disclosure

Any other comments?

Uh oh!

bruAristimunha commented Mar 19, 2026

Uh oh!

bruAristimunha commented Mar 19, 2026

Uh oh!

betatim commented Mar 19, 2026

Uh oh!

bruAristimunha left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

betatim commented Mar 23, 2026

Uh oh!

bruAristimunha commented Mar 23, 2026

Uh oh!

betatim commented May 18, 2026

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ogrisel commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ogrisel commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cakedev0 commented May 20, 2026

Uh oh!

cakedev0 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ogrisel commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ogrisel commented May 28, 2026

Uh oh!

betatim commented May 29, 2026

Uh oh!

cakedev0 left a comment

Choose a reason for hiding this comment

Uh oh!

betatim commented Jun 1, 2026

Uh oh!

betatim commented Jun 1, 2026

Uh oh!

Uh oh!

ogrisel commented Jun 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

betatim commented Mar 18, 2026 •

edited

Loading

ogrisel commented May 19, 2026 •

edited

Loading

ogrisel commented May 20, 2026 •

edited

Loading

ogrisel commented May 28, 2026 •

edited

Loading