creation functions according to NEP 35 and updated `*_like` behavior #1669

keewis · 2022-11-28T16:49:06Z

Closes Handle array creation functions such as np.asanyarray #1406, closes Implement np.array/np.asarray #1037, closes Expected behavior of np.*_like functions #882
Executed pre-commit run --all-files with no errors
The change is fully covered by automated unit tests
Documented in docs/ as appropriate
Added an entry to the CHANGES file

For a more consistent experience with the *_like functions and the functions from NEP 35 (introduced in numpy=1.20, minimum version is 1.19.5 but could be 1.20 according to NEP 29), this changes the semantics to:

have full_like return the units of fill_value (a non-quantity fill_value results an array of the type of the magnitude):

full_like(arr, fill_value=q) == Quantity(full_like(arr, fill_value=q.magnitude), q.units)

have empty_like use the units of the argument:

empty_like(q) == Quantity(empty_like(q.magnitude), q.units)

have ones_like and zero_like a quantity with the units of the argument:

ones_like(q) == full_like(q, fill_value=Quantity(1, q.units))
zeros_like(q) == full_like(q, fill_value=Quantity(0, q.units))

support the NEP 35 keyword like for creation functions:

np.array(arr, like=q) == Quantity(np.array(arr), q.units)
np.arange(10, like=q) == Quantity(np.arange(10, like=q.magnitude), q.units)
...

I'm not sure I got all of the functions that were changed by NEP 35, I couldn't find a comprehensive list. The ones I found are:

array
asarray
asanyarray
arange
ones
zeros
empty
full
identity
eye

but I'm almost certain I missed some (linspace does not seem to be included in that list, unfortunately).

The definition of all of them except full and arange is pretty simple: just use the units of the like argument.

For full, I chose to go with the unit of the fill_value or dimensionless if the fill value happens not to be a quantity (because by passing like we expect a quantity). For arange, we can go the "units of like" route, but we can also support something like

np.arange(1 * ureg.kg, 5 * ureg.kg, 100 * ureg.g, like=[0] * ureg.kg)

(the like=[0] * ureg.kg is important for dispatching, unfortunately)

I might be missing something, though, so I'd appreciate someone checking this.

cc @jthielen

jthielen · 2022-11-30T21:33:59Z

The changes here look good to me, and as far as I'm aware, that's the same list of functions that I'm aware of that'd be covered by NEP 35. So I'd say go with it, and if something a user needs happens to have been missed, it could be added later?

Speaking of adding later, though, would you be able to add code comments (and modify the existing ones that begin with # Make full_like by multiplying) clarifying the approaches taken here, so that it is easier for future contributors to these sections of code? For example, the code pattern in https://github.com/hgrecco/pint/pull/1669/files#diff-3902d8460bf42f38121b360a0cf397597567077c08aa6d0a565a65332290395cR953-R967 seems unnatural on first glance, and so could use comments to clarify that first you're doing the "automatic" implementations, then the custom ones. Also, while it is still an unchecked checklist item (at time of writing comment), I think these nuances of array creation could benefit from some direct documentation in the NumPy page of the docs.

jules-ch · 2022-11-30T21:49:49Z

Seems like some tests with xarray are making CI fail.
Do we know what's triggering this ?

keewis · 2022-12-19T22:36:58Z

thanks, @jthielen, those are some good points. I've tried to address them in the most recent commits, though I suspect to be really useful we'd need to rewrite / restructure the numpy guide: right now, the implementation details are condensed into a single paragraph, which doesn't make it easy to search for a specific type of function, and we're also missing clear examples. Instead, I suggest having one section per function category. If nobody objects, I'll try to do that in a new PR (not sure when I get to that, though).

@jules-ch: no idea. Might be because of changes in this PR, I'll have to investigate.

keewis · 2022-12-20T15:22:51Z

Seems like some tests with xarray are making CI fail. Do we know what's triggering this ?

It seems xarray is using zeros_like in isnull to construct a all-False array for dtypes that are not nullable, and because we changed zeros_like to return a quantity this has stopped working.

I'd argue that this is implicit behavior in xarray (False just so happens to be represented as 0), and that it would be much better to use full_like in this case:

zeros_like(data, dtype=bool)  # old
full_like(data, dtype=bool, fill_value=False)  # new

However, even though we can fix this xarray very quickly we'd basically have to require the newest version of xarray for use with a new release of pint. So I'm not sure how to best handle this: should we update xarray, and wait 6 months before merging this PR? Or is it fine to just merge and require people to update xarray?

keewis · 2022-12-20T15:25:12Z

another option would be to avoid wrapping bool dtypes, but that might make the code much more complicated

This is important because `dask` chose not to implement `subok`.

keewis · 2022-12-21T22:59:53Z

this has become tricky... see pydata/xarray#7395 for more details.

The most recent commit fixes a bug in full_like when called on a pint quantity wrapping a dask array (dask chose not to implement subok), which prevents xarray from merging my PR and releasing a new version compatible with the new behavior of *_like. However, we need that version to be able to change the behavior of *_like without breaking xarray's implementation of isnull on non-nullable dtypes (all except object, float, and complex).

To get out of this dead-lock, I think we need to somehow introduce a way to switch between old and new behavior of *_like (a new registry option?). This would allow us to properly deprecate the old behavior: the next version would ship with deprecation warnings for the old behavior of *_like, the version after that would switch the behavior but allow switching back to the old behavior, and some versions later we'd remove the switch and the old code. In general, though, I think we can keep the deprecation cycle short since we warned about the behavior of *_like being experimental.

(and of course, if someone has a better idea I'd be open to that as well)

andrewgsavage · 2024-05-26T20:07:43Z

t he next version would ship with deprecation warnings for the old behavior of *_like

Do you want to add a depreciation warning in another PR?

keewis added 18 commits November 23, 2022 14:18

implement nep35

520bbea

hook nep 35 into Quantity

9d4b4cb

more nep 35 functions

2776a0a

tests for *_like

2b81f82

change the behavior of ones_like, zeros_like, and empty_like

e5d272a

refactor the full_like implementation

0e97b83

tests for the nep35 functions

f4f908a

also implement ones, zeros, and empty

5220226

add comments detailing why a minimum version is required

50126c3

define the behavior and a test for full

9327a0f

implement nep35 support for full

032c560

pass the magnitude to like

19b09b1

make sure the arguments have consistent units

af31586

require numpy>=1.20.0 for NEP35

c906a30

support quantities as arguments to arange

bdb6f36

fix the skip

9ac0e3e

implement arange to support quantities for start, stop, step

555b796

actually append the additional functions

d88d06a

keewis added 5 commits December 19, 2022 22:52

use a decorator to do the functon registration

5cfad02

remove the outdated comment

61ba8a3

Merge branch 'master' into creation-like

4b0ce83

fix a few typos

42df5ee

add some documentation on the implementation details of the changes

2d591a1

replace unused parameters to full_like with **kwargs

78700de

This is important because `dask` chose not to implement `subok`.

keewis mentioned this pull request Apr 14, 2023

use **kwargs to pass along additional arguments to ones_like #1741

Merged

4 tasks

hgrecco added the In progress label Apr 25, 2023

hgrecco force-pushed the master branch from ce185c2 to 5643c32 Compare May 2, 2023 21:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

creation functions according to NEP 35 and updated `*_like` behavior #1669

creation functions according to NEP 35 and updated `*_like` behavior #1669

keewis commented Nov 28, 2022 •

edited

jthielen commented Nov 30, 2022

jules-ch commented Nov 30, 2022

keewis commented Dec 19, 2022

keewis commented Dec 20, 2022 •

edited

keewis commented Dec 20, 2022

keewis commented Dec 21, 2022 •

edited

andrewgsavage commented May 26, 2024

creation functions according to NEP 35 and updated *_like behavior #1669

Are you sure you want to change the base?

creation functions according to NEP 35 and updated *_like behavior #1669

Conversation

keewis commented Nov 28, 2022 • edited

jthielen commented Nov 30, 2022

jules-ch commented Nov 30, 2022

keewis commented Dec 19, 2022

keewis commented Dec 20, 2022 • edited

keewis commented Dec 20, 2022

keewis commented Dec 21, 2022 • edited

andrewgsavage commented May 26, 2024

creation functions according to NEP 35 and updated `*_like` behavior #1669

creation functions according to NEP 35 and updated `*_like` behavior #1669

keewis commented Nov 28, 2022 •

edited

keewis commented Dec 20, 2022 •

edited

keewis commented Dec 21, 2022 •

edited