MAINT: Work with unicode strings in `dtype('i8,i8')` by eric-wieser · Pull Request #15261 · numpy/numpy

eric-wieser · 2020-01-06T16:42:07Z

Right now, we convert `str` objects to `bytes`, and then work with those.

Since this is a human convenience API, the input really ought to be a string.
A future patch will suggest deprecating `dtype(b'i8,i8')`, but for now it will continue to work.

Should fix the CI failures in #15249
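For readers unfamiliar with the comma-string form this PR touches, here is a minimal sketch of what `dtype('i8,i8')` produces from the Python side. It only uses the public `np.dtype` constructor and does not reflect the C-level changes in `descriptor.c`; the variable names are illustrative.

```python
import numpy as np

# A comma-separated string describes a structured dtype with
# automatically named fields f0, f1, ...
dt = np.dtype('i8,i8')

print(dt.names)     # ('f0', 'f1')
print(dt.itemsize)  # 16 (two 8-byte integers)

# The same dtype spelled out explicitly as a list of (name, format) pairs.
explicit = np.dtype([('f0', 'i8'), ('f1', 'i8')])
print(dt == explicit)  # True
```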

eric-wieser · 2020-01-06T16:42:35Z

numpy/core/src/multiarray/descriptor.c

This is what motivated #15254

seberg

LGTM, nice. I think in `conversion_utils.c` there is another bunch of these where the logic should be reversed like here. I think this can change the error type in very weird cases, but it seems `Unicode*Error` inherits from `ValueError`, so nothing really changes, and quite honestly those would be very weird cases anyway.
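The inheritance point can be checked in plain Python. This is a small sketch, independent of NumPy, showing that code catching `ValueError` also catches the Unicode encoding and decoding errors; the helper name `ascii_encode_error` is hypothetical.

```python
# The Unicode error classes all derive from ValueError via UnicodeError.
print(issubclass(UnicodeEncodeError, ValueError))  # True
print(issubclass(UnicodeDecodeError, ValueError))  # True


def ascii_encode_error(text):
    """Return the exception raised by ASCII-encoding text, or None."""
    try:
        text.encode("ascii")
    except ValueError as exc:  # also catches UnicodeEncodeError
        return exc
    return None


err = ascii_encode_error("ä")
print(type(err).__name__)  # UnicodeEncodeError
```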

charris · 2020-01-08T00:16:45Z

Needs a rebase.

eric-wieser · 2020-01-10T19:36:35Z

Will rebase on #15310 once that goes in

mattip · 2020-01-11T19:15:27Z

Thanks @eric-wieser

eric-wieser · 2020-01-11T19:52:22Z

@seberg: Mind elaborating on that with a link to a line?

seberg · 2020-01-12T00:00:40Z

I think I was thinking of invalid unicode input, but of course the error type doesn't actually change. The error message for `np.dtype("ä")` just improved a bit.

eric-wieser · 2020-01-12T00:15:29Z

> I think in `conversion_utils.c` there is another bunch of these

I meant regarding this, sorry for being unclear

seberg · 2020-01-12T00:29:34Z

Ah, I meant this type of thing:
https://github.com/numpy/numpy/blob/master/numpy/core/src/multiarray/conversion_utils.c#L604-L611

I think most converters that accept strings will convert unicode to ascii and then make a recursive call.
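The pattern described here, and the reversal this PR applies to the dtype string parser, can be sketched in Python. This is an illustration only: the real converters are C functions, and `order_converter_old`, `order_converter_new`, and the `_ORDERS` table are hypothetical stand-ins rather than NumPy APIs.

```python
# Hypothetical lookup table standing in for a C converter's switch statement.
_ORDERS = {"C": "C_ORDER", "F": "FORTRAN_ORDER", "A": "ANY_ORDER", "K": "KEEP_ORDER"}


def order_converter_old(obj):
    """Old pattern: bytes is the working type; str is encoded, then re-dispatched."""
    if isinstance(obj, str):
        # Encode to ASCII and make a recursive call with the bytes object.
        return order_converter_old(obj.encode("ascii"))
    if not isinstance(obj, bytes):
        raise TypeError("order must be str or bytes")
    key = obj[:1].upper().decode("ascii")
    if key not in _ORDERS:
        raise ValueError("order not understood")
    return _ORDERS[key]


def order_converter_new(obj):
    """Reversed pattern: str is the working type; bytes is decoded, then re-dispatched."""
    if isinstance(obj, bytes):
        return order_converter_new(obj.decode("ascii"))
    if not isinstance(obj, str):
        raise TypeError("order must be str or bytes")
    key = obj[:1].upper()
    if key not in _ORDERS:
        # Working with str lets the message quote the input directly.
        raise ValueError(f"order {obj!r} not understood")
    return _ORDERS[key]
```

Both versions accept the same inputs; the difference is which type the main body works with, which is the part the comment above suggests reversing.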

eric-wieser · 2020-04-21T09:14:34Z

@seberg: Looks like I ended up addressing that in #16008.

eric-wieser added the 03 - Maintenance and component: numpy.dtype labels Jan 6, 2020

eric-wieser force-pushed the unicode-commastring branch from f9efee7 to 03dfab7 Compare January 6, 2020 17:05

eric-wieser added the 25 - WIP label Jan 6, 2020

seberg approved these changes Jan 6, 2020

seberg mentioned this pull request Jan 7, 2020

MAINT: Tidy PyArray_DescrConverter #15265

Merged

eric-wieser force-pushed the unicode-commastring branch from 03dfab7 to 21f012c Compare January 11, 2020 11:15

eric-wieser removed the 25 - WIP label Jan 11, 2020

eric-wieser force-pushed the unicode-commastring branch from 21f012c to e83bd46 Compare January 11, 2020 11:18

mattip merged commit c9fd0e7 into numpy:master Jan 11, 2020

eric-wieser mentioned this pull request Jan 12, 2020

MAINT: Eliminate some calls to eval #15249

Merged

eric-wieser mentioned this pull request Jan 12, 2020

Clean up internal callers of PyUnicode_AsASCIIString #15317

Open

mattip merged 1 commit into numpy:master from eric-wieser:unicode-commastring
