Polars: py-0.18.14 Release

Release date: August 14, 2023
Previous version: py-0.18.13 (released August 7, 2023)
Magnitude: 7,297 Diff Delta
Contributors: 15 total committers

51 Commits in this Release

Top Contributors in py-0.18.14

orlp
ritchie46
c-peters
stinodego
alexander-beedie
ion-elgreco
lfn3
zundertj
reswqa
MarcoGorelli

Release Notes Published

πŸ† Highlights

  • Native implementation of dataframe interchange protocol (#10267)

⚠️ Deprecations

  • Deprecate behavior of list/tuple inputs for lit (#10461)

🚀 Performance improvements

  • optimise retrieval of values from df.item (~4-5x speedup) (#10411)
  • pre-alloc int_ranges (#10399)
  • use hash as CSE Identifier (#10385)

✨ Enhancements

  • quote style option for csv writer (#10422)
  • add "raise_if_empty" flag to read_excel, read_csv, scan_csv, and read_csv_batched (#10409)
  • add use_earliest to to_datetime / strptime (#10426)
  • add new "header_format" option for write_excel (#10392)
  • {any/all}_horizontal to expression architecture (#10412)
  • Native implementation of dataframe interchange protocol (#10267)
  • allow unaligned pointers in arrow FFI (#10403)
  • add line_terminator option to write_csv (#10373)
  • add explicit selector variants for signed/unsigned integers (#10384)
  • Add is_local and to_local to categorical namespace (#10372)
  • enhance selectors expansion function, so it can operate on a schema as well as a frame (#10341)
  • Order percentiles in describe (#10378)
  • cse for groupby.agg and reduced cse collisions (#10381)
  • improve take_every(0) exception (#10352)
  • add offset and length to get_ptr (#10361)

🐞 Bug fixes

  • fix pyarrow write_to_dataset wrt check_not_directory parameter (#10471)
  • fix negative decimal parsing (#10444)
  • ensure sorted_sink hash equals the default path (#10464)
  • address inconsistency in init from square numpy arrays with/without an explicit schema (#10445)
  • ensure last aggregation deals with default chunk (#10453)
  • fix cse input schema (#10450)
  • Fix by argument handling in join_asof (#10447)
  • fix potential OverflowError in testing asserts with huge UInt64 diffs (#10437)
  • Create delta compatible schema during writing (#10165)
  • fix list groupby of array dtype (#10408)
  • correct AnyValue::hash (#10391)
  • finalize cast in partitioned groupby (#10359)

πŸ› οΈ Other improvements

  • add vertical_relaxed example for pl.concat (#10472)
  • Run all streaming tests on the same test runner (#10469)
  • Organize OOC tests (#10463)
  • add test for 10417 (#10420)
  • Clean up some Sphinx settings (#10400)
  • add test for 10401 (#10405)
  • Address Ruff per file ignores (#10258)
  • Small improvement for PySeries.get_buffer (#10363)

Thank you to all our contributors for making this release possible! @MarcoGorelli, @OndrejSlamecka, @alexander-beedie, @c-peters, @cmdlineluser, @drgif, @ion-elgreco, @lfn3, @orlp, @potzenhotz, @rea1bacon, @reswqa, @ritchie46, @stinodego and @zundertj