Home » Oreps » Polars » All Releases » py-0.20.20 Release

pola-rs/polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Watch

Basic Details Issues Addressed Top Contributors Directory Browser Release Notes

Polars py-0.20.20

Polars: py-0.20.20 Release

Release date:

April 13, 2024

Previous version:

py-0.20.19 (released April 8, 2024)

Magnitude:

4,599 Diff Delta

Contributors:

20 total committers

Data confidence:

Commits:

50 Commits in this Release

Ordered by the degree to which they evolved the repo in this version.

feat: Extended `BytecodeParser` to handle additional math functions, and imports from the global namespace (#15627)

Authored April 13, 2024

perf: Refactor CSV serialization to not go thorough `AnyValue` (#15576)

Authored April 12, 2024

feat: add Expr.dt.add_business_days and Series.dt.add_business_days (#15595)

Authored April 13, 2024

perf: join by row-encoding (#15559)

Authored April 10, 2024

feat: Add `str.head` and `str.tail` (#14425)

Authored April 13, 2024

fix: Recompute RowIndex schema after projection pd (#15625)

Authored April 13, 2024

feat(rust, python): Add `null_on_oob` parameter to `expr.array.get` (#15426)

Authored April 10, 2024

docs: Various minor updates to User Guide's SQL intro section (#15557)

Authored April 9, 2024

feat: Expressify `to_integer` (#15604)

Authored April 12, 2024

feat: add holidays argument to business_day_count (#15580)

Authored April 12, 2024

depr(python) Deprecate the `offset` argument in `dt.round`, `dt.truncate`, and `DataFrame.upsample` (#15478)

Authored April 9, 2024

feat: support weekend argument in business_day_count (#15544)

Authored April 10, 2024

fix: Return appropriate data type for time `mean` and `median` (#14471)

Authored April 13, 2024

feat: Optimizer; remove double SORT and redundant projections (#15573)

Authored April 10, 2024

perf(rust): read_ipc memory usage tests, and writing fix (#15599)

Authored April 12, 2024

fix: Output correct dtype for `mean_horizontal` on a single column (#15118)

Authored April 13, 2024

fix(rust): Support index upsampling (#13621)

Authored April 13, 2024

fix: Turn off cse if cache node found (#15554)

Authored April 9, 2024

fix: Explode list should take validity into account (#15572)

Authored April 12, 2024

docs: Add legacy CPU install instructions in user guide (#13676)

Authored April 13, 2024

perf: Remove extra thread spawn from row group fetcher (#15626)

Authored April 13, 2024

perf: don't use dynamic dispatch in visitors (#15607)

Authored April 12, 2024

perf: Fix binview growable complexity O(n*m) -> O(n) (#15628)

Authored April 13, 2024

fix: Handle quoted identifiers when registering CTEs in the SQL engine (#15564)

Authored April 10, 2024

chore: use bound api (#15630)

Authored April 13, 2024

fix: Fix elementwise-apply if any input is `AggregatedScalar` (#15606)

Authored April 12, 2024

chore(rust): remove try_binary_elementwise_values (#15592)

Authored April 11, 2024

chore(python): Replace most deprecated calls with bounded version (#15632)

Authored April 13, 2024

perf: Use vertical parallelism if input is chunked for `Filter`,`Select`,`WithColumns` (#15608)

Authored April 12, 2024

feat: Enable `is_first/last_distinct` for not nested non-numeric list (#15552)

Authored April 10, 2024

perf: Fix cross join batch size when one of the DataFrames is tiny (#14347)

Authored April 13, 2024

fix: Mean of boolean in streaming group_by incorrectly always gave NULL (#15616)

Authored April 13, 2024

docs(python): Add docstring examples for reading json (#14481)

Authored April 13, 2024

fix: use larger recursive stack in debug mode (#15593)

Authored April 11, 2024

feat: change default to write parquet statistics (#15597)

Authored April 12, 2024

fix: Decompress moved out of schema initialization (#15550)

Authored April 10, 2024

chore(python): Initial PyO3 0.21 support (#15622)

Authored April 13, 2024

feat: Push down `is_between` expressions to Arrow (#15180)

Authored April 12, 2024

feat: Tag concat list as elementwise (#15545)

Authored April 8, 2024

build(rust): Fix a feature gate for `lz4` compression in `polars-parquet` (#15565)

Authored April 10, 2024

Browse Other Releases

Latest Pending Unreleased 😎

py-0.20.24 Released May 7, 2024

6,245 Δ

py-0.20.23 Released April 28, 2024

4,913 Δ

py-0.20.22 Released April 21, 2024

2,785 Δ

py-0.20.22-rc.1 Released April 16, 2024

349 Δ

py-0.20.21 Released April 15, 2024

785 Δ

py-0.20.20 Released April 13, 2024

4,599 Δ

py-0.20.19 Released April 8, 2024

3,848 Δ

py-0.20.18 Released April 1, 2024

2,681 Δ

py-0.20.17 Released March 28, 2024

7,336 Δ

py-0.20.16 Released March 18, 2024

2,209 Δ

Top Contributors in py-0.20.20

ritchie46

MarcoGorelli

alexander-beedie

mcrumiller

ChayimFriedman2

reswqa

JamesCE2001

TrevorWinstral

itamarst

nameexhaustion

Directory Browser for py-0.20.20

All files are compared to previous version, py-0.20.19. Click here to browse diffs between other versions.

Loading File Browser...

Release Notes Published

🚀 Performance improvements

Fix cross join batch size when one of the DataFrames is tiny (#14347)
Fix binview growable complexity O(n*m) -> O(n) (#15628)
Remove extra thread spawn from row group fetcher (#15626)
Use vertical parallelism if input is chunked for Filter,Select,WithColumns (#15608)
Refactor CSV serialization to not go thorough AnyValue (#15576)
don't use dynamic dispatch in visitors (#15607)
Improve Bitmap construction performance (#15570)
join by row-encoding (#15559)

✨ Enhancements

add Expr.dt.add_business_days and Series.dt.add_business_days (#15595)
Add str.head and str.tail (#14425)
Add union/or operator for pl.Enum (#14965)
Extended BytecodeParser to handle additional math functions, and imports from the global namespace (#15627)
Push down is_between expressions to Arrow (#15180)
add holidays argument to business_day_count (#15580)
change default to write parquet statistics (#15597)
Expressify to_integer (#15604)
Optimizer; remove double SORT and redundant projections (#15573)
Add null_on_oob parameter to expr.array.get (#15426)
support weekend argument in business_day_count (#15544)
Enable is_first/last_distinct for not nested non-numeric list (#15552)
Turn off cse if cache node found (#15554)
Tag concat list as elementwise (#15545)

🐞 Bug fixes

Return appropriate data type for time mean and median (#14471)
Fix issue in write_excel that could lead to incorrect spanning range determination (#15631)
Output correct dtype for mean_horizontal on a single column (#15118)
Recompute RowIndex schema after projection pd (#15625)
Mean of boolean in streaming group_by incorrectly always gave NULL (#15616)
Include cloud creds in cache key (#15609)
Fix elementwise-apply if any input is AggregatedScalar (#15606)
Explode list should take validity into account (#15572)
use larger recursive stack in debug mode (#15593)
SQL interface "off-by-one' indexing error with GROUP BY clauses that use position ordinals (#15584)
Enable missing features in polars-time (#15558)
Handle quoted identifiers when registering CTEs in the SQL engine (#15564)
Decompress moved out of schema initialization (#15550)
Turn off cse if cache node found (#15554)

📖 Documentation

Add legacy CPU install instructions in user guide (#13676)
Examples for errors (#13724)
Add docstring examples for reading json (#14481)
Add security warning in LazyFrame.deserialize() docstring (#15282)
Various minor updates to User Guide's SQL intro section (#15557)

🛠️ Other improvements

Replace most deprecated calls with bounded version (#15632)
use bound api (#15630)
Initial PyO3 0.21 support (#15622)
Don't run streaming group-by in partitionable gb (#15611)
pref(rust!, python): Unify sort with SortOptions and SortMultipleOptions (#15590)
Set up CodSpeed (#15537)

Thank you to all our contributors for making this release possible! @CanglongCl, @ChayimFriedman2, @Fokko, @JamesCE2001, @MarcoGorelli, @NedJWestern, @TrevorWinstral, @alexander-beedie, @deanm0000, @douglas-raillard-arm, @eitsupi, @filabrazilska, @i-aki-y, @itamarst, @leoforney, @mcrumiller, @nameexhaustion, @orlp, @ozgrakkurt, @reswqa, @ritchie46 and @stinodego