new version (on deck)

[ENH] Upgraded update_where function to use either the pandas query style, or boolean indexing via the loc method. Also updated find_replace function to use the loc method directly, instead of routing it through the update_where function. @samukweku
[INF] Update pandas minimum version to 1.0.0. @hectormz
[DOC] Updated the general functions API page to show all available functions. @samukweku
[DOC] Fix the few lacking type annotations of functions. @VPerrollaz
[DOC] Changed the signature from str to Optional[str] when initialized by None. @VPerrollaz
[DOC] Add the Optional type for all signatures of the API. @VPerrollaz

v0.20.7

[TST] Add a test for transform_column to check for nonmutation. @VPerrollaz
[ENH] Contributed expand_grid function by @samukweku

v0.20.6

[DOC] Pep8 all examples. @VPerrollaz
[TST] Add docstrings to tests @hectormz
[INF] Add debug-statements, requirements-txt-fixer, and interrogate to pre-commit. @hectormz
[ENH] Upgraded transform_column to use df.assign underneath the hood, and also added option to transform column elementwise (via apply) or columnwise (thus operating on a series). @ericmjl

v0.20.5

[INF] Replace pycodestyle with flake8 in order to add pandas-vet linter @hectormz
[ENH] select_columns() now raises NameError if column label in search_columns_labels is missing from DataFrame columns. @smu095

v0.20.1

[DOC] Added an example for groupby_agg in general functions @samukweku
[ENH] Contributed sort_naturally() function. @ericmjl

v0.20.0

[DOC] Edited transform_column dest_column_name kwarg description to be clearer on defaults by @evan-anderson.
[ENH] Replace apply() in favor of pandas functions in several functions. @hectormz
[ENH] Add ecdf() Series function by @ericmjl.
[DOC] Update API policy for clarity. @ericmjl
[ENH] Enforce string conversion when cleaning names. @ericmjl
[ENH] Change find_replace implementation to use keyword arguments to specify columns to perform find and replace on. @ericmjl
[ENH] Add jitter() dataframe function by @rahosbach

v0.19.0

[ENH] Add xarray support and clone_using / convert_datetime_to_number funcs by @zbarry.

v0.18.3

[ENH] Series toset() functionality #570 @eyaltrabelsi
[ENH] Added option to coalesce function to not delete coalesced columns. @gddcunh
[ENH] Added functionality to deconcatenate tuple/list/collections in a column to deconcatenate_column @zbarry
[ENH] Fix error message when length of new_column_names is wrong @DollofCutty
[DOC] Fixed several examples of functional syntax in functions.py. @bdice
[DOC] Fix #noqa comments showing up in docs by @hectormz
[ENH] Add unionizing a group of dataframes' categoricals. @zbarry
[DOC] Fix contributions hyperlinks in AUTHORS.rst and contributions by @hectormz
[INF] Add pre-commit hooks to repository by @ericmjl
[DOC] Fix formatting code in CONTRIBUTING.rst by @hectormz
[DOC] Changed the typing for most "column_name(s)" to Hashable rather than enforcing strings, to more closely match Pandas API by @dendrondal
[INF] Edited pycodestyle and Black parameters to avoid venvs by @dendrondal

v0.18.2

[INF] Make requirements.txt smaller @eyaltrabelsi
[ENH] Add a reset_index parameter to shuffle @eyaltrabelsi
[DOC] Added contribution page link to readme @eyaltrabelsi
[DOC] fix example for update_where, provide a bit more detail, and expand the bad_values example notebook to demonstrate its use by @anzelpwj.
[INF] Fix pytest marks by @ericmjl (issue #520)
[ENH] add example notebook with use of finance submodule methods by @rahosbach
[DOC] added a couple of admonitions for Windows users. h/t @anzelpwj for debugging help when a few tests failed for win32 @Ram-N
[ENH] Pyjanitor for PySpark @zjpoh
[ENH] Add pyspark clean_names @zjpoh
[ENH] Convert asserts to raise exceptions by @hectormz
[ENH] Add decorator functions for missing and error handling @jiafengkevinchen
[DOC] Update README with functional pandas API example. @ericmjl
[INF] Move get_features_targets() to new ml.py module by @hectormz
[ENH] Add chirality to morgan fingerprints in janitor.chemistry submodule by @Clayton-Springer
[INF] import_message suggests python dist. appropriate installs by @hectormz
[ENH] Add count_cumulative_unique() method to janitor.functions submodule by @rahosbach
[ENH] Add update_where() method to janitor.spark.functions submodule by @zjpoh

v0.18.1

[ENH] extend find_replace functionality to allow both exact match and regular-expression-based fuzzy match by @shandou
[ENH] add preserve_position kwarg to deconcatenate_column with tests by @shandou and @ericmjl
[DOC] add contributions that did not leave git traces by @ericmjl
[ENH] add inflation adjustment in finance submodule by @rahosbach
[DOC] clarified how new functions should be implemented by @shandou
[ENH] add optional removal of accents on functions.clean_names, enabled by default by @mralbu
[ENH] add camelCase conversion to snake_case on clean_names by @ericmjl, h/t @jtaylor for sharing original
[ENH] Added null_flag function which can mark null values in rows. Implemented by @anzelpwj
[ENH] add engineering submodule with unit conversion method by @rahosbach
[DOC] add PyPI project description
[ENH] add example notebook with use of finance submodule methods by @rahosbach

For changes that happened prior to v0.18.1, please consult the closed PRs, which can be found here.

We thank all contributors who have helped make pyjanitor the package that it is today.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CHANGELOG.rst

CHANGELOG.rst

new version (on deck)

v0.20.7

v0.20.6

v0.20.5

v0.20.1

v0.20.0

v0.19.0

v0.18.3

v0.18.2

v0.18.1

Files

CHANGELOG.rst

Latest commit

History

CHANGELOG.rst

File metadata and controls

new version (on deck)

v0.20.7

v0.20.6

v0.20.5

v0.20.1

v0.20.0

v0.19.0

v0.18.3

v0.18.2

v0.18.1