Skip to content

Releases: usnistgov/SDNist

SDNist v2.4

02 Feb 16:12
8430da9
Compare
Choose a tag to compare

What's Changed

SDNist Repo:

Replaced old sample reports with the ones generated with sdnist version 2.3. Sample reports are available in repo at path: sdnist/report/sample-reports

SDNist Library:

  • Fixed issue with deleting rows when there are unknown values.
  • Fixed sdnist crash issue when deid. data has less than 3 features.
  • Fixed issue with pca-pair plot when data has less than 5 features.

Assets from previous release

SDNist v2.3

19 Jun 16:55
da1d692
Compare
Choose a tag to compare

What's Changed

Data:

  • Updated INDP feature details in data_description.json file.

SDNist Library:

  • Added explanatory text to inconsistencies and UEM (Unique Exact Matches) metric, fixed typographic errors, improved formatting in privacy section, renamed 'k-marginal breakdown' to 'worst-performing PUMA breakdown' and adjusted json structure accordingly.
  • Added 1% and 5% sampling error on the k-marginal.
  • Improved readability of propensity image.
  • Fixed feature space size in UEM.
  • Fixed deidentified data percentage in UEM metric.
  • Fixed PCA plot scaling issues.

Assets from previous release

SDNist v2.2

14 Apr 14:44
6f04c34
Compare
Choose a tag to compare

What's Changed

Data:

  • Updated description of the feature PINCP_DECILE in the data_dictionary.json

SDNist Library:

  • Added new privacy metric unique-exact-matches to compute number of unique target data records that exactly matched records in a deidentified data.
  • Reduced report bundle size.
  • Added deidentified data labels to the generated json report.
  • Update to show only worst three univariates in the k-marginal breakdown section.
  • Fixes to the regression metric.
  • Few other cosmetic changes to the report UI.

Assets from previous release

SDNist v2.1.1

17 Mar 17:49
Compare
Choose a tag to compare

What's Changed

  • Updated toy data
  • Added PCA metric highlight queries.
  • Improved stability of K-marginal scoring, and added k-marginal sub-sampling baseline for each PUMA.
  • Added support for including metadata labels in the evaluation report.
  • Enhanced data validation: does not evaluate deidentified data features that contain out of bound values.
  • Added to report the list of feature that are not included in the evaluation due to out of bound values.
  • Removed incorrect child_NOC inconsistency check.
  • Changed feature ordering of pair-wise correlation charts to match data dictionary ordering.
  • Added colors to the console output for better readability.
  • Few other cosmetic changes to the report UI.

SDNist v2.0

28 Feb 14:31
9cfb867
Compare
Choose a tag to compare

This is the stable release of the SDNist Synthetic Report Generator and the SDNist version intended for use in the 2023 Collaborative Research cycle.

What's Changed

  • Added new metrics
  • Included multiple sample results
  • Added some more READMEs (report generator tool and sample report)
  • Added better documentation in the READMEs
  • Streamlined the repo by removing materials specific to the Temporal Map Challenge.

v1.4.1-b.1: Beta release

18 Oct 13:46
Compare
Choose a tag to compare

This release uses same data resource files as release v1.4.0-b.1
Code and Readme

changes:

  • Fix synthetic data binning.
  • Fix PCA plot axes.

v1.4.0-b.2: Beta release

31 Aug 11:40
Compare
Choose a tag to compare

This release uses same resource files as release v1.4.0-b.1
Code and Readme

changes:

  • Fixed synthetic data validation.
  • Fixed apparent match metric when no matches occur.
  • Fixed sampling error computation to make sub-samples with same size as the target data.
  • Update propensity metric to use tree depth 6.
  • Update pearson and kendall correlation to have minimum upper range of 0.15 on colorbars.
  • Update k-marginal to perform 100 permutations of 3-marginal selection.

v1.4.0-b.1: Beta test release with all data resources

25 Jul 14:22
Compare
Choose a tag to compare

v1.3.0: Stable release with all data resouces

22 Mar 11:59
Compare
Choose a tag to compare

This release is the v1.3.0 stable release and contains all data assets.

Data files in parquet/json and csv are included. Use pip install from the unzipped package to install sdnist. Download SDNist-data-1.3.0 and run sdnist from the unzipped directory to prevent auto attempts to download the SDNist github release 1.3.0.

SDNist-data-1.3.0.zip contains all data tables in parquet/json and csv and also geojson files for census data.

v1.3.0 Chanelog: https://github.com/usnistgov/SDNist/blob/main/CHANGELOG.md
Full Changelog: v0.1.2...v1.3.0

Stable release with all data resouces

26 Jan 17:06
Compare
Choose a tag to compare

This release is the first stable release and contains all data assets.

Data files in parquet/json and csv are included. Use pip install from the unzipped package to install sdnist. Download SDNist-data-1.2.01 and run sdnist from the unzipped directory to prevent attempts to download the data from NIST servers.

SDNist-data-1.2.0.zip contains all data tables in parquet/json and csv.

Full Changelog: v0.1.2...v1.2.0