Releases: jekyll/classifier-reborn
v2.3.0
Minor Enhancements
- Separate tokenizer from hasher, allowing custom tokenizers. (#162)
- Improved handling of zero division and zero-vector normalization. (#173)
- Support Numo Gem for performing SVD (#198)
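To illustrate what separating the tokenizer from the hasher makes possible, here is a minimal, self-contained sketch. All names in it (`WhitespaceTokenizer`, `BigramTokenizer`, `word_hash`, the `tokenizer:` keyword) are hypothetical and chosen for illustration; they are not taken from the gem, whose actual tokenizer interface may differ.

```ruby
# Hypothetical names throughout; this is an illustration, not the gem's API.
module WhitespaceTokenizer
  module_function

  # Downcase the text and split it on whitespace.
  def tokenize(text)
    text.downcase.split
  end
end

module BigramTokenizer
  module_function

  # Emit adjacent word pairs joined by an underscore.
  def tokenize(text)
    text.downcase.split.each_cons(2).map { |a, b| "#{a}_#{b}" }
  end
end

# The hasher only counts tokens; it does not care how they were produced.
def word_hash(text, tokenizer: WhitespaceTokenizer)
  tokenizer.tokenize(text).each_with_object(Hash.new(0)) do |token, counts|
    counts[token] += 1
  end
end

word_hash('the cat the dog')
# => {"the"=>2, "cat"=>1, "dog"=>1}
word_hash('The cat sat', tokenizer: BigramTokenizer)
# => {"the_cat"=>1, "cat_sat"=>1}
```

Because the hasher depends only on an object that responds to `tokenize`, swapping in a different tokenization strategy needs no change to the counting logic.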
Development Fixes
- remove deprecated has_rdoc in gemspec
- auto-gen-config for Rubocop
- auto-correct offenses
- Freeze all Ruby string literals (#190)
- Migrate TravisCI to GitHub Actions & Update Tested Ruby Versions (#195)
- Test Native and GSL Implementations (#196)
- Actually Use GSL in CI Tests (#197)
Documentation
- Updated Bayes docs for filters (#191)
2.2.0
Major Enhancements
- JRuby Support, thanks to @mach-kernel (#168)
- Add support to reset trained classifiers to their initial state (#143)
- Classifier evaluation and validation (#142)
- Ability to add custom stopwords at classifier initialization (#129)
- Don't train/untrain the Bayesian classifier with empty word hashes (#132)
- Enable auto categorization if no initial categories (#128)
- Bayes integration test of Memory and Redis backends with real data (#92)
- Memory and Redis backend support (#84)
Minor Enhancements
- Improved Turkish stopwords (#159)
- Set Redis keys only if they don't exist (#156)
- Require bayes_redis_backend (#157)
- Validation documentation improvements (#150)
- Updated Docker image to Ruby 2.4 (#149)
- Classifier validation user documentation (#145)
- Fixed persistence for BayesMemoryBackend (#147)
- Fixed error on requiring 'classifier-reborn' without using Redis (#146)
- Removed magic train and untrain methods from docs (#141)
- Links corrected to point to the new domain (#139)
- Minor docs improvements (#138)
- Return the status of the training/untraining when run (#137)
- Refactoring of backend tests to move duplicate logic into the common file (#134)
- Deal with Infinity score in test (#133)
- README file cleaned up to point to the documentation site (#121)
- Added and corrected RDoc for certain classes and methods (#122)
- Added favicon link and forced display (#120)
- Updated the truncated LICENSE file (#116)
- Docs visual improvement and refactoring (#119)
- Fixed relative URL issue on nav links and added benchmark data (#118)
- Added custom layout with navigation (#117)
- Created a static site for documentation (#115)
- Removed redis gem from Dockerfile as it is added in gemspec (#113)
- Speed up Docker image rebuilding (#112)
- Improved Docker based development documentation (#106)
- Benchmark refactoring, improving efficiency, enhanced reporting (#107)
- Add Vietnamese stopwords (#110)
- Added stop words for Arabic, Bengali, Chinese, Hindi, and Russian (#105)
- Dockerfile and documentation (#104)
- Remove hard dep on Redis and update bin (#96)
- Documented Redis backend performance (#103)
- Rename Bayes memory test class (#102)
- Added Bayes backend benchmarks (#98)
- Disabled Redis disc persistence and refactored integration test (#97)
- Removed useless intermediate variables (#90)
v2.1.0
v2.0.5
v2.0.4
Major Enhancements
- Classification thresholds can be enabled or disabled. The default is disabled. The threshold value can be set at initialization time or dynamically during processing (#47)
- Made auto-categorization optional, defaulting to false (#45)
- Added the ability to handle an array of classifications to the constructor (#44)
- Classification with a threshold has been added to the api (#39)
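The threshold behaviour described in this list can be sketched in plain Ruby. The helper below is a hypothetical illustration, not the gem's implementation: the method name, the `threshold:` and `enable_threshold:` keywords, and the sample scores are all assumptions. It takes a hash of per-category scores (higher is better) and declines to classify when thresholding is enabled and the best score is too low.

```ruby
# Hypothetical helper; the gem's own classifier API is not reproduced here.
# scores: Hash of category => score, where a higher score is a better match.
# Returns the best category, or nil when thresholding is enabled and the
# best score falls below the threshold. Thresholding is off by default.
def classify_with_threshold(scores, threshold: 0.0, enable_threshold: false)
  category, score = scores.max_by { |_, s| s }
  return nil if category.nil?
  return nil if enable_threshold && score < threshold

  category
end

scores = { 'Interesting' => -4.2, 'Uninteresting' => -9.7 }

classify_with_threshold(scores)
# => "Interesting"
classify_with_threshold(scores, threshold: -3.0, enable_threshold: true)
# => nil
```

Returning `nil` rather than the weakest acceptable category lets the caller distinguish "no confident match" from a real classification.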
Minor Enhancements
- Documentation around threshold usage (#54)
- Fixed UTF-8 encoding for hasher.rb (#50)
- Removed some unnecessary methods (#43)
- Add optional CachedContentNode (GSL only) (#43)
- Caches the transposed search_vector (#43)
- Added custom marshal_ methods to not save the cache when dumping/loading (#43)
- Optimized some numeric comparisons and iterators (#43)
- Added cached calculation table when computing raw_vectors (#43)
- If a category name is already a symbol, just return it (#45)
- Various Hash improvements (#45)
- Eliminated several Ruby warnings when run with RUBYOPT="-w" (#38)
- Simple performance improvements for the Hasher process (#41)
- Fixes for broken regex splitting for non-ascii characters and removal of the unused punctuation filter (#41)
- Add multiple language stopwords with customizable stop word paths (#40)
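The idea behind the custom marshal_ methods mentioned above, keeping a derived cache out of serialized data, can be sketched with Ruby's standard `marshal_dump` and `marshal_load` hooks. The `CachedVector` class here is hypothetical and much simpler than the gem's CachedContentNode; it only demonstrates the general technique.

```ruby
# Hypothetical class for illustration; not the gem's CachedContentNode.
class CachedVector
  attr_reader :values

  def initialize(values)
    @values = values
    @norm_cache = nil
  end

  # Euclidean norm, computed once and then served from the cache.
  def norm
    @norm_cache ||= Math.sqrt(@values.reduce(0.0) { |acc, v| acc + v * v })
  end

  def cached?
    !@norm_cache.nil?
  end

  # Called by Marshal.dump: serialize only the source data, not the cache.
  def marshal_dump
    @values
  end

  # Called by Marshal.load: restore the data and start with an empty cache.
  def marshal_load(values)
    @values = values
    @norm_cache = nil
  end
end

vector = CachedVector.new([3.0, 4.0])
vector.norm                               # => 5.0 (cache is now populated)
copy = Marshal.load(Marshal.dump(vector))
copy.cached?                              # => false
copy.norm                                 # => 5.0 (recomputed on demand)
```

Dropping the cache keeps dumps smaller and avoids persisting values that can always be recomputed from the source data.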
Bug Fixes
- Fixed the bug where adding the same category a second time would clobber the category that was already there (#45)
- Fixed deprecation warning for <=> in ls.rb (#33)
- Remove references to Madeline in the README and replace it with Marshal or Redis (#32)
Development Fixes
- Added development dependency on mini_test and added 2.2 to travis.yml (#36)
v2.0.2
Minor Enhancements
- Remove Array#sum monkey patch in favour of #reduce(0, :+) (#20)
- Cache total word counts per category for speed (#4)
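As a brief illustration of the #reduce(0, :+) idiom named above, the snippet below sums an array without adding any method to Array. The variable names and sample values are made up for the example.

```ruby
# Summing with Enumerable#reduce instead of a monkey-patched Array#sum.
word_counts = [3, 1, 4]

total = word_counts.reduce(0, :+)   # => 8

# The explicit initial value of 0 keeps the result numeric for empty input.
empty_total = [].reduce(0, :+)      # => 0

# Without the initial value, an empty array yields nil rather than 0.
no_seed = [].reduce(:+)             # => nil
```

Passing the initial value is what makes the idiom a safe replacement for a sum method, since later arithmetic on the result never has to handle `nil`.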
Development Fixes
- Add a test for Bayes#untrain_*. (#21)
- Fix link to rb-gsl gem (#24)
- Add helper scripts per Jekyll convention (#25)
Many thanks to @Ch4s3 for all his work on this release!