Skip to content

Releases: IBM/unitxt

Unitxt 1.4.2

08 Jan 15:09
Compare
Choose a tag to compare

Enhancements

  • Added ability to calculate confidence interval for additional scores beyond the main_score by @assaftibm in #431
  • Improved error messaging in test cards and added option to disable via environment variable by @yoavkatz in #440
  • Added support for Japanese language in sacrebleu by @yoavkatz in #433
  • Introduced binary recall, precision metrics, and advanced filtering operators by @lilacheden in #424
  • Added text completion task and modified lm harness template by @perlitz in #429

Internal code changes

Bug fixes

Documentation

  • Documented release notes process by @matanor in #444
  • Added guidelines for updating Unitxt documentation by @matanor in #446

Non backward compatible changes

  • Implemented SystemFormat instance operator and updated BasicRecipe to use all operators, removing renderers and ICLFormat by @dafnapension in #423
    (this change can break any code using the deprecated ICLFormat)
  • Additional verification introduced, aiming for uniquely determined results by @dafnapension in #435

New Contributors

Full Changelog: 1.4.1...1.4.2

Unitxt 1.4.1

31 Dec 10:45
Compare
Choose a tag to compare

(Same as 1.4.0 - rereleasing due to release process error)

Enhancements

  • New random generation mechanism to remove dependency between different random generators @matanor in #414
  • New MultipleChoiceTemplate which changes all QA datasets by @elronbandel in #405
  • New MAP, MRR, and Retrieval@K metrics by @assaftibm in #422
  • New LoadFromKaggle loader which allows direct loading of datasets from Kaggle by @ilyashnil in #413
  • New StringContainment metric that if one of the references is contained in the prediction by @ellarabi in #394
  • New ConvertToBoolean post processor that changes the predict to either TRUE or FALSE by @ellarabi in #394
  • 15 new open source classification datasets by @ilyashnil in #410, #418
  • Documentation is now automatically generate in each release (#384)

Bug fixes

  • Fixed and improved error checking in multi label F1 by @yoavkatz in #390
  • Changed translations to use normalized_sacrebleu by @gitMichal in #426

Non backward compatible changes

  • MultipleChoice QA datasets need to move to new MultipleChoiceTemplates
  • Translation blue metric is now between 0-1 and not 0-100
  • New randomization mechanism may change selects selection of demos , randomized text augmentation, or any other random choice.

What's Changed

New Contributors

Full Changelog: 1.3.1...1.4.0

Uniitxt 1.4.0

31 Dec 10:39
30ba648
Compare
Choose a tag to compare

Enhancements

  • New random generation mechanism to remove dependency between different random generators @matanor in #414
  • New MultipleChoiceTemplate which changes all QA datasets by @elronbandel in #405
  • New MAP, MRR, and Retrieval@K metrics by @assaftibm in #422
  • New LoadFromKaggle loader which allows direct loading of datasets from Kaggle by @ilyashnil in #413
  • New StringContainment metric that if one of the references is contained in the prediction by @ellarabi in #394
  • New ConvertToBoolean post processor that changes the predict to either TRUE or FALSE by @ellarabi in #394
  • 15 new open source classification datasets by @ilyashnil in #410, #418
  • Documentation is now automatically generate in each release (#384)

Bug fixes

  • Fixed and improved error checking in multi label F1 by @yoavkatz in #390
  • Changed translations to use normalized_sacrebleu by @gitMichal in #426

Non backward compatible changes

  • MultipleChoice QA datasets need to move to new MultipleChoiceTemplates
  • Translation blue metric is now between 0-1 and not 0-100
  • New randomization mechanism may change selects selection of demos , randomized text augmentation, or any other random choice.

What's Changed

New Contributors

Full Changelog: 1.3.1...1.4.0

Unitxt 1.3.2

19 Dec 05:42
Compare
Choose a tag to compare

Enhancements

  • Add classification datasets by @ilyashnil in #410
  • Added StringContainment metric and convert_to_boolean post processor that normalizes "yes/no/true/false" by @ellarabi in #394

Bug fixes

  • Fix for AugmentPrefixSuffix that ignored labels field by @yoavkatz in #409

What's Changed

New Contributors

Full Changelog: 1.3.1...1.3.2

Unitxt 1.3.1

18 Dec 14:48
Compare
Choose a tag to compare

Enhancements:

Fixes:

  • DiverseLabelSampler used in balancing demonstrations in NER , binary and multi-label classification receives input field to balance on.(@yoavkatz) #399
  • Fix to allow override empty_label of multi_label template used in non-english templates. (@yoavkatz) #403
  • Fix to AugmentorPrefixSuffix not return same prefix/suffix repeated (@yoavkatz) in #407

Possible changes

  • NER results will improve due to better balancing(@yoavkatz) #399

Documentation

What's Changed

New Contributors

Full Changelog: 1.3.0...1.3.1

Unitxt 1.3.0

17 Dec 07:37
Compare
Choose a tag to compare

Release Notes for Version 1.3.0

New Features and Enhancements:

  • Caching in LoadFromIBMCos: Improved data retrieval efficiency with caching in LoadFromIBMCos (@yoavkatz) - PR #377.
  • Additional Inputs for HuggingFace Metrics: Enhanced metrics with the ability to include additional inputs (@yoavkatz) - PR #383.
  • Support for Independent Random Generators: Enhanced data diversity with support for independent random generators (@matanor) - PR #374.
  • JSON Dumps with Non-standard ASCII Support: Updated json.dumps calls for non-standard ASCII support (@yoavkatz) - PR #386.
  • Improved API and Coverage of CastFields: Overhauled CastFields API and fixed issues with casting list fields (@dafnapension) - PR #372.
  • Expand Augment Suffix to Prefix Suffix: Expanded augment suffix functionality to include prefix suffix, increasing versatility (@dafnapension) - PR #375.

Fixes:

  • Circular Reference Error in FieldOperators: Addressed circular reference errors in FieldOperators (@dafnapension) - PR #376.
  • Llama Fix: Removed <s> from llama format. (@gitMichal) - PR #392.
  • Improvements and Fixes in Type Utils: Improved reliability and performance in type utilities (@elronbandel) - PR #395.
  • Deep Copy in Extract Field Values: Ensured data integrity with deep copy in extract_field_values (@dafnapension) - PR #379.

Potential for results changes

  • Llama Fix: Removed <s> from llama format. (@gitMichal) May changes llama model results- PR #392.
  • Support for Independent Random Generators: Enhanced data diversity with support for independent random generators (@matanor) May change random selection of demonstrations - PR #374.

Improvements:

  • Documentation and Code Coverage Expansion: Expanded documentation and coverage of additional operators (@dafnapension) - PR #397.
  • Backward Compatibility in Exposing Types Dictionary: Maintained backward compatibility by not exposing the types dictionary (@dafnapension) - PR #393.

Full Changelog: 1.1.4...1.3.0

What's Changed

Unitxt 1.2.0

10 Dec 11:58
Compare
Choose a tag to compare

Release Notes for Version 1.2.0

New Features and Enhancements:

  • Cache Usage Optimization: Optimized cache usage with a new default setting to enhance performance by not storing intermediate results in HuggingFace cache (@elronbandel) - PR #360.

Fixes

  • Fixed bug in Mapping Full List in MapInstanceValues: MapInstanceValues did not correctly map lists (@yoavkatz) - PR #367.

Improvements

  • Update Logging: Unitxt now uses standard logger for better clarity and troubleshooting (@elronbandel) - PR #363.

  • Better error checking in FilterByListsOfValues Operator: FilterByListsOfValues now throws and error when filtering all elements of a list, unless explicitly requested otherwise (@elronbandel) - PR #362.

  • Templates Converted to Operators: Transformed templates into operators for improved functionality and integration (@elronbandel) - PR #357.

Documentation and Code Coverage:

For more detailed information, please refer to the linked pull requests in our repository.

What's Changed

Full Changelog: 1.1.4...1.2.0

v1.0.1 pre release

12 Jul 06:28
Compare
Choose a tag to compare
v1.0.1 pre release Pre-release
Pre-release