Skip to content

Releases: pemistahl/lingua-go

Lingua 1.1.0

21 Nov 23:08
Compare
Choose a tag to compare

Features

  • The new method LanguageDetectorBuilder.WithLowAccuracyMode() has been introduced. By activating it, detection accuracy for short text is reduced in favor of a smaller memory footprint and faster detection performance. (#17)

  • The new method LanguageDetector.ComputeLanguageConfidence() has been introduced. It allows to retrieve the confidence value for one specific language only, given the input text. (#19)

Improvements

  • The computation of the confidence values has been revised and the min-max normalization algorithm is now applied to the values, making them better comparable by behaving more like real probabilities. (#16)

  • The language models are now serialized as protocol buffers instead of json. Thanks to this change, they are now loaded into memory twice as fast as before. (#22)

Bug Fixes

  • The unigram counts in the statistics engine were not retrieved correctly. This has been fixed, producing more correct detection results. (#14)

Compatibility

  • The lowest supported Go version is 1.18 now. Older versions are no longer compatible with this library.

Miscellaneous

  • The library now has a fresh and colorful new logo. Why? Well, why not? (-:

Lingua 1.0.5

25 Dec 12:33
Compare
Choose a tag to compare

Bug Fixes

  • The character â was erroneously not treated as a possible indicator for French.

Improvements

  • The dependencies to the other language detectors which are used for the accuracy comparisons were always downloaded together with the main library. They are only needed when you want to update the accuracy reports, therefore the cmd/ subdirectory now contains its own Go module that defines those dependencies. They have now been removed from the main library. Thanks to @dim and @BoeingX for identifying this problem. (#8)

Lingua 1.0.4

28 Nov 14:17
Compare
Choose a tag to compare

Bug Fixes

  • It was possible to include lingua.Unknown in the set of input languages for building the language detector. It is only meant as a return value, so it is now automatically removed from the set of input languages. Thanks to @marians for identifying this problem. (#7)

Lingua 1.0.3

20 Oct 12:23
Compare
Choose a tag to compare

Improvements

  • By replacing sync.Once with sync.Map for storing the language models at runtime, a large amount of code could be removed while preserving the same functionality. This improves code maintenance significantly.

Lingua 1.0.2

13 Oct 19:16
Compare
Choose a tag to compare

Bug Fixes

  • In very rare cases, the language returned by the detector was non-deterministic.
    This has been fixed. Big thanks to @FilipAlexander for identifying this problem. (#6)

Lingua 1.0.1

27 Jun 20:29
Compare
Choose a tag to compare

Bug Fixes

  • The language models were not embedded into the compiled binary. This resulted in problems when trying to use Lingua within a Docker container, for instance. Big thanks to @dsxack for identifying this problem and providing a fix. (#2 #3)

Lingua 1.0.0

21 Jun 17:30
Compare
Choose a tag to compare

This is the very first release of the Go implementation of Lingua. Enjoy! :-)