Measuring the performance of bigrams and trigrams in language modeling problems

mcanalesmayo/LanguageModel-NGram

LanguageModel-NGram

Description

An N-gram model has O(K^N) storage complexity, where K is the number of distinct words (the vocabulary size) and N is the length of the n-gram, which corresponds to a Markov chain of order N-1. Therefore:

  • The ideal solution has O(K^K) complexity, which makes it computationally intractable
  • Bigrams have O(K^2) complexity
  • Trigrams have O(K^3) complexity
  • ... and so on
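
To make the growth concrete, here is a minimal Python sketch that counts the distinct bigrams and trigrams in a toy sentence and compares them with the K^N worst-case table size. The helper names `ngram_counts` and `table_bound` and the sample sentence are made up for illustration and are not taken from this repository.

```python
from collections import Counter


def ngram_counts(tokens, n):
    """Count every run of n consecutive tokens (hypothetical helper)."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def table_bound(vocab_size, n):
    """Worst-case number of distinct n-grams for a vocabulary of size K: K^N."""
    return vocab_size ** n


# Toy corpus, invented for this example.
tokens = "the cat sat on the mat and the cat ran".split()
K = len(set(tokens))  # vocabulary size

for n in (2, 3):
    observed = len(ngram_counts(tokens, n))
    print(f"N={n}: {observed} distinct n-grams observed, worst case K^N = {table_bound(K, n)}")
```

On real text the observed counts stay far below the bound, but the bound still determines how large a dense probability table would have to be.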

The aim of this program is to measure the performance of bigrams and trigrams in language modeling problems.
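
One common way to compare such models is perplexity on held-out text. The sketch below is only an illustration under stated assumptions: the metric (perplexity), the add-one smoothing, the toy data, and the function names `train` and `perplexity` are all assumptions of this example and may differ from what the program in this repository actually does.

```python
import math
from collections import Counter


def train(tokens, n):
    """Collect n-gram counts and the counts of their (n-1)-word contexts."""
    grams = Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
    contexts = Counter(tuple(tokens[i:i + n - 1]) for i in range(len(tokens) - n + 1))
    return grams, contexts


def perplexity(tokens, n, grams, contexts, vocab_size):
    """Perplexity of an add-one smoothed n-gram model (assumed metric)."""
    log_prob, count = 0.0, 0
    for i in range(len(tokens) - n + 1):
        gram = tuple(tokens[i:i + n])
        # Add-one smoothing: unseen n-grams still get non-zero probability.
        p = (grams[gram] + 1) / (contexts[gram[:-1]] + vocab_size)
        log_prob += math.log(p)
        count += 1
    return math.exp(-log_prob / count)


# Toy data, invented for this example; the test text uses only known words.
train_tokens = "the cat sat on the mat and the cat ran".split()
test_tokens = "the cat sat on the mat".split()
vocab_size = len(set(train_tokens))

for n in (2, 3):
    grams, contexts = train(train_tokens, n)
    pp = perplexity(test_tokens, n, grams, contexts, vocab_size)
    print(f"N={n}: perplexity = {pp:.2f}")
```

Lower perplexity means the model assigns higher probability to the held-out text. With so little data the numbers are not meaningful in themselves; the point is the shape of the comparison.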

References

If you use this code, please reference: Javier Civera - jcivera@unizar.es
