Skip to content

Releases: ddangelov/Top2Vec

hierarchical topic reduction improvements

16 Nov 12:42
Compare
Choose a tag to compare
  • fixed loading bug
  • hierarchical topic reduction bug
  • added parameter for optimizing hierarchical reduction speed

Topic indexing bugfix

03 Nov 21:55
Compare
Choose a tag to compare
1.0.33

topic indexing bugfix

1.0.32

02 Nov 23:44
Compare
Choose a tag to compare

Indexing bugfix

gpu hdbscan and topic indexing

02 Nov 15:00
Compare
Choose a tag to compare
  1. Added gpu hdsbcan
  2. Added topic indexing

gpu umap

01 Nov 19:20
Compare
Choose a tag to compare
  1. Changed default embedding model to universal-sentence-encoder-multilingual.
  2. Added option for GPU umap with gpu_umap parameter.

Adding compute_topics

14 Mar 22:47
Compare
Choose a tag to compare
  • Added a method for computing topics.
  • Exposed topic deduplication parameter topic_merge_delta.
  • Bug fixes.

Sklearn change in API fix

25 Jan 22:46
Compare
Choose a tag to compare

get_feature_names() -> get_feature_names_out()

Phrases and new embedding options

03 Apr 23:02
d625b50
Compare
Choose a tag to compare
  • New pre-trained transformer models available
  • Ability to use any embedding model by passing callable to embedding_model
  • New embedding_batch_size option
  • Document chunking options for long documents
  • Phrases in topics by setting ngram_vocab=True

Query documents and topics fix

09 Jul 18:24
Compare
Choose a tag to compare
1.0.26

updating version

Query documents and topics

23 Jun 00:57
Compare
Choose a tag to compare

Added query_documents and query_topics methods which allow for using a sequence of text such as a question, a sentence, a paragraph or a document to query documents or topics.

Added num_topics parameter to get_documents_topics method which allows retrieving multiple topics per document.