You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
PS: If I don't use the parameter shardsize, the error already occurs in the similarities.Similarity call.
Steps/code/corpus to reproduce
Save the .py files in the pruvo folder (package), the .parquet file in data folder and run this script:
importpandasaspdfrompruvo.embeddingimportCorpusdf=pd.read_parquet('data/preprocess.parquet')
corpus=Corpus()
corpus.add(list(df['bookingRoomType'].unique()), pre_processed=True)
corpus.add(list(df['mappedRoomType'].unique()), pre_processed=True)
w2v=corpus.train(model='word2vec')
w2v_similars=corpus.get_similars('apartment 1 king bed in neverland')
w2v_similars.head(10)
Problem description
When I use the
shardsize
parameter in thesimilarities.Similarity
method, when querying the index the same parameter is not used, causing errors:PS: If I don't use the parameter
shardsize
, the error already occurs in thesimilarities.Similarity
call.Steps/code/corpus to reproduce
Save the
.py
files in thepruvo
folder (package), the.parquet
file indata
folder and run this script:Versions
Please provide the output of:
files.zip
The text was updated successfully, but these errors were encountered: