Loading embeddings #22

Joseph94m · 2018-10-13T10:49:50Z

Hi,

I downloaded the French embeddings, and extracted the zip file.
How can I load these embeddings in a python code and return the embeddings for a specified word, e.g.: embedding("bonjour") -----> [0.2, -0,2, etc...]

Thanks

nvietsang · 2019-03-12T15:09:16Z

You can use gensim to load the .bin model:

from gensim.models import Word2Vec
model = Word2Vec.load("vi.bin")
model.wv['nhà']

Then, you will get the embedding vector of the word "nhà", in Vietnamese for example.
Remember to install gensim library

fadeawaygod · 2019-03-13T07:38:14Z

But it didn't work with FastText, below is my code:

from gensim.models import FastText

model_f = FastText.load("zh.bin")
v = model_f.wv['你好']`

It throws a exception:
Exception has occurred: _pickle.UnpicklingError
invalid load key, ','.

fadeawaygod · 2019-03-13T09:39:40Z

But it didn't work with FastText, below is my code:

from gensim.models import FastText

model_f = FastText.load("zh.bin")
v = model_f.wv['你好']`

It throws a exception:
Exception has occurred: _pickle.UnpicklingError
invalid load key, ','.

I fixed it by replacing load with load_fasttext_format.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Loading embeddings #22

Loading embeddings #22

Joseph94m commented Oct 13, 2018

nvietsang commented Mar 12, 2019 •

edited

fadeawaygod commented Mar 13, 2019 •

edited

fadeawaygod commented Mar 13, 2019 •

edited

Loading embeddings #22

Loading embeddings #22

Comments

Joseph94m commented Oct 13, 2018

nvietsang commented Mar 12, 2019 • edited

fadeawaygod commented Mar 13, 2019 • edited

fadeawaygod commented Mar 13, 2019 • edited

nvietsang commented Mar 12, 2019 •

edited

fadeawaygod commented Mar 13, 2019 •

edited

fadeawaygod commented Mar 13, 2019 •

edited