word2vec-model: Reading word2vec binary models

[ bsd3, library, program, unclassified ]

Please see the README on Github at https://gonito.net/gitlist/word2vec-model.git/blob/master/README.md

Change log ChangeLog.md
Dependencies attoparsec, base (>=4.7 && <5), binary, binary-ieee754, bytestring, conduit, conduit-combinators, conduit-extra, text, unordered-containers, vector, word2vec-model
License BSD-3-Clause
Copyright BSD3
Author Filip Graliński
Maintainer filipg@amu.edu.pl
Home page https://gonito.net/gitlist/word2vec-model.git
Source repo head: git clone git://gonito.net/word2vec-model.git
Uploaded by filipg at 2017-12-30T11:44:43Z
Distributions NixOS:
Executables word2vec-model-word-analogy, word2vec-model-similarity
Downloads 579 total (3 in the last 30 days)
Docs not available
All reported builds failed as of 2017-12-30 (3 reports)


  • Data
    • Word2Vec
      • Data.Word2Vec.Model



Readme for word2vec-model-



Reading word2vec binary models (generated with the original tool by Mikolov).

This simple module only reads word2vec models. It cannot be used to generate one; for that, use the original word2vec tools.

Note that the word2vec binary format is not a proper serialisation format, as it is mostly a raw dump of C data. Caveat emptor: it might be risky to read a model generated on a host with a different architecture.
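
To illustrate why a raw dump of C data is architecture-sensitive, here is a minimal sketch that decodes the same four bytes as a 32-bit IEEE 754 float under both byte orders. It assumes the vector components are stored as C `float` values in the byte order of the generating host. The helper names (`word32FromBytesBE`, `floatLE`, `floatBE`) are hypothetical and are not part of this package.

```haskell
import Data.Bits (shiftL, (.|.))
import Data.Word (Word8, Word32)
import GHC.Float (castWord32ToFloat)

-- Assemble bytes into a Word32, most significant byte first.
-- (Hypothetical helper for illustration; expects exactly four bytes.)
word32FromBytesBE :: [Word8] -> Word32
word32FromBytesBE = foldl (\acc b -> (acc `shiftL` 8) .|. fromIntegral b) 0

-- Interpret four bytes as a float written by a little-endian host.
floatLE :: [Word8] -> Float
floatLE = castWord32ToFloat . word32FromBytesBE . reverse

-- Interpret the same four bytes as if written by a big-endian host.
floatBE :: [Word8] -> Float
floatBE = castWord32ToFloat . word32FromBytesBE

main :: IO ()
main = do
  -- The value 1.0 as a little-endian host would lay it out in memory.
  let bytes = [0x00, 0x00, 0x80, 0x3F]
  print (floatLE bytes)  -- decoded with the matching byte order
  print (floatBE bytes)  -- decoded with the wrong byte order
```

With the matching byte order the bytes decode to 1.0. With the wrong one they decode to a tiny denormal number, so every vector in the model would be silently corrupted rather than rejected.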


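{-# LANGUAGE OverloadedStrings #-}
import Data.Word2Vec.Model

-- Load a model from a word2vec binary file (inside an IO do-block).
model <- readWord2VecModel "binary.bin"
-- Query the loaded model for the 30 nearest neighbours of "polska".
let theMostSimilar = findKNearestToWord model 30 "polska"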