word2vec-model: Reading word2vec binary models

[ bsd3, library, program, unclassified ]

Please see the README at https://gonito.net/gitlist/word2vec-model.git/blob/master/README.md


Versions 0.1.0.0
Change log ChangeLog.md
Dependencies attoparsec, base (>=4.7 && <5), binary, binary-ieee754, bytestring, conduit, conduit-combinators, conduit-extra, text, unordered-containers, vector, word2vec-model
License BSD-3-Clause
Copyright BSD3
Author Filip Graliński
Maintainer filipg@amu.edu.pl
Home page https://gonito.net/gitlist/word2vec-model.git
Source repo head: git clone git://gonito.net/word2vec-model.git
Uploaded by filipg at Sat Dec 30 11:44:43 UTC 2017
Distributions NixOS:0.1.0.0
Executables word2vec-model-word-analogy, word2vec-model-similarity
Downloads 145 total (13 in the last 30 days)
Status Docs not available
All reported builds failed as of 2017-12-30 (3 reports)

Modules

  • Data
    • Word2Vec
      • Data.Word2Vec.Model


Readme for word2vec-model-0.1.0.0

word2vec-model

Reading word2vec binary models (generated with the original tool by Mikolov).

This simple module only reads word2vec models; it cannot generate them. To generate a model, use the original word2vec tools.

Note that the word2vec binary format is not a proper serialisation format, as it is mostly a raw dump of C data. Caveat emptor: it might be risky to read a model generated on a host with a different architecture.
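
To make the portability caveat concrete, here is a minimal sketch of decoding one vocabulary entry with an explicit byte order. It assumes the layout written by the original C tool (the word, a space, then the vector as raw 32-bit floats) and a little-endian producer. The header line with the vocabulary size and dimension is not handled. `getToken`, `getEntry`, `sampleEntry` and `decodedSample` are hypothetical names, not part of this package, and the package's own reader may work differently.

```haskell
import Control.Monad (replicateM)
import Data.Binary.Get (Get, getFloatle, getWord8, runGet)
import qualified Data.ByteString.Lazy as BL
import Data.Word (Word8)

-- Read bytes up to (and consuming) the next space. Newline bytes are
-- skipped, because the original tool writes one after each vector.
getToken :: Get [Word8]
getToken = do
  b <- getWord8
  case b of
    32 -> return []            -- space: end of the word
    10 -> getToken             -- newline: ignore
    _  -> (b :) <$> getToken

-- One vocabulary entry: the word, then `dim` raw 32-bit floats.
-- Assumption: the floats are little-endian, stated explicitly here
-- instead of relying on the byte order of the reading host.
getEntry :: Int -> Get ([Word8], [Float])
getEntry dim = do
  word <- getToken
  vec  <- replicateM dim getFloatle
  return (word, vec)

-- Hypothetical sample: the word "ab" followed by the floats 1.0 and -2.0.
sampleEntry :: BL.ByteString
sampleEntry = BL.pack ([97, 98, 32] ++ [0, 0, 128, 63] ++ [0, 0, 0, 192])

decodedSample :: ([Word8], [Float])
decodedSample = runGet (getEntry 2) sampleEntry
```

Replacing `getFloatle` with `getFloatbe` on the same bytes yields different numbers, which is the kind of silent corruption to expect when the producing and reading hosts disagree on byte order.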

Example:

{-# LANGUAGE OverloadedStrings #-}
import Data.Word2Vec.Model

-- Inside an IO do-block:
model <- readWord2VecModel "binary.bin"
let theMostSimilar = findKNearestToWord model 30 "polska"
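
The README does not say how the nearest words are ranked. As an illustration only, a k-nearest lookup over word vectors is commonly done with cosine similarity. The sketch below assumes that measure and uses plain lists; `cosine`, `kNearest` and `toyVocab` are hypothetical names, not this package's API.

```haskell
import Data.List (sortBy)
import Data.Ord (Down (..), comparing)

-- Cosine similarity of two vectors of equal length.
-- The result is NaN if either vector is all zeros.
cosine :: [Float] -> [Float] -> Float
cosine a b = dot a b / (norm a * norm b)
  where
    dot x y = sum (zipWith (*) x y)
    norm x  = sqrt (dot x x)

-- The k vocabulary entries most similar to the query vector,
-- ordered from most to least similar.
kNearest :: Int -> [Float] -> [(String, [Float])] -> [(String, Float)]
kNearest k query vocab =
  take k (sortBy (comparing (Down . snd)) scored)
  where
    scored = [(w, cosine query v) | (w, v) <- vocab]

-- Hypothetical toy vocabulary with two-dimensional vectors.
toyVocab :: [(String, [Float])]
toyVocab = [("a", [1, 0]), ("b", [0, 1]), ("c", [1, 1])]
```

For example, `map fst (kNearest 2 [1, 0] toyVocab)` ranks "a" first and "c" second. This version scores every entry on each query and does not exclude the query word itself.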