| write.paragraph2vec {doc2vec} | R Documentation |
Save a paragraph2vec model as a binary file to disk
write.paragraph2vec(x, file)
| x | a paragraph2vec model object, as returned by paragraph2vec |
| file | the path of the file in which to store the model |
invisibly, a logical indicating whether the resulting file exists and has been written to disk
library(doc2vec)
library(tokenizers.bpe)

# Keep the French speeches that are non-empty and shorter than 1000 words
data(belgium_parliament, package = "tokenizers.bpe")
x <- subset(belgium_parliament, language %in% "french")
x <- subset(x, nchar(text) > 0 & txt_count_words(text) < 1000)

# Train a model; the second call overwrites the first, so the PV-DBOW model is the one saved
model <- paragraph2vec(x = x, type = "PV-DM", dim = 100, iter = 20)
model <- paragraph2vec(x = x, type = "PV-DBOW", dim = 100, iter = 20)

# Write the model to disk and read it back
path <- "mymodel.bin"
write.paragraph2vec(model, file = path)
model <- read.paragraph2vec(file = path)

# Inspect the vocabulary and the embeddings of the reloaded model
vocab <- summary(model, type = "vocabulary", which = "docs")
vocab <- summary(model, type = "vocabulary", which = "words")
embedding <- as.matrix(model, which = "docs")
embedding <- as.matrix(model, which = "words")
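Because the return value is invisible, nothing is printed when write.paragraph2vec is called on its own. The sketch below shows one way to capture that value and to check the round trip. It reuses the data preparation from the example above. The small dim and iter settings, the tempfile() path, and the variable names ok, reloaded, before and after are assumptions made for illustration, chosen only to keep the run short.

```r
library(doc2vec)
library(tokenizers.bpe)

# Same data preparation as in the example above
data(belgium_parliament, package = "tokenizers.bpe")
x <- subset(belgium_parliament, language %in% "french")
x <- subset(x, nchar(text) > 0 & txt_count_words(text) < 1000)

# Deliberately small settings so the sketch runs quickly (illustrative only)
model <- paragraph2vec(x = x, type = "PV-DBOW", dim = 10, iter = 1)

# tempfile() avoids leaving a file behind in the working directory
path <- tempfile(fileext = ".bin")

# The result is returned invisibly, so assign it in order to inspect it
ok <- write.paragraph2vec(model, file = path)
if (!isTRUE(ok) || !file.exists(path)) {
  stop("model was not written to ", path)
}

# Reload the model and look at the document embeddings on both sides
reloaded <- read.paragraph2vec(file = path)
before <- as.matrix(model, which = "docs")
after  <- as.matrix(reloaded, which = "docs")
print(dim(before))
print(dim(after))

# Remove the temporary file
unlink(path)
```

Wrapping the call in parentheses, as in (write.paragraph2vec(model, file = path)), is the usual R idiom for printing an invisible result without assigning it.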