Get Tidy Token Names

get_names_(file)

get_names(tag)

Arguments

file

File containing tagged names.

tag

Character vector containing tagged names.

Value

data.frame of 3 columns:

  1. string - string identified

  2. type - type of name

  3. name - extracted name

The result is returned as a tibble.
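To illustrate the three columns listed above, here is a minimal base R sketch that parses tagged text into the same shape. It is not the package's implementation: `parse_tagged()` is a hypothetical helper, and it assumes that `string` holds the full tagged span, `type` the label after `START:`, and `name` the text between the tags.

```r
# Hypothetical helper (not part of the package): extract tagged names
# of the form "<START:type> name <END>" with base R regular expressions.
parse_tagged <- function(tag) {
  pattern <- "<START:([^>]+)>\\s*(.*?)\\s*<END>"

  # Collect every tagged span found in the input character vector
  strings <- unlist(regmatches(tag, gregexpr(pattern, tag, perl = TRUE)))

  data.frame(
    string = strings,                                   # full tagged span (assumed meaning)
    type   = sub(pattern, "\\1", strings, perl = TRUE), # label after "START:"
    name   = sub(pattern, "\\2", strings, perl = TRUE), # text between the tags
    stringsAsFactors = FALSE
  )
}

tagged <- "It is often referred to as <START:wef> Davos <END> or the <START:wef> WEF <END> ."
parsed <- parse_tagged(tagged)
parsed
```

Here `parsed` has one row per tagged span, so the sample sentence yields two rows, both of type `wef`. The actual output of `get_names()` may differ in details such as column classes.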

Examples

# NOT RUN {
# get the working directory:
# the model file must be given as a full path
wd <- getwd()

# training data: mentions of the WEF are tagged with <START:wef> ... <END>
data <- paste("This organisation is called the <START:wef> World Economic Forum <END>",
  "It is often referred to as <START:wef> Davos <END> or the <START:wef> WEF <END> .")

# train the model
model <- tnf_train(model = paste0(wd, "/wef.bin"), lang = "en",
  data = data, type = "wef")

# Create sentences to test our model
sentences <- paste("This sentence mentions the World Economic Forum the annual meeting",
  "of which takes place in Davos. Note that the forum is often called the WEF.")

# run model on sentences
results <- tnf(model = model, sentences = sentences)

# extract strings
(ext <- get_names(results))
# }