Skip to contents

This function extracts and counts the most common terms in a corpus.

Usage

extract_terms(
  article_data,
  text_column = "abstract",
  n = 100,
  remove_stopwords = TRUE,
  min_word_length = 3
)

Arguments

article_data

A data frame containing article data.

text_column

Name of the column containing the text to analyze.

n

Number of top terms to return.

remove_stopwords

Logical. If TRUE, removes stopwords.

min_word_length

Minimum word length to include.

Value

A data frame containing term counts.

Examples

if (FALSE) { # \dontrun{
common_terms <- extract_terms(article_data, text_column = "abstract")
} # }