Skip to contents

This function finds documents similar to a given document based on TF-IDF and cosine similarity.

Usage

find_similar_docs(text_data, doc_id, text_column = "abstract", n_similar = 5)

Arguments

text_data

A data frame containing text data.

doc_id

ID of the document to find similar documents for.

text_column

Name of the column containing text to analyze.

n_similar

Number of similar documents to return.

Value

A data frame with similar documents and their similarity scores.

Examples

if (FALSE) { # \dontrun{
similar_docs <- find_similar_docs(article_data, doc_id = 1,
                                     text_column = "abstract", n_similar = 5)
} # }