brown_metadata {tlda} | R Documentation
Text metadata for Brown corpora
Description
This dataset provides metadata for the text files in the Brown family of corpora. It maps each standardized file name to two levels of textual category: a macro genre and a genre.
Usage
brown_metadata
Format
brown_metadata
A data frame with 500 rows and 3 columns:
- text_file
Standardized name of the text file (e.g. "A01", "J58", "R07")
- macro_genre
Macro genre of the text, one of 4 values ("press", "general_prose", "learned", "fiction")
- genre
Genre of the text, one of 15 values (e.g. "press_editorial", "popular_lore", "adventure_western_fiction")
Source
McEnery, Tony & Andrew Hardie. 2012. Corpus linguistics. Cambridge: Cambridge University Press.
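Examples
A minimal sketch of how the metadata might be inspected and joined to per-text results. It assumes the tlda package is installed; the `scores` data frame and its `score` column are hypothetical placeholders invented for illustration and are not part of the package.

```r
# Run only if the tlda package is available (assumption: it is installed)
if (requireNamespace("tlda", quietly = TRUE)) {
  data("brown_metadata", package = "tlda")

  # Inspect the three documented columns: text_file, macro_genre, genre
  str(brown_metadata)

  # Number of texts per macro genre and per genre
  print(table(brown_metadata$macro_genre))
  print(table(brown_metadata$genre))

  # Hypothetical per-text results (random placeholder values, not real data)
  set.seed(1)
  scores <- data.frame(
    text_file = brown_metadata$text_file,
    score     = runif(nrow(brown_metadata))
  )

  # Attach genre information by matching on the standardized file name
  merged <- merge(scores, brown_metadata, by = "text_file")

  # Summarize the placeholder scores by macro genre
  print(aggregate(score ~ macro_genre, data = merged, FUN = mean))
}
```

Matching on text_file keeps the genre labels separate from the analysis results, so the same metadata can be reused for any table keyed by file name.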
[Package tlda version 0.1.0 Index]