Get utterances

get_utterances(
  collection = NULL,
  language = NULL,
  corpus = NULL,
  role = NULL,
  role_exclude = NULL,
  age = NULL,
  sex = NULL,
  target_child = NULL,
  connection = NULL,
  db_version = "current",
  db_args = NULL
)

Arguments

collection

A character vector of one or more names of collections

language

A character vector of one or more languages

corpus

A character vector of one or more names of corpora

role

A character vector of one or more roles to include

role_exclude

A character vector of one or more roles to exclude

age

A numeric vector of an single age value or a min age value and max age value (inclusive) in months. For a single age value, participants are returned for which that age is within their age range; for two ages, participants are returned for whose age overlaps with the interval between those two ages.

sex

A character vector of values "male" and/or "female"

target_child

A character vector of one or more names of children

connection

A connection to the CHILDES database

db_version

String of the name of database version to use

db_args

List with host, user, and password defined

Value

A `tbl` of Utterance data, filtered down by supplied arguments. If `connection` is supplied, the result remains a remote query, otherwise it is retrieved into a local tibble.

Examples

# \donttest{ get_utterances(target_child = "Shem")
#> Using current database version: '2020.1'.
#> Getting data from 1 child in 1 corpus ...
#> # A tibble: 42,470 x 27 #> id gloss stem actual_phonology model_phonology type language #> <int> <chr> <chr> <chr> <chr> <chr> <chr> #> 1 345783 yeah yeah "" "" decla… eng #> 2 345784 what was … what be t… "" "" quest… eng #> 3 345785 ow ow "" "" decla… eng #> 4 345786 ow ow "" "" quest… eng #> 5 345787 yeah yeah "" "" decla… eng #> 6 345788 want a bi… want a bi… "" "" quest… eng #> 7 345789 ow ow "" "" decla… eng #> 8 345790 ow what's… ow what t… "" "" quest… eng #> 9 345791 Richard Richard "" "" quest… eng #> 10 345792 no no "" "" decla… eng #> # … with 42,460 more rows, and 20 more variables: num_morphemes <int>, #> # num_tokens <int>, utterance_order <int>, corpus_name <chr>, #> # part_of_speech <chr>, speaker_code <chr>, speaker_name <chr>, #> # speaker_role <chr>, target_child_name <chr>, target_child_age <dbl>, #> # target_child_sex <chr>, media_start <dbl>, media_end <dbl>, #> # media_unit <chr>, collection_name <chr>, collection_id <int>, #> # corpus_id <int>, speaker_id <int>, target_child_id <int>, #> # transcript_id <int>
# }