Peekbank Framework

The Peekbank data framework consists of three components (see Figure on right):

  1. processing raw experimental datasets
  2. populating a relational database
  3. providing an interface to the database

The code used to process raw datasets is hosted in peekbank-data-import and processed data files are hosted on the public Peekbank OSF repository.

The peekds library helps researchers convert and validate existing datasets to use the relational format of the database.

The peekbank module (Python) creates a database with the relational schema and populates it with the standardized datasets produced by peekds. The database is implemented in MySQL, an industry standard relational database, which may be accessed by a variety of programming languages over the internet.

The peekbankr library (R) provides an application programming interface, or API, that offers high-level abstractions for accessing data in Peekbank.

The peekbank-shiny app provides a tool for interactively visualizing the data.

Data Schema

We developed a common, tidy format for the eye-tracking data in Peekbank to ease the process of conducting cross-dataset analyses.

The schema consists of a set of tables, each tracking different types of information about a given dataset (see below for a list of tables). Each table can be accessed through functions in the peekbankr R package. Table columns are linked through connecting IDs (indicated by links in the schema figure below).

  • datasets: Information about the eyetracking dataset, including citation info
  • subjects: Data on individual participants
  • administrations: Properties of a specific session in which an individual subject participated
  • aoi_timepoints: Participant looking to the areas of interest (target vs. distractor) sampled at regular time intervals
  • xy_timepoints: Gaze coordinates of participants’ looking to the screen sampled at regular time intervals (if available)
  • trial_types: Properties of each trial, including the stimuli, target location, and carrier phrase.
  • stimuli: Stimuli in the form of word-image pairs
  • trials: A record of the specific order in which a subject completed trials
  • aoi_region_sets: Positional information about areas of interest (if available)

Data Codebook

The codebook for individual columns in Peekbank data columns can be found in the table below (link):

The Peekbank project consists of the following repositories and tools.

Peekbank is open source and under active development. If you run encounter a problem, please file an issue in the GitHub Issues page of the appropriate repository.

If you would like to contact us directly, email us at peekbank-dev[at]lists.stanford.edu.