Step 1: Vocabulary

An assemblage is the sum total of all the material collected and created in the course of your project, research, or work.

A collection is a set of different kinds of materials, for example: bunches of recorded conversations, recorded songs, procedural texts, photographs, videos, or analytical documents.

A corpus is a special type of collection that is designed for a specific purpose.

Digital data repositories are repositories where digital files (or data) are stored; generally intended to accommodate all kinds of data from different disciplines, and as such, they usually handle the assortment of different files typically found in language collections quite well, though their metadata fields may not be particularly well-suited for handling language data.

A digital language archive is a digital repository whose primary purpose is to preserve materials (files) that are in or about specific languages, usually Indigenous or minority languages that are under-documented, under-resourced, under-described, and less commonly taught.

Graded access implies different levels of access depending on the files themselves or the role of the archive user.

Granular access means that certain files in a folder of related files may be restricted while other files in the same folder are not.

An institutional repository is a type of repository where faculty and students can deposit their works. Some are data repositories, but often they are systems that are designed to hold research products such as reports, articles, theses and dissertations that can exist as single, simple digital objects and not datasets or audio-visual recordings.

Metadata is the data about your data, which includes information about what the data is, the format it is in, and its purpose.

Metadata practices are the practices an archivist or researcher follows to create metadata, which can allow future users to determine its usefulness.

Primary data is audio or video recordings or written observations of spoken or signed language that are used for analyses, including narratives, oral histories, elicitation, and experimental protocols.

Secondary data refers to transcriptions, translations, morpheme breakdowns, glosses and other types of annotation that require some level of preliminary analysis of primary data to create. 

A time embargo restricts access to a file for a limited and pre-determined amount of time.

Complete and Continue  
Discussion

0 comments