The Language of Language Machines

Progressing the Terminology

LOLM is developed through a working group that meets weekly. This document details the structure for the process by which we surface, construct, and editorially manage new entries in the project.

General method

An entry in LLM should be neither purely a technical history nor a media-theoretical critique of a term. The project is designed to contextualize the way in which terms in the NLP technical literature emerge, come to prominence, and leak historical and philosophical meaning through their circulation.

Each entry should aim to:

  1. Locate the paper/s in which the lexicon was first introduced into the computer scientific lexicon. Is there a particular paper that coined the term? Does this paper or originary location give a justification or set of motivations for the particular turn of language?
  2. Track the development and dissemination of the term in the recent history of technical literature since its inception. How widely is it used? Has it inspired or been refined by other terms or concepts in NLP or more broadly? Does this term draw any critical dependencies on other terms in the technical literature? (If these terms exist in LOLM, reference complementary entries with relative links.)
  3. Surface historical precedents or referents for the term. What are the ways in which the term has been deployed prior to its technical coinage, and how do these deployments bear on the term’s stature in technical literature? Does the term have significant meaning in any other disciplines or technical vocabularies?
  4. Offer a philosophical, critical, or media-theoretical framing for the term without abandoning its technical specificity. What critical distance, if any, should we keep from the way that the term circulates in NLP literature? Does the term bootleg any naturalizing or otherwise problematic assumptions about the technical mechanism to which it refers (such as a claim to biological verisimilitude)?
  5. Conclude with a set of questions to inspire further inquiry. This might be a seed for further media theoretical thought, an acknowledgment of missing history, or a query about how we might shore up misconceptions associated with the term or the mechanism it presumes to represent.

Term-virgils

New terms are brought to the LOLM working group through a virgil. Any member of the working group can self-elect as a new term’s virgil, and all working group sessions will begin with an open call for new term-virgils. Not all working group members need to self-elect as a term-virgil, and there is no limit to the number of concurrent terms for which one person can self-elect as virgil.

A term’s virgil brings a new term to the working group with at least one and ideally multiple of the following:

Once a new term/virgil has been presented, the working group will collectively discuss it. The aim of this discussion is to fill out the body of the aims listed above in the ‘General method’. Post-discussion, the working group will then elect an editor for the term who will work with the virgil to fill an entry in LOLM (see next section).

Term-editors

A term-editor is responsible for filling out an entry in LOLM itself by working with the term-virgil and other invested/interested parties.

This contribution will ultimately take the form of a pull request to the LOLM GitHub repository. All sections of LOLM are written in Typst, and LOLM is envisioned as being published concurrently as a website (https://lolm.ohrg.org/), a PDF, and an EPUB through the Rheo toolchain. (Merging a pull request to the main branch of the LOLM GitHub repository will trigger a recompilation of all three of these formats.)

Whereas the term-virgil is responsible for surfacing the dimensions of a term that should be documented in its LOLM entry, the term-editor is responsible for compiling these many threads into a single and coherent entry in the project. The construction of this entry may benefit from separate meetings between virgil and editor, and perhaps even sub-working groups if there are many interested parties.

There is no strict timeframe on a term’s transition from discussion (via virgil) to page (via editor). Each working group session will assess the terms that are in flight, asking status updates of virgil/editor pairs and whether the contribution would benefit from any further discussion or help.

Platforms and tools

The LOLM working group uses the following tools and platforms:

The lifecycle of a term by example: ‘attention’

[pending…]