Rhetorical structure theory
Theory of text organization
From Wikipedia, the free encyclopedia
Rhetorical structure theory (RST) is a theory of text organization that describes relations that hold between parts of text. It was originally developed by William Mann, Sandra Thompson, Christian M. I. M. Matthiessen and others at the University of Southern California's Information Sciences Institute (ISI) and defined in a 1988 paper.[1][2][3] The theory was developed as part of studies of computer-based text generation. Natural language processing researchers later began using RST in automatic summarization and other applications. It explains coherence by postulating a hierarchical, connected structure of texts,[3] which are labeled using a small, predefined inventory of relation types - for example, one part of a text may provide an elaboration on another part, provide background or specify a cause for another.[1]
In the 2000s, following the release of the first large-scale dataset implementing the theory, the RST Discourse Treebank (RST-DT),[4] Daniel Marcu demonstrated the feasibility of practical applications of RST to discourse parsing and summarization at ISI.[5] Originally limited to written text, subsequent work in the 2010s expanded RST to spoken language analysis,[6] and the framework has been applied to a variety of languages including Farsi,[7] German,[8] Mandarin Chinese,[9] Russian[10] and Spanish.[11] Following the introduction of Transformers, LLMs have been applied to automatic RST parsing,[12] with results approaching human performance on parsing text in English.
Rhetorical relations
Rhetorical relations, also called coherence or discourse relations, are paratactic (coordinate) or hypotactic (subordinate) relations that hold across two or more text spans.[13] The logical arrangement of relations in a text contributes to its coherence by connecting different propositions in a relational structure. RST using rhetorical relations provides a systematic way for an analyst to analyze the underlying intention of a text. The analysis is usually built by reading the text and constructing a tree using the relations. The following example is a title and summary, appearing at the top of an article in Scientific American magazine (adapted from Ramachandran and Anstis, 1986). The original text, broken into numbered units, is:[3]

- [Title:] The Perception of Apparent Motion
- [Abstract:] When the motion of an intermittently seen object is ambiguous
- the visual system resolves confusion
- by applying some tricks
- that reflect a built-in knowledge of properties of the physical world.
In the figure, the numbers 1-5 show the corresponding units from the text above. Unit 5 provides an "elaboration" on unit 4, and therefore constitutes a less prominent satellite of unit 4, which acts as a nucleus for the relation. Units 4-5 form a relation "Means", explaining the means by which the visual system resolves confusion. Unit 3 is the Central Discourse Unit (CDU) of the text, since all units point to it directly or indirectly. Similarly units 1 and 2 form "preparation" and "circumstance" relations relative to their nuclei. Groups of units which serve as a satellite or nucleus together are called complex discourse units, and always span a set of adjacent EDUs.
Nuclearity in discourse
RST establishes two different types of units. Nuclei are considered as the most important parts of text whereas satellites contribute to the nuclei and are secondary. Nucleus contains basic information and satellite contains additional information about nucleus. The satellite is often incomprehensible without nucleus, whereas a text where satellites have been deleted can be understood to a certain extent.
Hierarchy in the analysis
RST relations are applied recursively in a text, until all units in that text are constituents in an RST relation. The result of such analyses is that RST structure are typically represented as trees, with one top level relation that encompasses other relations at lower levels.
Why RST?
- From linguistic point of view, RST proposes a different view of text organization than most linguistic theories.
- RST points to a tight relation between relations and coherence in text
- From a computational point of view, it provides a characterization of text relations that has been implemented in different systems and for applications as text generation[14] and summarization.[15]
In design rationale
Computer scientists Ana Cristina Bicharra Garcia and Clarisse Sieckenius de Souz have used RST as the basis of a design rationale system called ADD+.[16][17] In ADD+, RST is used as the basis for the rhetorical organization of a knowledge base, in a way comparable to other knowledge representation systems such as issue-based information system (IBIS).[17] Similarly, RST has been used in representation schemes for argumentation.[18][19][20]