Projects – Project Converse

Enchrony

Intone

Engage

Move

Bridging Learning

a student uses an AAC device while communicating with another student across a table

Co-Directors: Higginbotham & Possemato
Associate Researchers: Koroschetz, Satchidanand, Project OPEN Collaborators
Start Date: October 2021

Our interaction research has shown that although augmented speakers communicate at varying rates, delays in message composition consistently create challenges during conversation. These delays can lead to mishearings, misunderstandings, disruptions in shared attention, and breakdowns in mutual understanding.

Many of these difficulties stem from the time required to compose linguistic messages using AAC systems. Communication delays place significant demands on attention and can disrupt the expected timing and flow of everyday interaction. For example, our research has shown that when communication partners speak simultaneously, the relevance and interpretation of an augmented speaker’s contribution can shift or become disrupted.

Our work has also examined how individuals using AAC navigate conversational repair, including both self-initiated repair and repair prompted by others. This research has identified important differences between spoken interaction and AAC-mediated interaction, particularly regarding the effort and time required to clarify misunderstandings and repair communication breakdowns.

Together, these findings provide an empirical foundation for evaluating and guiding our recent AI-AAC design efforts. They also establish measurable interaction benchmarks that can help assess whether emerging communication technologies meaningfully improve conversational participation and understanding.

Origins & Rationale

Research Progress

Studies

Summary

“I found not having a real-time voice was the equivalent to not having any defense to what was done to my body…”
(Robillard, A. B. (1994). Communication problems in the intensive care unit. Qualitative Sociology, 17(4), 383–395.)

Enchrony refers to the moment-by-moment temporal organization of social interaction: the way actions are sequentially organized in-time so that each action responds to what just happened and shapes what is expected next. For Project Enchrony, this temporal-sequential organization provides the central frame for understanding AAC-mediated interaction:

Conversation happens within an enchronic frame.
Conversation carries a temporal imperative to keep interaction moving forward, a feature conversation analysts often describe as progressivity.
Participants ordinarily work within a rolling “now” of roughly two to two-and-a-half seconds, where actions are still treated as connected to the immediately prior action.
This enchronic window is a primary basis for organizing social interaction and has direct implications for participation in conversation, learning, work, and everyday social life.

Robillard’s work provides an important starting point for this project. Typical conversation is cooperative: we talk with each other in time, producing an orderly rhythm of coordinated action and speech. Robillard (1994) argues that this rhythm assumes the intersubjective coordination of physical bodies, making impaired access to conversational timing a breach of the normal conversational order that can leave the speaker vulnerable to misunderstanding, ignoring, or being treated as disruptive.

Now Time=0 to a few seconds; Near Time=2-10 seconds; Delay Time=10 seconds to several minutes.

This figure illustrates the temporal frame through which conversational actions are understood:

Now time: the immediate space in which turn transitions and responses are expected.
Near time: the rolling enchronic window in which actions are still treated as connected to what just occurred.
Delayed time: the point at which responses become harder to link to the prior action and more vulnerable to misunderstanding, disattention, overlap, repair, or loss of sequential fit.
AAC composition delay: AAC composition delay is the time between an augmented speaker’s decision to compose an utterance and the moment that the utterance becomes available to the partner through visual display or speech output. This delay can affect overall communication speed, as well as the utterance’s sequential relevance and social uptake in conversation.

Project Enchrony grew out of CADL’s long-standing concern with the temporal-sequential organization of AAC-mediated interaction. The project focuses on how participants respond to temporally delayed talk, including the adjustments they make, the problems that result, and the ways AAC technologies reshape the timing of ordinary conversation.

Through collaboration with Project OPEN and prior CADL research, we developed a large video database of social interactions involving individuals with ALS, cerebral palsy, and other disabilities as they interact with others using their communication devices.

The central problem addressed by Project Enchrony is that AAC-mediated conversation is not delayed only in a mechanical or rate-based sense. Composition delay alters the temporal relationship between actions. A contribution that was sequentially relevant when the augmented speaker began composing may become misaligned, redundant, or difficult to interpret by the time it is produced. This makes enchrony a key analytic frame for understanding why otherwise appropriate and intelligible AAC utterances can become interactionally problematic.

Building the Data Infrastructure

Years 1 and 2 were devoted to developing a transcription protocol, establishing a pool of student transcribers, and training them to transcribe embodied and technologically mediated interaction between augmented speakers and their conversational partners. This training focused on treating participants’ bodies as interactional resources while also documenting how AAC devices shape the organization of conversation. Most participants were recruited through Project OPEN.

We collected video data by traveling to the homes of augmented speakers and their partners, including parents, spouses, children, and friends. Participants were recorded while engaging in a series of conversation tasks, such as video-sequence sorting, map reading, and shared-experience conversations. To date, most of our analytic work has focused on the shared-experience conversations.

Multimodal Transcription and ELAN

Our transcription process uses the EUDICO Linguistic Annotator, or ELAN, to produce detailed embodied transcripts of AAC-mediated interactions. ELAN allows us to create multimodal transcripts that capture speech, gaze, gesture, device activity, and speech output as they unfold between the augmented speaker and their conversational partner.

This transcription approach was essential because the phenomena of interest were not located in speech alone. Composition time, concurrent talk, repair, gaze shifts, device selections, and speech output all had to be examined as temporally coordinated parts of the same interactional ecology. The ELAN-based workflow therefore, became a methodological foundation for studying the relationship between device-mediated message production and the unfolding organization of talk-in-interaction.

Identifying the Problem of Composition Delay

During the second year, we began to identify recurring problems experienced by augmented speakers and their partners during interaction. One important case involved Ann and Bill discussing their honeymoon. During the conversation, Ann and Bill overlapped each other while talking, resulting in a complete misunderstanding by Bill. The case was especially striking because Ann typed at approximately 25 words per minute, which is extremely fast for someone using AAC. We did not expect to find this level of interactional trouble at such a comparatively fast AAC rate.

Note the overlap between two participants’ responses.

This case spurred a larger investigation of composition delay involving eight individuals with ALS and their partners. We used both quantitative descriptive methods to characterize aggregate patterns and detailed multimodal microanalytic techniques to examine how these problems emerged in interaction.

Grounded Unit Analysis

A major methodological contribution of Project Enchrony was the development of grounded unit analysis. During early work on composition delay, it became clear that we needed a way to segment conversation into interactional units that were comparable across participants, access methods, and conversations.

A grounded unit is organized around the augmented speaker’s composition activity. It includes the contribution before the augmented speaker’s turn that makes their utterance relevant, the augmented speaker’s whole composition process, actions by either participant during composition, the eventual AAC output, and the partner’s response. This unit allows us to analyze AAC-mediated interaction as a temporally organized sequence rather than as isolated device output.

We have applied grounded unit coding to approximately 20 conversations and plan to use this code in current and future studies. This approach is especially well-suited to SGD users who compose and issue utterances “in-the-whole,” but it can also be extended to more diverse utterance-production strategies, including communication-board interaction and mixed spoken/spelled production.

We have been using the grounded unit concept for our work on composition delay and an ongoing project in Project Open focusing on the impact of concurrent talk (i.e., partner talk that occurs during the augmented speaker’s ongoing compositions).

Analysis of Composition Delay

In collaboration with Project OPEN, our work on composition delay focused on understanding the talk-in-interaction dynamics that occur during augmentative message composition and the problems in intersubjectivity that arise during this process. This research also helped establish an empirical foundation for future AAC talk-in-interaction analysis by clarifying key concepts, terminology, and analytic techniques.

We studied the interactions of eight individuals with ALS and their partners while they discussed shared experiences. This analysis identified 91 grounded units for study. The participants varied substantially in access method, diagnosis, composition rate, and duration of composition activity.

We identified two groups of fast and slow communicators based on access method and diagnosis. Two-handed typists composed approximately four to five times faster than participants using switch access, eye tracking, head tracking, or other slower access methods. However, even relatively fast AAC composition frequently extended beyond the ordinary enchronic window of conversation.

Across the 91 grounded units, 78% contained talk that was concurrent with device users’ AAC composition. Of those grounded units with concurrent talk, 32% were problematic. In contrast, only 15% of grounded units without concurrent talk were identified as problematic. The problems identified included difficulties of hearing or reading, sequential misalignment, minimal or absent responses, redundant output, abandonment, and meta-talk.

These findings suggest that partner talk during AAC composition is not inherently problematic, but it increases the likelihood of temporal-sequential trouble. When partners talk while augmented speakers are composing, the interaction may move forward before the AAC utterance is produced, changing the sequential environment into which the utterance eventually arrives.

We presented this work at the Atypical Interaction Conference and the American Speech-Language-Hearing Association convention. Here is a link to the interactive poster. We are also finalizing a manuscript for submission to Augmentative and Alternative Communication. The paper is intended to introduce the enchronic interaction perspective to the AAC research community and to provide an empirical study of composition delay.

Repair in Interaction

Repair is a central issue in AAC-mediated interaction because composition delay, technology constraints, and the social organization of conversation all affect how communication problems are recognized and resolved. We undertook a series of investigations to examine the prevalence, temporal-sequential dynamics, social antecedents, and consequences of repair in augmented interaction. Major findings from our work on other-initiated and self-initiated repair were presented at ASHA in November 2023.

Repair is especially important from an enchronic perspective because repair practices often depend on immediate access to language to support the precise timing and placement of utterances. AAC users may be more vulnerable to misunderstanding because of composition delay while also having fewer timely resources for correcting those misunderstandings once they occur.

Other-Initiated Repair

We published our first study on other-initiated repair in AAC in 2023. Using both simulated and human performance data, we found that initiating repair within the enchronic temporal order was largely unavailable to most augmented speakers, except through single selections produced at 11 words per minute or faster, or short phrases and utterances produced at more than 55 words per minute. A major AAC device manufacturer has already applied these findings.

This research demonstrated a fundamental asymmetry in AAC-mediated conversation. While oral speakers can generally initiate repair quickly enough to remain within the unfolding temporal order of conversation, many augmented speakers cannot do so through ordinary device composition methods.

The mechanisms of other-initiated repair in AAC-mediated conversation were further explored in the dissertation research of Antara Satchidanand. Using a mixed-methods design, she adapted a coding scheme designed by Dingemanse et al. (2016) for the cross-linguistic analysis of other-initiated repair and applied it to an expanded version of our composition-delay dataset. .

In a preliminary analysis of ten speakers with ALS, Satchidanand found that repair in SGD-mediated conversation differs from repair in typical spoken interaction in several important ways. Repair initiation in augmented conversation was shown to be heavily asymmetric, with augmented speakers contributing only a negligible proportion of repair initiations.

The majority of repair initiations were performed by the oral speaking partner.

Open-class repair initiations, which request repair without identifying the problematic portion of the trouble-source utterance or specifying the nature of the problem, occurred less frequently than in typical conversation.

Oral speakers initiated repair differently when in conversation with augmented speakers vs other oral speakers.

Interestingly, a key finding concerned the behavior of the oral-speakers in our sample, who initiated repair differently when in conversation with augmented speakers. Candidate understandings, a type of repair that shifts the effort/costs of repair from the trouble source speaker to the repair initiator, were used by the oral-speakers in our sample more frequently than has been observed in conversations between two oral-speakers.

Finally, non-minimal repair sequences, which require more than one attempt to resolve, occurred much more frequently in augmented conversation. Many extended repair sequences involved multiple repair initiations on the same trouble-source utterance, first to clarify accurate perception of the utterance and then to address issues of comprehensibility.

Self-Initiated Repair

Project Enchrony also examined self-initiated repair in AAC-mediated interaction. First, a simulation study demonstrated the keystroke savings possible when a word-delete option was available in executing self-repair. APP1 below provided researchers with such an option while APP2 included only character deletion and screen clear options.

Selection costs of SISTSR: Deletion Types per App
Note: This figure is copyrighted by the authors (Satchidanand, Rayman, and Higginbotham, 2022) and is used with permission.

This study also revealed the keystrokes added to utterance production when self-repair is included with and without the word-deletion option.

The availability of a word- and screen-clear options button substantially improved the efficiency of message selection during self-repair.

Building from this work, in a graduate student research project, Cassandra Vecchio examined self-repair in situ by analyzing task-related interactions from 10 dyads, represented in 30 videos, in which one participant had late-stage ALS. Her preliminary findings, presented at ASHA, showed that self-initiated repair occurred in approximately one-third of augmented speakers’ utterances and accounted for 26% of their keystrokes.

Vecchio’s analysis of deletion types supported these findings: each clear-screen selection deleted an average of 3.5 characters, word deletion removed 1.5 characters per selection, and deleting a single character required an average of 2.3 character-deletion actions.

These findings indicate that current AAC devices are not well configured to support the efficient self-repair and editing practices required during AAC-mediated conversation.

Gabrielle Martino conducted a subsequent analysis of the efficiency of deletion types used by participants in Vecchio’s study, which revealed that while three options for the deletion of message content were available to users, they often did not choose the most efficient option, choosing character-by-character deletion instead.

A manuscript that brings together the results of the simulation and in situ study of self-repair is currently being prepared for submission by the end of June.

Project Enchrony developed from a conceptual concern with conversational timing into a systematic research program on the temporal organization of AAC-mediated interaction. Across the Project Converse funding period, the project produced a multimodal transcription infrastructure, a large video database, grounded unit analysis, empirical studies of composition delay, investigations of other-initiated and self-initiated repair, conference presentations, manuscripts, and design principles for future AAC technologies.

The central finding from Enchrony is that many AAC interaction problems are problems of temporal-sequential organization, not simply problems of vocabulary, communication rate, or message output. Face-to-face conversation unfolds within a narrow enchronic window, where actions are expected to be responsive, sequentially fitted, and close enough in time to remain intelligible as responses. AAC technologies often make it difficult for augmented speakers to act within that window. As a result, utterances that were appropriate when composition began may be heard as late, disconnected, or difficult to place by the time they are finally produced.

Composition delay is therefore not simply slow message production. It reorganizes the interactional ecology. While an augmented speaker is composing, they may be working intensely on their next contribution, but much of that work is invisible to the partner. These periods of invisible progressivity place substantial demands on the partner’s vigilance, who must wait, remain engaged with little evidence of substantive progress, maintain memory of the prior conversational context, and be ready to respond when the postponed utterance is eventually spoken. During this time, we found that many partners continue talking, shift topics, pursue other lines of action, or treat the delay itself as meaningful.

Enchrony’s repair findings further show that many AAC users face a double burden. Composition delay increases vulnerability to misunderstanding by obscuring the timing, target, and relevance of an utterance; at the same time, many users lack reliable access to timely repair initiation and efficient self-repair once trouble occurs. Other-initiated and self-initiated repair studies showed that AAC-mediated repair differs substantially from ordinary spoken repair and often requires more time, effort, and partner coordination.

Together, these findings have direct implications for AAC research, design, and clinical practice. AAC systems should not be evaluated only by whether they produce accurate messages, provide sufficient vocabulary, or increase communication rate. They must also be evaluated by whether they support temporal participation: the ability to respond, repair, interrupt, affiliate, resist, clarify, and coordinate with others as interaction unfolds moment by moment.

The design challenge identified by Enchrony is therefore clear. Future AAC technologies must help augmented speakers use their devices to interact in-time with their partners. This means supporting not only message generation, but also the timing, visibility, repairability, and temporal-sequential placement of contributions within the ongoing organization of face-to-face conversation.

Formerly: Expressive Speech Synthesis
Co-Directors: Higginbotham, Szekely & Possemato
Associate Researcher: Horowitz
Start Date: July 2022

Project Intone was developed to investigate how expressive synthetic speech can better support AAC-mediated conversation. The project emerged from the recognition that conversational participation depends not only on what is said, but also on how it is said: timing, intonation, loudness, emphasis, affect, and stance all shape the social meaning of an utterance.

Our Move research demonstrated that reducing delays in conversational turn-taking allows spoken intonation to become more functionally integrated into social interaction. As conversational timing becomes more natural, intonation can support important interactional functions such as emphasis, referencing, responsiveness, and conversational coordination.