Controlling the Behavior of Animated Presentation Agents in the Interface: Scripting versus Instructing
Andre, Elisabeth, Rist, Thomas, AI Magazine
Lifelike characters, or animated agents, provide a promising option for interface development because they allow us to draw on communication and interaction styles with which humans are already familiar. In this contribution, we revisit some of our past and ongoing projects to motivate an evolution of character-based presentation Systems. This evolution starts from systems in which a character presents information content in the style of a TV presenter. It moves on with the introduction of presentation teams that convey information to the user by performing role plays. To explore new forms of active user involvement during a presentation, the next step can lead to systems that convey information in the style of interactive performances. From a technical point of view, this evaluation is mirrored in different approaches to determine the behavior of the employed characters. By means of concrete applications, we argue that a central planning component for automated agent scripting is not always a good choice, especially not in the case of interactive performances where the user might take on an active role as well.
A growing number of research projects in academia and industry have started to develop lifelike characters or agents as a metaphor for highly personalized humanmachine communication. Work in this area is motivated by a number of supporting arguments, including the fact that such characters allow for communication styles common in human-human dialogue and thus can release users from the burden to learn and familiarize themselves with less native interaction techniques. Furthermore, well-designed characters show great potential for making interfacing with a computer system more enjoyable. One aspect when designing a character is to find a suitable visual and audible appearance. In fact, there is now a broad spectrum of characters that rely on either cartoon drawings, recorded (and possibly modified) video images of persons, or geometric three-dimensional (31) body models for their visual realization with recorded voices or synthesized speech and sound to determine their audible appearance.