Finely integrated media for language learning Henry Hamburger Department of Computer Science, George Mason University, Fairfax, Virginia 22030-4444, USA; and Naval Research Laboratory Abstract FLUENT, an immersive foreign-language learning environment, was developed without recourse to hypermedia techniques. Nevertheless, if one accepts the premisses, proposed in this paper, on which the idea of hypermedia has been constructed, FLUENT shows a strong relationship with it. The paper discusses this relationship after attempting to distil the essence of educational hypermedia, and after presenting a taxonomy of media for language learning. Overview The hypermedia technologies seem to promise an explosion of opportunity for computer- co-ordinated educational environments. Some emerging possibilities were in evidence at the fourth biennial Exeter Conference on Computer-Assisted Language Learning (September, 1991), featuring the use of hypermedia as central theme. In this exciting but sometimes confusing environment, it will help to impose some sort of conceptual framework. To that end, I will first attempt to distil the essence of the field of educational hypermedia, and will then present a simple taxonomy of individual media for language learning. These two analytical lenses permit a clear view, in the latter half of the paper, of FLUENT, an immersive foreign-language learning environment developed by my research group. FLUENT grew up outside the hypermedia, but the two bear a strong kinship. After describing our system, I will examine the media challenges it faces and show how it embodies the spirit of hypermedia. Hypermedia Conceding in advance that opinions vary as to the nature and best use of hypermedia, here is an attempt to synthesize in one sentence some widely held views on the matter: 60 ALT-J VOLUME 1 NUMBER 1 The essence of hypermedia for learning is the integrated use of multiple media, with associated activities in the various media that promote implicit learning and student initiative. Close reading reveals four assertions in the foregoing, which I now take up briefly. First, meaningfully integrated use should at least be possible, for otherwise the only benefit of making the media all available at once would be a slight improvement in convenience. For the whole system to exceed the sum of its media, there must be some kind of potential synergy, even if the responsibility of finding and exercising it falls to the student. Next, technology-based activities for foreign-language learning range from computerization of traditional syntactic exercises to the literarily sophisticated, interactive videodisc story worlds of the Athena Language Learning Project (Murray, 1987). A middle ground is the hypertext translation environment of the TIGER project (Deepwell, 1991). Third, implicit or indirect learning means getting away from explicit statements about the way things are and letting students pick up an understanding of the target subject matter in the context of a meaningful activity. The importance of this kind of 'situated' learning has been argued for education in general (Brown et al, 1989). In foreign language circles, the 'communicative' approach stresses the role of a meaningful situation, enabling the learner to acquire the regularities that grammars codify, without explicit mention of grammatical rules. The last feature is student initiative. Allowing the student to get involved in deciding what to do may have the cognitive benefits of both active learning and the development of study skills, not to mention the personal benefit of self-expression. A cautionary note is in order, though: while the efficacy of active learning is well-established, it will not do for designers to abandon their responsibility to provide adequate guidance, leaving the student afflicted with 'cognitive agoraphobia' (Twidale, 1991) or bogged down in a corner of the curriculum. Media for language learning Turning from hypermedia to media, we present a simple taxonomy, summarized in Figure 1 opposite. Note that there is no reference to particular pieces of equipment, nor to modalities like music, for which there is at most secondary use in the foreign-language context. Our approach to foreign language immersion consists of conversations that take place in a supporting visual context that helps the student to understand remarks that would otherwise be slightly too difficult. In addition, the student can use interactive graphics for replies, while gaining exposure to new words and grammatical patterns. Briefly put, this means two-modality, two-directional communication. The figure shows the two modes and the two directions, and indicates further subdivisions pertinent to each. One modality is language, of course, and the other is the spatial world that most of us (sighted people) receive by vision and which a computer may present in two dimensions by means of graphics algorithms. The two directions of language performance are comprehension and production, analogous to the familiar computerese terms input and output. The usefulness of this taxonomy will become clear in the final section of this paper, where it becomes a means to express design choices for our FLUENT immersion system, to which I now turn. 61 Henry Hamburger Finely integrated media for language learning Language y Textual Production / — Recorded Modality - Speech Direction • — Generated Still Spatial ^^^^^ Moving Comprehension ^ ^ Precise Rough Figure 1: Summary of a simple media taxonomy The FLUENT immersion project The idea of immersion is to help the student understand and use the new language directly and automatically, not mentally to translate or compute grammatical formulae. To this end, we get the computer to provide some of the immersion activities that good language tutors often use, employing the student's own language only to set the stage, and leaving the explicit teaching of grammar to others. (On the reasonableness of this approach, see Richards and Rodgers, 1986. Also, note that it can be used with other systems - and teachers and books - that complement it.) Immersion must be gradual, with exposure and comprehension preceding generation, and with new words and patterns introduced one at a time in situations where they can be figured out from the visual activity and the continuity of the conversation. Let us see how these ideas play out in one of the microworlds that we have implemented (Hamburger and Maney, 1989; Hamburger and Hashim, 1991). Upon entering Washroom World, the student sees on the screen a drawing of a bathroom, suitably equipped. There is also a person with a movable hand that the student can manipulate with a mouse to pick up 62 ALT-J VOLUME 1 NUMBER 1 the soap or toothbrush, turn on the water, and so on. In one activity, the student's job is to make the figure go through the moves of washing its own face, while the tutor makes relevant comments and suggestions, all in Spanish. This activity promotes recognition of language, but other exercises check the student's language production ability, and leave it to the tutor (system) to make screen moves. In either case, the tutor is directly responsive to whatever the student has just done or said. The responses can vary somewhat, according to different levels of general achievement the student may have attained. The most distinctive characteristic of FLUENT, is the very tight integration between the two modes (language and spatial) and between student and system. Another important aspect is that the screen actions influence and reflect the workings of an internal model, representing the current situation and the properties of objects. Having such a model enables the tutor to describe an action or its resulting state or even to critique an action in terms of a common-sense goal structure. Thus if the student tells the figure to wipe his face, the response may be: "But I don't have the towel." The current implementation does these things with limited flexibility. The next version, now under construction, is making more extensive use of the techniques of artificial intelligence, including reasoning and especially natural-language processing. It will also draw upon techniques of intelligent tutoring systems to guide the pedagogical decisions. Bringing it together I can now fulfil the promise to relate FLUENT to individual media and hypermedia, perhaps shedding some light on them all. First, referring to the media taxonomy of Figure 1, I will mention a few design issues for FLUENT, touching on each modality and direction. Speech would be ideal for FLUENT, but on the input side, recognizers are expensive and cannot readily deal with students' poor pronunciation. On the output side, inexpensive generators have a robotic sound that one hardly wants a student to mimic, and canned speech prohibits generativity. We plan to continue with text but will experiment with a compromise of cutting and pasting recorded speech, using multiple prosodic patterns for each recorded word or phrase. For spatial output, we have opted for generated images rather than recorded (photographed) ones, sacrificing visual fidelity in favour of close interaction between the student and the electronic tutor. Showing movement involves another compromise, one between still and moving graphics, a technique we call partial animation. This approach is exemplified by the screen figure whose hand is manipulable but who is otherwise stationary, and a scene where some objects are movable. Reaction from test users, even including a five-year-old, suggests that this approach can be fully engaging. As for language structure, however, compromise is far less in order. Since the system's very purpose is to convey, albeit implicitly, the structure and vocabulary of a foreign language, it would be hobbled from the start if it did not handle language accurately and broadly. This entails using detailed, formalized language knowledge in a generative fashion. Therefore, although the current system gets by with simple pattern-based analysis of language input, and filling sentence templates with simple generated noun phrases on the output side, the new version is importing a very strong natural-language processing component. 63 Henry Hamburger Finely integrated media for language learning In sum, although we do not bill FLUENT as a hypermedia system, it turns out to have all four characteristics proposed above as the essence of educational hypermedia. Certainly it thoroughly integrates the linguistic and the spatial media in a way that is essential to its goal. The second feature, the use of multiple and integrated activities, is also present. FLUENT provides activities for initial exposure to a microworld and its language aspects, as well as a variety of activities designed for comprehension, production, and assessment. Third, student initiative too is clearly in evidence. For example, in some activities the student controls the direction of conversation by making things happen with the mouse, while in others the system carries out student wishes expressed in language. Finally, the whole point of immersion is implicit learning. Abstaining from explicit statement of grammatical patterns, FLUENT instead provides a visual and linguistic environment in which those patterns should come to seem familiar and natural. References Brown, J.S., Collins, A. and Duguid P. (1989), 'Situated cognition and the culture of learning', Education Researcher, 18, 1, 32-42. Deepwell, F.H. (1991), 'The Tiger project, Coventry Polytechnic', presented at the Conference on CALL and the Hypermedia, Exeter University, September 18-20, 1991. Hamburger, H. and Hashim, R. (1991), 'A foreign language tutoring and learning environment' in Swartz, M. and Yazdani, M. (eds), The Bridge to International Communication: Intelligent Tutoring Systems for Second Language Learning, Berlin, Springer-Verlag. Hamburger, H. and Maney T. (1991), 'Twofold continuity in immersive language learning', Computer-Assisted Language Learning, 4, 2, 81-92. Murray, J. (1987), 'Humanists in an institute of technology: how foreign languages are reshaping workstation computing at MIT', Academic Computing, September 1987. Richards, J.C. and Rodgers, T.S. (1986), Approaches and Methods in Language Teaching: A Description and Analysis, Cambridge, Cambridge Language Teaching Library. Twidale, M. (1991), 'Cognitive agoraphobia and dilettantism: issues for reactive learning environments' in Birnbaum, L. (ed), Proceedings of the 1991 International Conference on the Learning Sciences, Charlottesville, VA, Association for the Advancement of Computing in Education. 64