Abstract

This book may rightly be called a milestone publication for its ambitious aspirations alone. It seeks to fuse two of the major research paradigms in late 20th and early 21st century social sciences and linguistics, namely the cognitive and the multimodal. Cognitive linguists have argued that metaphor is a matter of thought and conceptual structure, which are only made manifest through language and other semiotic forms of expression. Multimodality research is a late but natural response to the well-known fact that discourse and communication are hardly ever realized in language alone but more often than not through a combination of signing modalities in various media. The natural connection between these two hitherto separate strands of research is spelt out in the book. Cognitive linguistics uses multimodal texts to study the diverse manifestations of conceptual metaphors in pictures, sound and gestures and to see how metaphors are constructed combining these modes, i.e. multimodally. Multimodality theory looks at the crucial function of metaphors for linking various modes and for meaning making in text and discourse. The attempt at symbiosis is a strategic move designed to be conducive to both domains of enquiry. Yet it is probably fair to say that the key questions and methodologies mainly reflect the concerns of conceptual metaphor theory even though what the individual media case studies reveal is sure to also be profitable for multimodality research at large.
The very idea of multimodal metaphor forms the theoretical cornerstone of the book. In contrast to verbal (and other mono-modal) metaphor, it is defined as those metaphors ‘whose target and source are each represented (also: rendered, p. 4) exclusively or predominantly in different modes’ (p. 24, cf. also p. 4). So for a conceptual metaphor like
Although every chapter would merit individual scrutiny and comments, this review must for reasons of space and for the sake of generalization proceed differently. I have, therefore, decided to single out the most central concerns running through the various contributions in order to explain and critically assess them. What I hope to be able to point out in this fashion are the major strands of thought in the cognitive and multimodal paradigms, their recurring concerns and the problems, blind spots and grey areas this kind of research confronts.
The genre- and/or mode-oriented enquiries into metaphor in chapters 3 to 18 are introduced by two rather general contributions by the editors. Urios-Aparisi and Forceville present a very fine-grained overview of the research topics and questions in the book outlining the connections, synergies and ‘thematic patterns’ (p. 11). Forceville gives a systematic and also partly historical outline of some of the fundamental notions underlying multimodal metaphor research. Apart from providing definitions, the text reflects on the nature of similarities that can underlie non-verbal metaphor and on the essential differences between conventional and idiosyncratic metaphor. However, perhaps symptomatically, it has comparatively little to say about the influence of genre on metaphor production and interpretation – simply because this is an area hardly studied systematically as yet.
In keeping with the tradition of conceptual metaphor theory, the most pervasive element in the majority of chapters is a concern with the kinds of metaphors contained in the texts studied. So the authors invariably set out to identify the targets and sources that are realized in the signing modes, genres and media at hand. We learn among other metaphorical preferences that
A more specific and markedly multimodal enquiry of the book is directed at how sources and targets are distributed across the various modes. We learn that the concreteness and sensory directness of visuals and sound/music make them suited to realizing source domains whereas language preferentially expresses target domains (Koller and Forceville, ch. 16). Some targets, however, like the taste and smell of wine, tend towards the mono-modal, verbal metaphor and use little imagery (Caballero). Not infrequently, source domains cut across language and visuals (Shinohara and Matsunaka) or sources and targets distribute between the modes (Urios-Aparisi, Müller and Cienki), and generally mono- and multimodal metaphors seem to cooperate (Eggertsson and Forceville). As a preliminary conclusion a model of ‘dual encoding’ (Koller, pp. 45, 62) seems plausible, which suggests that the cognitive and communicative effects are optimized if metaphors build up using various modes. Considering that mode inter-linking in multimodality research is generally seen as following the principle of ‘complementarity’ or ‘transcriptivity’, which means that modes each contribute according to their potentials and deficits, too structuralist an approach to multi-modal metaphor must be detrimental. After all, despite varying patterns in the text it is the recipient who ‘zips’ cues from the modes together to arrive at a plausible reading of the metaphorical meanings.
The question I find most fascinating and promising for future multimodality research is how the specificity of the mode constrains and facilitates metaphor construal. Here the book offers a few preliminary insights. So Eerden indicates that it is the sensory and media properties of a mode that determine the way they can contribute to metaphorical meaning, e.g. animated film offers dynamic semiotic resources of the moving image, which do not have an equivalent in language. Forceville highlights that where language has deictic expressions to explicitly link modes, time-based media must rely on ‘well-timed simultaneity’ (p. 395). Zbikowski, finally, pays attention to ‘sonic analogues’ in music (p. 376) like e.g. ascending/rising pitch, which may cue gestures or emotional states. It is these timid beginnings on the way to understanding the semiotic and textual properties of modes, which deserve most attention and future dedication, as they mark an ideal halfway point between the disciplines of the cognitive and the multimodal.
Besides concerns with individual genres and modes, some contributions also focus on the more general question of how metaphorical expressions in various modes are perceived and understood. Starting out from the basic assumption that ‘the comprehension of verbal, visual and multimodal metaphors involves similar mental procedures’ (Yus, p. 147) the relevant chapters (Yu, Yus, El Refaie, Schilperoord and Maes, Zbikowski) set great store by pointing out that metaphor understanding must be more than just the static mapping of conceptual elements. Rather it involves flexible ad-hoc interpretation and a good sense of an optimally relevant reading conditioned by context and situation. Not surprisingly, those authors favour Fauconnier’s and Turner’s more dynamic conceptual blending over mapping as it allows for exactly the kind of flexibility and creativity needed in the modelling of textual and multimodal understanding. Multimodality research can afford to remain blind to the internal cognitive debates about competing models, but it will of necessity have a vested interest in a rich textual interpretation.
It is a great merit that the book adheres to the sharp delineation between metaphor and metonymy as practised in cognitive linguistics (cf. Kövecses, 2002: 143 ff.) and also addresses the specific role metonymy plays in multimodal meaning making. Urios-Aparisi shows various interactions between metonymy and metaphor in TV-commercials (cf. also Forceville, ch. 16), which suggest that metonymy’s main functions are to highlight the target of a metaphor, prepare a metaphor’s conceptual ground by providing rich knowledge about source or target and signpost those elements crucial to the mapping. For gestures, Mittelberg and Waugh argue that ‘these two figures of thought jointly structure multimodal representations of grammatical concepts’ (p. 329). Here, metonymy ‘leads the way into metaphor’ (p. 347) in that first, the gesturing hand is a metonym for an imaginary object, and secondly, this object drawn into the air metaphorically stands for an abstract idea, e.g. a sentence or morphemes. With a view to the future of a cognitively grounded multimodality research, I should like to underline the importance of looking beyond metaphor at the vast repertoire of rhetorical figures. All of them are but manifestations of conceptual structures and operations, which build multimodal texts. Here is a vast field opening up for multimodal rhetoric and cognitive semiotics.
A final issue would seem to be high on the agenda of conceptual metaphor theorists. Yet, but for the exception of Shinohara and Matsunaka, it does not surface in the book. This concerns the question of whether (and how much of) metaphor is culture-specific or rather universal. Because it is early days in cognitive multimodality research this omission can easily be excused. Yet it should be obvious that cultures may prioritize and handle modes differently in multimodal artefacts. It is also known that while embodied metaphors indeed seem universal, various cultures produce differing kinds of experiential knowledge that might motivate different metaphors in other semiotic manifestations. Again this could be a promising focal point for future research.
What I hope to have shown is that the book is rich in strategic thought and analytical detail that fruitfully engage both with multimodality theory and cognitive semiotics. The volume manages to live up to its high aspirations, yet beyond tentatively mapping out a new field it also reveals some of its inherent difficulties. The first concerns the need for transcribing multimodal texts in order to fix their temporally fleeting or spatially complex structures (cf. Schneider and Stöckl, 2011: 28 ff.). If one wants to show how modes interrelate and which roles metaphor/metonymy play in this, multimodal data is needed in a systematic form. A second difficulty is also a methodological one. The book demonstrates that metaphorical meaning can be notoriously complex and hard to model. Rather than just rely on textual structures it would be worth complementing the analysis by media–ethnographic empirical methods, like eye tracking, interviews and questionnaires in order to be able to describe actual recipient interpretations (cf. El Refaie). Finally, conceptual metaphor theory ought to be prepared to acknowledge its limitations. Churning out conceptual metaphors for a sheer infinite variety of metaphorical expressions may be helpful to create order and system. Yet it also disguises the underlying problem: the mentalese A = B shorthand is verbal and reflects the linguistic habits of a culture and the bias of the analyst. Rather than worry too much about concepts in the mind and preconceived static mappings, it would be sound to look closely at the processes of ‘interactive property attribution’ (Glucksberg, 2001; cf. Stöckl, 2010: 190 ff.) in multimodal texts. Also the concrete forms and types of metaphorical expressions and their effects on the meaning potential created deserve more attention – both in language and in other non-verbal semiotic modes.
