After International Relations Critical R...
After International Relations Critical R...
Critical realism is one of the most influential new developments in the philosophy of
science and in the social sciences, providing a powerful alternative to positivism and
post-modernism. This series will explore the critical realist position in philosophy
and across the social sciences.
Critical Realism
Essential readings
Edited by Margaret Archer, Roy Bhaskar, Andrew Collier, Tony Lawson and Alan Norrie
The Possibility of Naturalism, 3rd edition
A Philosophical Critique of the Contemporary Human Sciences
Roy Bhaskar
Heikki Patomäki
List of illustrations ix
Acknowledgements xi
Introduction 1
PART I
Overcoming international relations 19
PART II
Explicating a critical realist methodology 97
PART III
Visions of world politics 165
Epilogue 237
Bibliography 239
Index 257
Illustrations
Tables
Figures
ix
Acknowledgements
During the good ten years that this book has been in the making, I have
accumulated debt to many persons. Any reader will soon notice the central role
of Hayward [Link]. He has not only inspired me, but also commented on
most parts of this book, on various occasions. Special thanks are also due to
Tuomas Forsberg and Colin Wight, with whom I have worked closely together,
and who are doing critical realism in their own ways. Collaboration with them
has sparked many stimulating discussions and debates. Also the following
colleagues have provided useful comments and irreplaceable support (I apologise
for any possible mistaken omissions): Jason Abbott, Osmo Apunen, Chris Brown,
Walter Carlsnaes, Stephen Chan, Mick Dillon, Chris Farrands, [Link],
Stefano Guzzini, Henrikki Heikka, Mark Hoffman, Jef Huysmans, Pertti
Joenniemi, Eerik Lagerspetz, Robert Latham, Richard Little, Iver [Link],
Hannu Nurmi, Lloyd Pettiford, Juha Räikkä, Magnus Ryner, Hidemi Suganami,
Teivo Teivainen, Teija Tiilikainen, Ole Wæver, Rob Walker (who has also
instigated the title of this book), Keith Webb, Alexander Wendt, Matti Wiberg
and Andy Williams. In some cases, the most valuable comments have come
from the persons most critical of my project!
Chapter 2 was previously published as ‘Hayward Alker: An exemplary voyage
from quantitative peace research to humanistic, late-modern globalism’, in I.
Neumann and O.Wæver (eds) (1997) The Future of International Relations. Masters
in the Making?, London: Routledge, pp. 205–35. Chapter 3 was previously
published as ‘How to tell better stories about world politics’, European Journal of
International Relations 2(1):105–33, reprinted by permission of Sage Publications
Ltd (© Sage Publications Ltd and the European Consortium for Political
Research (ECPR) Standing Group on International Relations 1996). Chapter 4
was published previously as ‘Concepts of “action”, “structure” and “power”’ in
“critical social realism”: A positive and reconstructive critique’, in the Journal for
the Theory of Social Behaviour (1991) 21(2):221–250, reprinted with permission.
Chapter 6 was published previously as ‘Scientific realism, human emancipation
and non-violent political action’, Gandhi Marg (1992) 14(1):30–51, reprinted
with permission. Chapter 7 was first published in Finnish as ‘Mitä Thukydideen
Melos-dialogi voi kertoa?’ [What Can Thucydides’ Melos-Dialogue Reveal?],
Rauhantutkimus (1990), 20(4):26–63; used here with permission. Finally, Chapter
xi
xii Acknowledgements
What are international relations? This book argues that all social relations are
changeable. They are dependent on transient concepts, practices and processes.
What has once emerged can change and even become absent. So it is also with
international relations.
There is no closed list of transformative possibilities. After International Relations
claims that critical realist (CR) social sciences can help determine some of them.
Social sciences should proceed in three successive stages. First, in order to explain,
researchers have to engage with social meanings and assume a dialogical relationship
with relevant actors or traces and evidence of history, including texts and quantified
observations. Only with dialogically understood data can social episodes, practices,
relations and processes be studied adequately and in sufficient detail.
Mere understanding is not enough. Causally explanatory models can and
should rely also on holistic metaphors, historical and other analogies, and various
social scientific theories. The best available explanatory model is often different
from prevalent understandings. Consequently, in the second stage, explanatory
models may not only turn out to be critical of the understandings of the lay
actors but also take steps towards explaining them.
The critical moment evokes a need for social scientific re-signification of
practices, including edification by means of better causal stories and proposals
for concrete alternatives. If a change is argued for, as it often is, realist strategies
for transformation have to be spelled out as well. Constant openness to critique
and further studies is a condition of validity. In the third, and practical moment,
CR explanations should translate into practical action—often transformative.
Also, practical-political action should be a learning process, in turn enriching
theoretical discourses.
CR has emerged in a particular world historical context. As some
constructivists and post-structuralists argue, there are signs of far-reaching
transformations of international relations into something more akin to world
politics (cf. Ruggie 1998:174–75, 195–97; and, more explicitly, Walker 1993:183).
After International Relations makes a case for CR social sciences which are not
content with describing transformations but try to understand, explain and
critically assess contemporary practices, relations and processes, and, thereby,
contribute to making them more emancipatory and empowering.
1
2 Introduction
International relations
correspondence between any particular theoretical discourse and the social worlds
they refer to and purport to understand, explain and, sometimes, change.
Nonetheless, since actions and practices are dependent on and informed by
concepts, theoretical innovations are potentially efficacious in changing realities.
Practices can be constituted or guided by misleading or partially false or
contradictory understandings. More adequate concepts—if given a chance—should
therefore enable and also encourage changes of practices, institutions and deep-
seated social relations. It is certainly true that achieving well-founded conceptual
changes require no less hard work than practical political activities.
Critical realism
Developments
Plato etc.: The Problems of Philosophy and Their Resolution (1994) is represented as an
attempt to popularise Dialectic, although it has a slightly different focus and somewhat
less speculative appearance. Like Dialectic, it has fascinating and very useful discussions
on causality, time, space and causality/freedom, taking the notions developed in
earlier works further. The emphasis is on a totalising critique of Western philosophy.
Not unlike Derrida’s criticism of Western logo-centrism, Bhaskar claims that, since
Plato, the ontological monovalence7—including the absence of notions such as
‘absence’, depth and change—have generated a philosophical problematic that, from
a consistent CR point of view, is nonsensical. Yet this philosophical problematic—
also through the structures of positivism, which are causally efficacious in many
modern practices—has had far-reaching consequences in the structuration of societies.
Bhaskar asserts that ‘the net project of Dialectic and Plato etc. is to attempt an anti-
Parmidean revolution, reversing 2500 years of philosophical thought, in which
negativity (absence and change), ontology, structure, diversity and agency come to
the fore’.8
Perhaps quite logically then, Plato etc. was followed by Bhaskar’s post-colonial
turn to the East and, as it seems, partial retreat to mystical thinking. From East
to West, published in 2000, ‘aspires to begin to construct a dialogue, bridge and
synthesis’ between traditions of both radical libertarian Western thought and
mystical Eastern thought. Many fellow critical realists have found the outcome
to be a philosophically and historically weak attempt to bring in popular New
Age ‘philosophies’ (e.g. Hostettler and Norrie 2000). Despite the importance of
the project, it also seems to contradict many CR theses, including the idea that
philosophy must be broadly consistent with the substantive contents and
methodologies of the sciences (see Patomäki 2001c, for a discussion of, and
comparison to, the somewhat similar attempts by Friedrich Nietzsche, Jacques
Derrida and Johan Galtung to appropriate notions from Buddhism and develop
dialogical notions of global philosophies).
While Bhaskar has been either adding new layers and aspects to CR, or
testing and exceeding its limits (depending on one’s view on his later works),
scientific realism has gained ground in analytical philosophy (e.g. Margolis 1986;
Searle 1996). CR itself has been cultivated among a group of social, political,
legal and economic theorists. These theorists include Margaret Archer (1988,
1995, 2000), Andrew Collier (1994, 1999), Tony Lawson (1997), Alan Norrie
(1991, 1993, 2000) and Andrew Sayer (1992, 1995, 2000). Besides introductions
to CR, and applications of CR to the problem fields of various disciplines,
many of these works have developed original positions on a number of central
philosophical and social theoretical issues. It is also significant that there are
parallels to CR developments in continental Europe, in particular in France. At
times, Derrida (cf. 1998 [1967]:50; 1988:114–54) and Paul Ricoeur (1981:274–
96; 1984:78; 1992:300), for instance, seem to have been approaching a position
close to CR. However, it is Pierre Bourdieu (e.g. 1977; 1998:1–13), in particular,
who has been working, since the 1970s, on a social theory very similar to CR.
This compatibility has been increasingly acknowledged by people working on
CR social theory (see, for example, May 2000).
8 Introduction
Philosophical theses
In this book, I identify CR with three philosophical theses and a related theory
of emancipation. The three philosophical theses concern (1) ontological realism,
(2) epistemological relativism and (3) judgemental rationalism. Ontological realism means
that the world is real and—except for a very small part—independent of the
researcher’s (my or our) knowledge of it. Bhaskar (1986:7) argues that ‘every
philosophy, inasmuch as it takes science for its topic, is essentially a realism, or
at least has realism for its principle, the pertinent questions being on how far and
in what form this principle is actually implemented’.
CR claims to have taken the principle of realism further than any Western
philosophical system thus far. The claim is that the world is not only real but it
must also be differentiated, structured, layered and possess causal powers, otherwise
our knowledge of it—or our being—would not be possible. Natural science is a
matter of the conceptual and practical work of scientists, who create artificial closures
in laboratories in order to understand and explain mechanisms, which can be
causally efficacious across different factual contexts. With few exceptions, there are
no closed systems in nature or society. Therefore, a realist conception of causality
does not equate causality with constant conjunctions but with structured powers
capable of producing particular, characteristic effects if triggered.
In this regard, natural and social sciences are similar. There are real causal
powers both in natural and social systems, whether actualised or not. CR
encourages research to focus on identifying and analysing the structures of beings
and mechanisms (or complexes), which are causally efficacious. Moreover, it is
also a condition of science that scientists and other social actors themselves possess
causal powers. A corollary is that, although not all reasons are causally efficacious,
and actors can themselves be mistaken about the effective reasons for their actions
and practices, some reasons must be efficacious in generating actions.
The idea that reality is layered, and that different, emergent layers (such as
biological, human and social) are ontologically irreducible and causally efficacious
implies a further ontological notion. Absence and emergence must also be real. Not
everything that is has always existed. Whatever exists can become absent—and
already is absent in relation to most fields and spaces most of the time anyway.
A parallel of the notions of absence and emergence is that change is not only
possible but also ubiquitous. This is one of the senses in which CR is a dialectical
system of thought and action.
CR has thus cultivated ontological realism. The second main feature of CR is
its epistemological relativism. Epistemological relativism connotes that all beliefs
and knowledge claims are socially produced, contextual and fallible. CR knowledge
is structured and layered. There are strongly universalising philosophical claims
such as those for ontological realism and epistemological relativism; there are
rather stable scientific theories; and there are reflective interpretations of social
meanings and episodes. However, even the deepest and most universal
philosophical theses must be, in principle, open to criticism and change. Openness
and transfer inability are essential to any research. The work of scientists is based
Introduction 9
on the pre-existing means of production, including both the available material facilities
and technologies as well as the antecedently established facts and theories, models
and paradigms. Models and paradigms must rely on analogies and metaphors.
Therefore, a rationally developing science uses both historically transient conceptual
resources and imagination in a systematic and controlled fashion. However, in the
social sciences, laboratory experiments and precise quantitative measurements
are impossible. Social sciences are conditioned by the fact that they have to gather
all their empirical evidence from open, historical systems. Moreover, social sciences
also involve attempts to understand lay meanings. Thus, social scientific models
and theories must be more reflectively interpretative undertakings than natural
sciences.
Epistemological relativism is consistent with the idea of rational judgements.
Judgemental rationalism means that, in spite of interpretative pluralism, it is possible
to build well-grounded models and make plausible judgements about their truth. CR
gives, in fact, the best available account of the grounds for scientific activities. It
explains the nature of the objects we study. It explains what we as researchers do and
why—and also why we should aim at amending and revising our theories and models.
It is easy to see, for instance, that ontological realism and epistemological relativism
are conditions for rational truth-judgements. On the one hand, without independent
reality, truth-claims would be meaningless. Truth-claims have to be about something. On
the other hand, with our theories already mirroring reality, no further research or
truth-judgements would be needed. Besides making sense of the rationale for research,
CR enables us to acknowledge that claims to truth have both antecedent conditions
and consequent effects. Some of these effects can be seen, in a Foucauldian manner,
albeit in realist terms, as effects of power.
A theory of emancipation is also an essential component of CR. The core of
this theory is the idea that lay meanings are both necessary for social practices
and relations, and can be criticised. This pertains to why this particular form of
scientific realism is called ‘critical’.
In what sense is CR ‘critical’? Kant’s philosophy was called critical, because Kant
problematised and analysed the presuppositions of our knowledge. Bhaskar
(1986:22) mentions that CR is post-critical in its form. That is, CR relies upon
transcendental arguments, yet it analyses historical practices and not universal
categories of human mind. CR is critical, first, in this post-Kantian sense. It asks
Kantian questions and deploys transcendental arguments. However, in contrast
to Kant, CR uses these arguments to establish a realist ontology. Further, in
contrast to naïve realism, CR sees beings as relational and changing, and, also,
argues that there is no unmediated access to these beings.
The second sense in which CR is ‘critical’ is related to the distinction between
ontology and epistemology. No knowledge simply reproduces reality or ‘mirrors’ it.
Correspondence in this sense would not make any sense (Sayer 2000:42). Any
10 Introduction
attempt to describe and explain involves complex mediations; that is, interpretation.
CR is a form of reflective philosophy, self-consciously interpretative and relativist.
The starting point for Kant’s epistemological reflections was an abstract bourgeois
individual, an allegedly representative example of the human species, a universal,
disembodied mind. For a critical realist, the point of departure is man as an
embodied historical being, the existence of which is made possible by antecedent
conditions and structures, both natural and social. Men and women have causal
powers to change some of these conditions and structures, but, as Marx (1968
[1852]:96) said, ‘not just as they please, not under circumstances chosen by
themselves, but under circumstances directly encountered, given and transmitted
from the past’.9 Adequate theory of knowledge is thus not possible without adequate
social ontology. Also in this sense ontology is primary. Ultimately, CR contends that
interpretative reflectivism must be grounded on ontological considerations.
Third, CR includes a theory of emancipation; it is also ethico-politically critical.
The original version of the CR theory of emancipation was based on the notion of
truth. Theories or discourses that are in some important regards false can, nonetheless,
be necessary for the reproduction of practices and relations. False understandings and
related structures are unduly limiting human possibilities. The basic idea is that we
can derive ethico-political judgements from truth-judgements. Hence, as virtuous
scholars, we may have a moral obligation to change those practices and relations,
which presuppose false theories and discourses. Although truth-judgements are
necessary for an emancipatory argument, they are not sufficient. A generalised
conception of emancipation strives for human flourishing, and also takes into account
other values such as pluralism, justice, democracy, good, virtue, economic efficiency,
ecological care and the like.
This book is divided into three parts. The first part outlines the international
problematic and some attempts to overcome it. It explain why theories based on this
problematic have failed, and concludes with CR suggestions about how to tell better
stories about world politics. The second part deepens these suggestions and develops
them in detail into a CR account of social sciences. It is also my contribution to CR
social and political theory. The last section applies CR methodology to the study of
world politics. The different applications aim not only at making the methodological
discussions more concrete, but suggest—by way of explanatory criticism and concrete
Utopias—transformations from international relations towards world politics.
In the first chapter ‘How it all started? Can it end? A genealogy of the
international problematic’, the conventional problematic of international
relations (IR) is explicated. I argue that the IR problematic has usually been
Introduction 11
descriptions of social events and characteristics. The second lesson concerns how
to analyse and help to solve collective dilemmas. The Prisoner’s Dilemma (PD)
is a game-theoretical model in which individual utility maximisation appears to
be self-defeating, or at least contradictory to some kind of social or collective
rationality. Not only systems of subordination have been analysed in terms of
this model, but also the ‘tragedy of global commons’ as well as inter-state insecurity,
arms races and crisis bargaining. After a long road towards more conversational
and dramaturgical models, Alker eventually came to the conclusion that there are
close connections between the world understandings of different research
paradigms—with their parallels in the actors’ world understandings—and of actors’
abilities to resolve PDs (acknowledging that any way of defining the problem and
its ‘solutions’ is based on the adoption of a perspective).
Lesson three is about ‘how to broaden and deepen scientifically relevant
modelling approaches’. The starting point is that ‘we must broaden and deepen
the universe of scientifically relevant modelling approaches appropriate for
the formal analysis of interpretative and theoretical world histories’ (Alker
1996:271). Alker makes a strong plea for more reflective approaches: ‘conflict
and crisis, as well as the steady state in which violence remains latent, can be
partially explicated in terms of dynamics which are approximated by the
interactions of artificial systems equipped with the story-building and story-
appraising characteristics’ (Alker 1996:270). Hence, an adequate analysis of
world histories must in fact incorporate the (perhaps ideological?) role of
narratives and their interplay. The argument can be summarised in a question:
Indeed, the fourth lesson is about ‘how to theorise history?’ Alker concludes
that world history should be analysed (interpretatively) as an open process, in
which choices of actors do make a difference, sometimes also to the intended
direction. After having analysed the interplay of power-balancing, collective
security and the international socialist transformation through people’s wars,
he demands more contextually sensitive and historically reflective doctrines of
collective security, some less total, more realistic alternatives.
Unlike many post-positivists, including most post-structuralists, Alker has
continued to develop more adequate and emancipatory methodologies for the study
of peace, conflict and international relations. However, his post-modernist excursions
and exercises in ‘computational hermeneutics’ etc. have led to the criticism that he
is ultimately an ontological idealist. There are many passages and claims that would
point towards a more realist position, some of them made in explicit reference to
CR. However, it is true that the question of realism is left largely unthematised in
Introduction 13
Alker’s texts, and there are also some noteworthy ambiguities. In conclusion, I
maintain that, although Alker’s methodological lessons are mostly valid and his
attempts to develop better methodologies highly valuable, there is a need to articulate
a consistently realist ontology and epistemology
The third chapter ‘How to tell better stories about world polities’ discusses some
key meta-theoretical problems of IR. The levels-of-analysis problem emerged in the
early 1960s as part of the positivist project to categorise the explanations of peace,
war, co-operation and conflict in terms of locating the main explanatory ‘factors’
into separate ‘levels’: individual or sub-state group or bureaucracy; state; and inter-
state system. In the 1980s, Richard Ashley and Alexander Wendt brought the
agency-structure problem into IR, following Giddens’s and Bhaskar’s earlier
contributions to social theory. In the debates of the 1990s, the levels-of-analysis and
agent-structure problems have often—and misleadingly—been conflated.
By analysing a well-known debate between Wendt vs. Martin Hollis and Steve
Smith, I make a case for a CR approach to grasp levels and depth, and to resolve—
at least tentatively—the agent-structure problem. In particular, I maintain that the
levels-of-analysis problem is nothing but an implicit verticalisation of the horizontal
inside/outside problem, the essential component of the international problematic. I
propose, first, that layers and depth should be analysed more realistically in terms of
deep assumptions of background discourses, historical sedimentation of social
structures and emergent power of various overlapping (or intra-relational) social
systems. Second, that a more adequate account of global spatiality would also enable
us to see non-territorially organised practices, including—despite their territorialist
pre-conceptions—many of the traditional international institutions, such as
diplomacy, international law, balance of power and collective security. Third, that
the agent-structure problem can be unpacked by means of analysing the elements of
causal complexes responsible for the effects we are interested to explain. And fourth,
that social scientific explanation is a form of understanding, based on a double-
hermeneutic process.
Moreover, I conclude the third chapter by arguing that any explanation
includes a genuinely critical moment, with real emancipatory potential.
In the previous part, the quest for a CR ontology and epistemology was brought
up. In response, the second part explicates the basic elements of a CR
methodology for the study of social relations and politics from a globalist
perspective. Since the subject matter determines its cognitive possibilities, the
fourth chapter ‘Critical realist social ontology’ starts this part by developing a
social ontology by a conceptual analysis of ‘action’, ‘structure’ and ‘power’. I
scrutinise critically the conceptions of Giddens and Bhaskar, and thereby pursue
further the fundamental categories of social scientific ontology. Accordingly, I
also spell out in detail the notions of causal complex and iconic model, which form
the core of the argument of the rest of the book.
14 Introduction
case for explicitly normative visions. Positive arguments about concrete Utopias
should complement explanatory criticism, and guide transformative actions.
unilaterally the norms of the dominant states. It is argued that the project of
CR—and related forms of other critical theories—is to work for enhancing the
self-transformative capability of contexts, and thereby for creating the conditions
for global integration and opening up and sustaining public spaces for world
politics (in the wide sense of the term).
The last chapter ‘From security to emancipatory world politics: beyond
“Nordic nostalgia”’ is both an example of explanatory criticism of a contemporary
process of change and also indicates the urgency to conceive ethico-political
projects in globalist terms. Of the Social Democratic regimes of the Bretton
Woods era, the Swedish model was perhaps the most radical and universalist.
This model has been shared in most respects by all Nordic countries, which
together form a highly integrated pluralist security community, Norden. The
Swedish model was economically successful and sustained highly egalitarian
economic policies for more than forty years, and translated those aspirations
into a progressivist foreign policy of active neutrality of the Third Way.
Ole Wæver’s 1992 article ‘Nordic nostalgia: Northern Europe after the Cold
War’ seemed to articulate the widespread feeling of the end of Norden. However,
this argument is flawed and mistaken. Logically and conceptually it is confusing
and tendential, and the evidence to back the empirical claims is casual. This
explanation is contrasted to an alternative, more systematic iconic model of the
same process. The alternative explanation in terms of political economy starts
from the observation that it was not the end of the Cold War but rather the
end of the Bretton Woods system that gave rise to the re-defmition of the Third
Way already in the early 1980s, well before the end of the Cold War. The
problems of the Nordic model stemmed from insurrections against local relations
of domination at the workplace; transformations of the occupational structures
and class relations; the crisis of the Bretton Woods system of regulating the
global economy; and the liberalisation of the exit options of capital, among
other processes. Together with the end of the Cold War, this interplay has also
reinforced irrealist and economistic discourses, which had replaced the earlier
rather Marxist concepts of the theorists of the Social Democratic Party. The
various irrealist and economistic discourses generated accounts about the
requirements of ‘new times’ and gradual changes in the meaning of the notion
of Third Way. They also constituted neo-liberal framing of new social problems.
Starting with the CR idea that emancipation is ‘characteristically the transition
from an unwanted, unnecessary and oppressive situation to a wanted and/or
needed and empowering or more flourishing situation’ (Bhaskar 1993:397), it
is my argument that emancipatory, globally oriented political action is a condition
for the Nordic ideals to be realised and developed further under new conditions.
Tackling the real conditions of the decline of the Norden demands a new
orientation in time and space. The nation-state cannot provide a sufficient
framework for progressive political action any more, and the socially flavoured
Wilsonian ‘idealism’ appears also to be a somewhat anachronistic basis for
‘progressive internationalism’. Claims about democracy and social justice have
to be re-articulated.
Introduction 17
This is not to argue that politics now occur only outside sovereign states.
Rather, local and national struggles are essentially connected to regional and
global struggles, and therefore they cannot be taken as separate spheres (‘first
progressivism at home, and then exportation of these universalist ideals to the
rest of the world’). Along these lines, I discuss two programmatic visions that
should take the Nordics beyond ‘Nordic nostalgia’, and provide inspiration and
hope for others. The Europeanist strategy is based on a new positive
understanding of the ‘logic of stateness vis-à-vis the free play of market forces’
(Hettne 1993), this time articulated as the ‘pursuit of regionness’. Yet, this strategy
can at best be complementary to a more globalist strategy of carrying out global
democratic and social reforms.10
The currency transaction tax is a particularly promising idea, given that it
would tackle a key issue of globalisation, namely the power of global financial
markets; enable the collection of substantial resources for global purposes
(including Keynesian demand-inducements and re-distribution); and constitute
a leverage for other democratic reforms. To really tackle the conditions of the
decline and the partial neo-liberal transformation of Norden, many further
programmatic ideas would be needed as well. However, particularly assuming
some success in empowerment and increased autonomy, these programmatic
ideas may also concern the organisation of very immediate local sites and related
national and regional developmental strategies, in the spirit of democratic
experimentalism.
Indeed, the argument is that a re-construction of world politics seems to be
a condition for successful emancipatory politics anywhere.
Notes
1 British and US editions have often translated Staat into ‘nation’ in Kant’s political
writings, but this is misleading (I am thankful for Stefano Guzzini and Ollie Kessler
for checking the original text). Kant was discussing a society formed by bourgeois
citizens, and he sometimes gave anthropomorphic characteristics to the resulting entity,
but in no way did he, unlike Hegel soon after him, consider nations as carriers of
moral conviction and historical progress. It is of course also true that, even for Kant,
the theme of war and peace was not the first and foremost concern. Manicas (1987:32)
argues that ‘after Machiavelli, no theorist in the “liberal” tradition put war at the
center of his analysis’. Kant is but a partial exception. Nonetheless, in his political
writings—which may appear marginal to his main output—he clearly argued that the
problem of war and peace is absolutely decisive to the future of mankind. Therefore,
he also put some effort into articulating the problematic systematically and also to
developing original solutions to the problem of peace and war. For comments on why
liberal political theorists have not been concerned with international relations, see
Wight 1966 and, for a very different account, Ashley 1988. More recently, since the
1980s, due to the prevalence of globalisation discourse, the situation has been rapidly
changing.
2 Paradoxically, however, Morgenthau’s accusation that Kantians are to blame for the
catastrophes of the twentieth century seems to affirm the causal efficiency of ideas.
3 This scepticism is more fully articulated in Harré 1979.
18 Introduction
4 A few remarks on Marx’s understanding of ‘laws’. Although this is not the place to
develop the argument further, I believe that, in his understanding of ‘laws’, Marx in
fact conflated at least three very different meanings:
(i) Laws in the sense of ‘natural laws’ of classical political theory, concerning particularly
the labour theory of value (which was also the normative basis of his immanent
critique of capitalism).
(ii) Laws in the sense of Hegelian laws of ‘logic’, explaining for example the ‘transition’
from (i) to (iii), and also forming the basis of his critique of ‘alienation’ and,
simultaneously, the assumption of ‘laws of history’.
(iii) Empirical realist laws a la Newton and Mill, who also both allowed for scientific
realist questions and inferences. These conflations have been an endless source
of confusions and debates within Marxism (cf. the problem of getting from labour
value to relative prices in the market, i.e. the ‘transformation problem’), and a
reason for many to reject the Marxist doctrine. An attempt to rescue this doctrine
by re-reading Marx as a critical realist is thus likely to mystify more than clarify.
5 Isaac (1987) has argued, however, that it is possible, to an extent, to make sense and
develop further some Marxist conceptions of power in realist terms. Marxism has
manifold gaps and lacunae and is not the correct theory of the world, but ‘deserves to
be examined and criticized on its own merits as a theory, and not simply discarded on
strictly formal (and specious) grounds because it fails to conform to the canons of
empiricist philosophy’ (Isaac 1987:10). This and other similar works give a more
interesting and plausible account of the relationship between CR and post-Marxist
theories of power, for instance.
6 Bhaskar (1989:141), however, criticises Horkheimer for introducing a contrast between
emancipatory and technical reason, which opened up the possibility, which Herbert
Marcuse and Habermas so vehemently exploited, of turning critical theory against
the ‘system’ (world). For Marcuse and Habermas, the ‘system’ (world) is based on
technical reason and science. Whereas Marcuse attempted a totalising critique of the
‘system’, Habermas in fact accepted the inevitability of this ‘system’, thus criticising
only its tendency to ‘colonialise’ the life-world. In this form, critical theory is derived
from (1) a misunderstanding of science and (2) reification of the ‘system’ (world).
Bhaskar’s criticism of these misunderstandings and reifications is correct. However,
these confusions hardly played any important role in Horkheimer’s original text.
7 Ontological monovalence is defined as a purely positive account of reality. If there
is no absence, there can be no emergence either. Without absence/emergence, there
can be no change. ‘It acts ideologically to screen the epistemological and ontological
contingency of being and to sequester existential questions generally, as ontological
actualism sequesters “essential” ones. The result is the doubly dogmatically
reinforced positivization of knowledge, and eternalization of the status quo’ (Bhaskar
1994:254–6).
8 In the first chapter, I apply a rather similar type of analysis to IR theory. However, I
try to eschew a totalising critique. Rather, I want to participate in the debates and
open up new possibilities for better debates and research on world politics/global
political economy, as well as better and more emancipatory practices. These would
not be possible without a dialogical attitude with others here and now, however
constituted.
9 These circumstances are not only external to human beings. In one of his more thrilling
metaphoric statements, Marx (1968 [1852]:96) argued further that ‘the tradition of all
the dead generations weighs like a nightmare on the brain of the living’.
10 Or one could well also talk about socialist reforms in a sense of questioning regulating
or transforming private property rights in democratic and pluralist terms.
Part I
Overcoming
international relations
1 How it all started? Can it
end? A genealogy of the
international problematic
21
22 Overcoming international relations
But this universal and primary opinion of all men is soon destroyed by the
slightest philosophy, which teaches us, that nothing can ever be present to
the mind but an image or perception, and that the senses are only the inlets,
through which these images are conveyed, without being able to produce
any immediate intercourse between the mind and the object. The table, which
we see, seems to diminish, as we remove farther from it: but the real table,
which exists independent of us, suffers no alteration: it was, therefore, nothing
but its image, which was present to the mind. These are the obvious dictates
of reason….
(Hume 1975 [1777]:§118)
Everything—God and angels, beings and their causal powers, society and
morality—have to be, therefore, reduced to a few general principles governing
our (or rather my own) sense-perceptions and how the human mind in general
understands them. Hume argued, for instance, that causation is a mere operation
of mind, ultimately based on regular sense-impressions of contiguity and
succession between A and B. These give rise to a habit of thinking, an association,
that A and B are always connected. Causation is therefore not real, but a mere
property—or rather a universal habit—of human understanding. Causation is
just one of the three ‘bonds that unite our thoughts together’ (Hume 1975
[1777]:§41). Also, Hume’s re-interpretation of Newton’s mechanics is based on
this radical empiricist theory of knowledge and causation. Hume’s scepticism
also gave rise to the famous problem of induction, later declared to be the ‘scandal
of philosophy’. The problem of induction is:how can we know that B will also
follow A in the future? Do we really know that the sun will also rise tomorrow
morning? (See Hume 1975 [1777]:§29).
Hume did not succeed in purging ontology from philosophy and science.
Rather, he posited a particular, narrow ontology as absolute, as the only ontology
that is beyond doubt. Hume persevered tacitly to realism of senses, perceptions
and mind, but denied reality otherwise. If we assume that only sense-perceptions
and mind are real, what will be the resulting social ontology? Society is reduced
to individuals (recipients of sense-perceptions and recorders of constant
24 Overcoming international relations
In summe, in what matter soever there is place for addition and substraction,
there is also place for Reason; and where these have no place, there Reason has
nothing at all to do…. As when a master of a family, in taking an account, eastern
up the sum of all bills of expence, into one sum; and not regarding how each bill
is summed up, by those that give them in account; nor what it is he payes for; he
advantages himself no more, than if he allowed the account in grosse, trusting to
every of the accountants skill and honesty: so also in Reasoning of all other
things, he takes up conclusions on the trust of Authors, and doth not fetch them
from the first items in every Reckoning. Loses his labour; and does not know any
thing; but only beleeveth.
(Hobbes 1986 [1651]:111–12)
Genealogy of the international problematic 25
The spatial imagery of a strict inside/outside dichotomy rose gradually with the
new cartography of the Renaissance, bursting into blossom by the time of the
Enlightenment (Harvey 1990:240–59; Walker 1993:125–40). Originally, modern
cartography was also developed to help establish property rights over land and
to serve commerce, particularly colonial commerce. It soon became connected
to statist and, later, nationalist myths. Private property provided the model. In
an important paper published in the early 1980s, Ruggie (1983:276–9) noted
that modern sovereign states have been constituted as analogous to possessive
individuals (see also MacPherson 1964). In the case of an increasing number of
states and treaties, external sovereignty was constituted as analogous to private
property rights in the seventeenth- and early eighteenth-century Western and
Northern Europe. This gave the sovereign body an absolute and exclusive
control or ownership over all things ‘inside’.
The idea of nation as an exclusive community rose much later, at the time of or
after the French Revolution. Nationalist ideas were subsequently cultivated with great
fervour and also used to oppose the rule of empires and great powers, and unequal
exchange in the capitalist world economy (cf. Linklater 1990:55–75; Linklater
1998:145–78). In essence, however, the notion of exclusive national community is a
metaphorical extension of the capitalist conception of man, who owes nothing to
society to which he belongs, and whose possessions exclude the others.
Genealogy of the international problematic 27
In the course of the seventeenth and early eighteenth centuries, the capabilities
of (particularly Western) European states were greatly strengthened due to a
number of administrative and financial innovations. These innovations enabled
an increased mastery over activities on states’ ‘own’ territory and also
management of their overseas colonies. Colonialism in fact precipitated the
universalisation of European property rights and, also, legal regulation of the
‘high seas’. But for the emergence of the international problematic, the
development of state capabilities at ‘home’, as it were, was decisive. Seventeenth-
century France was a forerunner of many bureaucratic practices. External borders
were clarified—in 1718 a line was drawn on a map for the first time in a treaty—
and states assumed administrative control over the units inside the borders (see
Giddens 1985:84–102).
Socio-economic developments came to be carefully monitored by means of
state reports and memoranda. Gradually, the financial ties to the feudal system
were cut off. The new sources of state revenue included customs on economic
flows, taxes on transactions, establishment of trade companies and involvement
in economic developments. Moreover, states began to use lotteries—early form of
bonds—as a means of raising funds for wars or colonial projects, and they also
established central banks (Goede 1999). Diplomatic representation became both
more permanent and professional. By the eighteenth century, most European
states had large corps of diplomatic staff, at home and abroad. However, it was
not until 1796 that the terms ‘diplomacy’ or ‘diplomatic’ were used in English to
apply to the conduct and management of international relations, and not to the
study of archives. It was after 1815 that the status and rules of this profession
were established by an international agreement (Nicolson 1939:26–8).
The seventeenth- and eighteenth-century liberal philosophers dealt with the
problems of the modern European state. They justified the authority of the
sovereign state but also carefully circumscribed its rights and sphere. Internally,
the authority of the state was strictly limited both by the autonomous sphere of
civil society and by natural and positive laws. The rule of law was thus
established. Externally, the state emerged as analogous to private property. Hume,
for instance, lived in mid-eighteenth century Britain, in which the bourgeois
revolution had been completed and which was, evidently, a growing power and
empire. The relations between European states, or European powers and the
rest of the world, were not his first preoccupations. However, after having
completed his voluminous History of England in the early 1760s, Hume worked
at the British Embassy in Paris. In 1767–8, he was in London as Under-Secretary
of State, Northern Department. Also, in his essays, mostly drafted before his
practical-political experiences, Hume reflected upon the rise of England. He
argued for instance that Great Britain’s growing power was due to the prevalence
of ‘commerce and industry’ in Britain (Hume 1994 [1777]:219–30). Hume (1994
[1777]:135–42) also devoted one of his essays to the idea of the balance of
power. Like the diplomats and princes, who had written the principle down in
the Treaty of Utrecht in 1713, Hume saw balance of power as the solution to
the problem of European inter-state order.
Genealogy of the international problematic 29
by the time of the religious wars. ‘A feeling arose in the Renaissance and became
firm conviction during the seventeenth century that moralizing philosophy and
religious precept could no longer be trusted with restraining the destructive
passions of men’ (Hirschman: 14–15).
How is it possible to have an organised and relatively peaceful society in the
absence of God and His commands? This is the modern problem of order.
Hobbes made a novel, apparently secular argument for a solution, which
appeared at least somewhat similar to that of St Augustine. The Hobbesian
solution to the problem of order was coercion and repression, combined with
the idea of rationalising education, to replace religious command. However,
there was a flaw in Hobbes’s argument. Why should we assume the sovereign
to be different from other humans? Either the Hobbesian order would end up
in a full-fledged tyranny, or Hobbes would have to make the sovereign
accountable to God. In the first case, the whole argument would turn out to be
futile; in the latter, the system would have to rely, ultimately, on God’s commands,
i.e. religious belief.
The great innovation of the eighteenth century was to direct human passions
to the service of generally beneficial order. Hints at this solution can be found
in earlier texts, but it seems that the idea was articulated, first sporadically and
then more systematically, in the seventeenth and eighteenth centuries. Already
Bacon’s The Advancement of Learning (1605) mentions the practical-political idea
of countervailing passions within and between states. Later, for instance, Spinoza’s
Ethics (c. 1680), Mandeville’s The Fable of Fees or Private Vices, Public Benefits
(subsequent versions 1705, 1714 and 1729), Vico’s The New Science (1725) and
Hume’s A Treatise of Human Nature (1739), rewritten as Enquiries Concerning Human
Understanding and the Principles of Morals (1748 and 1751), discuss the idea of
setting one passion against another. Thus, Hume argued that in many cases
private passions, including the ‘passion of interest’, can best be controlled by
means of countervailing passions. Love of leisure can be countered with the
lust for gain. One (faction of) state can counter another (faction of) state, both
driven by passion for power and glory. It is therefore possible to engineer social
progress by setting up one passion to fight another.
Hume was ambivalent about the notions of interest and passion. Increasingly
since the early seventeenth century, prevalent discourses had contrasted ‘interest’,
which tended to acquire an economic meaning, against ‘passion’. Whereas the
aggressive pursuit of possessions by means of violence and power seemed a
source of misery and evil, the rational money-making of the bourgeoisie in a
stable and just setting, based on producing and exchanging goods, services and
assets, appeared beneficial to many. Thus rose the great civilisational quest to
fabricate the calculative self-interested man. In a particular social setting,
characterised by Hume’s three fundamental rules of justice—stability of possession,
transfer by consent and keeping of promises—calculative money-making appeared
to benefit everybody (i.e. property-owning men). In 1776, Adam Smith, who
had pleased Hume by summarising his Treatise so insightfully as a young student,
invented the metaphor of the ‘invisible hand’. Smith’s metaphor describes how
Genealogy of the international problematic 31
Hume claims that there have always been balance of power policies.4 This
rational policy is relentlessly threatened by human passions. An eternal sceptic,
Hume did not quite share the increasingly common sentiment that reason
could tame passions and transform them into socially beneficial interests. He
is famous for his phrase ‘reason is, and ought only to be the slave of passions’.
It is also telling how Hume complains that the balance of power does not
have a practical effect to the extent it should. In Hume’s view, statesmen are
driven by their passions, not by rational considerations. This condition does
not change. Here we have the elementary formulation of the to-be Kantian
distinction between causal phenomena and the impotent world of moral reason.
What is most noteworthy in this passage is that Hume is repeating his
metaphysical conviction that there can be no real changes. If society consists
of atom-like individuals, constant in their inner structure; and if, despite the
ubiquity of ephemeral changes, there are invariant regularities between relevant
As and Bs in society; it is hard to see what could, really, ever change.
Everywhere people must be the same as in eighteenth century England and
France. If not, they must be becoming the same, if they are rational.
Just as we view with deep disdain the attachment of savages to their lawless
freedom…and consider it barbarous, rude and brutishly degrading of humanity,
so also we should think that civilized peoples (each one united into a nation)
would hasten as quickly as possible to escape so similar a state of abandonment.
(Kant 1983 [1795]:115)
Genealogy of the international problematic 33
As long as there are no external laws, states are living in a state of nature. Thus,
Kant advocates the rule of law and elementary principles of citizenship also in
the international realm.
Kant introduced a far-reaching dualism to Western political philosophy,
that of phenomena/noumena. At the level of phenomena, Kant’s world is
ultimately Humean. Kant was plainer than Hume himself; for Kant,
international relations are in a state of nature. This is, indeed, the origin of
what Ashley (1988) calls the anarchy problematic. As a transcendental idealist,
Kant also introduced the noumenal world, the site of reason and morality.
However, this world was cut off from the world of causal phenomena. To
demonstrate that his theory was also practically useful, Kant was at pains to
introduce all kinds of ‘secret mechanisms’ that would eventually establish and
then maintain the legal order of perpetual peace. These ‘mechanisms’ resonated
with the ideas of the late eighteenth century, and they included: the ‘invisible
hand’ of free trade and the civilising effect of commerce; the republican
constitution of states, guaranteeing freedom and peacefulness; and attempts
to curb the capabilities of states to finance and wage wars.
Already ten years before his famous ‘Perpetual peace’, Kant tried to
explain why history should develop into a cosmopolitan and peaceful
direction by introducing the notion that nature, too, is teleological and
that ‘all of a creature’s natural capacities are destined to develop completely
and in conformity with their end’ and that ‘man’s natural capacities are
directed toward the use of his reason’ (Kant 1983 [1784]:30). The main
argument is that there must be a simple and crude but, from a rational
perspective, necessary learning process. The impetus for learning comes
from the experience of pain and the high costs of the ever more destructive
wars. A lengthy quotation from Kant illustrates the logic of the argument:
In the end, even war gradually becomes not only a very artificial undertaking,
so uncertain for both sides in its outcome, but also a very dubious one, given
the aftermath that the nation suffers by way of an evergrowing burden of
debt (a new invention) whose repayment becomes inconceivable. At the same
time, the effect that any national upheaval has on all the other nations of our
continent, where they are all so closely linked by trade, it is so noticeable that
these other nations feel compelled, though without legal authority to do so,
to offer themselves as arbiters, and thus they indirectly prepare the way for
the great body politic of the future, a body politic for which antiquity provides
no example. Although this body politic presently exists only in very rough
outline, a feeling seems nonetheless to be already stirring among all its
members who have an interest in the preservation of the whole, and this
gives rise to hope that, finally, nature’s supreme objective—a universal
cosmopolitan state, the womb in which all of the human species’ original
capacities will be developed—will at last come to be realized.
(Kant 1983 [1784]:37–8)
34 Overcoming international relations
Since Kant’s world is dualistic, the moral reasons (noumena) can do very little
to change determinist chains of constant conjunctions (phenomena). The learning
process, republicanism and the civilising effect of commerce gave him some
hope. Ultimately, Kant had to rely on anachronistic teleological/Aristotelian
assumptions about the God-given telos of humanity. Often, he phrased this idea
in terms of the late eighteenth-century notion of Providence. Kant thought that
there was a guarantee for morality only in the Kingdom of Angels (absolutely
disembodied rational beings), that is, in Heaven (see Kant 1956 [1788], 121–4).
Repeating the dilemmas of St Augustine, Kant was doubtful about the prospects
of human progress on Earth. No wonder Kant’s theory appeared for later
German thinkers to be not particularly ‘realistic’.
Since Kant and the wars of Napoleon, different solutions to the international
problematic have been discussed, some of them in some regards original. For
Hegel, of course, Kant’s consciousness was unhappy. It was unhappy because,
ultimately, it had to turn to other-worldly solutions in its search for morality
and rationality. So Hegel wanted to show that the existing world is already
rational. At one and the same time, as a kind of a by-product, Hegel also
invented the post-Enlightenment solution to the problem of universalism vs
particularism: universal progress is carried out by particular nations, which often
fight against each other. The winner carries history forward. Later, the Hegelian
idea of Cunning of Reason assumed the form of social Darwinism.
It was in this intellectual context that the term realpolitik was coined in
Germany after the unsuccessful liberal revolutionary year of 1848. The
disillusioned liberals drew the conclusion that liberal thinkers should denounce
their ambitious programmes of change. Some of them developed the fundaments
of realpolitik (Palonen 1987:99–102).5 The contrasting Other of the realpolitik
theorists and practitioners was Kant and the ‘rationalist’ Enlightenment thinking.
They also read romantic, conservative and other anti-Enlightenment texts, as well
as Thucydides or Machiavelli and, later, Charles Darwin or Friedrich Nietzsche—
and used these texts to articulate their differences from liberalism and
Enlightenment ‘rationalism’. However, in fact their scepticism and Machtpolitik-
rationalisations never questioned Hume’s and Kant’s account of the phenomenal
world, only the Kantian possibility of overcoming it. In fact, this impossibility was
implicit in both Hume and Kant. Hume believed that ‘nothing can change’. Kant
thought that inter-state relations were in a Hobbesian ‘state of nature’ and that
moral reason was outside the phenomenal world and therefore causally impotent.
The ‘realism vs idealism’ debate thus started well before the formal Anglo-
American academic discipline of international relations was established in the
aftermath of the First World War. Although the native American Reinhold
Niebuhr came quite independently to similar kinds of conclusions in the 1930s
and 1940s, he was much better known as a theologian. Outstandingly, it was
Genealogy of the international problematic 35
Hans Morgenthau who brought the German realpolitik discourse to the USA in
the late 1930s. Just listen to him in his most Nietzschean moment, contrasting
idealisations of ‘scientific man’ to the brute ‘realities of power polities’, when he
refers to Hume and, simultaneously, echoes Kant’s problem of the relationship
between phenomena and noumena:
The nihilism and will-to-power metaphysics of Nietzsche was in fact already built in
Hume’s scepticism and notion of passion. In the twentieth-century political realism,
the standard reference point has been the synthesis of Max Weber (see Smith
1986). Weber combined ideas from Kant, Hegel, romantic and hermeneutic thinkers
such as Schleiermacher, nineteenth-century positivists such as Kelsen and Comte,
realpolitik, Nietzsche and the turn of the century positivism. Implicitly he adopted
the Humean account of phenomena and causality. In domestic politics, Weber was
a liberal, a sceptical and sometimes critical believer in modernisation as progress. In
international relations he articulated a version of political realism that denied any
progress and emancipation. Weber thought that outside of modern nations there
are no shared values. The result is a quasi-Nietzschean struggle of wills-to-power of
different national leaders. Thereby, Weber justified international ‘power polities’,
that is, the unilateral use of military force to achieve aims and to establish relations
of domination. Besides this Nietzschean, anti-Enlightenment strand, Weber’s
positivism also opened up the possibility of ‘scientific’ political realism.
We can name some of the twentieth-century solutions in terms of attitudes
towards relations of domination. These attitudes were originally identified by
Hegel, but have been further elaborated by Bhaskar (1993:2–4, 330–5; 1994:107–
8, 208–9). Whereas the stoic tries to be indifferent and neutral to the reality of
the world, the sceptic attempts to deny the world. The Beautiful Soul thinks it
is enough to pretend that Heaven is already on Earth, and act accordingly. The
Unhappy Consciousness seeks refuge in other-worldly asceticism or solace in
the projective duplication of another world, and after-life or beyond, where for
instance happiness will be in accord with virtue (as Kant believed). The existing
relations of domination are the problem. It is because of the lack of any notion
of emancipatory transformation in this world that they opt for the stoic, sceptical
or other-wordly solutions, to accommodate themselves with the world.
A twentieth-century stoic tended to adopt an attitude of quasi-scientific neutrality.
Here we have the later Morgenthau, who claimed that political realism is the
scientific theory of international politics, and the neo-realists, who have become
faithful followers of a modern, scientist Hume by borrowing their methodological
36 Overcoming international relations
ideas from positivist economics. However, the Morgenthau of Scientific Man versus
Power Politics was extremely sceptical about finding any scientific knowledge about
the world, because, ultimately, the world is romantically tragic and Nietzschean.
Nietzsche relied upon the Humean account of the phenomenal world. He believed
that Hume’s ‘English petit bourgeois’ scepticism could be re-written, more boldly,
in terms of the metaphysical voluntarism of ‘will-to-power’. Some post-modernists
are very close to repeating this pattern (for a close link between the Humean
notion of causality, scepticism, Nietzsche and forms of deconstruction, see Culler
1983:86–7). Finally, the Christian realists such as Niebuhr, and early Herbert
Butterfield and Martin Wight, seem to verge on an Unhappy Consciousness. To
justify their traditionalist Humean theories and deep pessimism, and their retreat
from the world of emancipatory politics, they have returned to medieval
metaphysics, to the ideas of St Augustine. The Christian realists see the origin of
tragedies of wars to lie in the original sin of humankind, in the nature of the
fallen man. Salvation must therefore be otherworldly.
Thus, the fault line of the international theory problem-field solution set
starts with Hume and Kant, as shown in Figure 1.1, and continues via Hegel,
realpolitik, Nietzsche and Weber to the twentieth-century self-declared realists.
However, as we now know, their ‘realism’ was in fact based on the Humean
rejection of realist ontology and the social metaphysics this denial served to
justify and naturalise. Moreover, Hume’s conception of human nature originates
in Christian theology. Most twentieth-century forms of political realism and
Kantian ‘idealism’ merely reproduce this theory/problem-field solution set.
According to conventional wisdom, there have been four ‘great debates’ in the
twentieth-century academic discipline of international relations. The inter-war
debate between Kantian ‘idealism’ and political realism led to the dominance of
the latter after the Second World War. The second, methodological debate
between traditionalists and positivists took place in the 1950s and 1960s.
Positivism won this debate, at least in the USA and Northern Europe. In the
1970s, liberals took up the notion of interdependence and attacked the
assumptions of political realism. In the 1980s, various forms of post-positivism
have risen to dispute the basic tenets of positivism.
Most aspects of these ‘great debates’ can be summarised in terms of a relatively
simply and highly repetitious cognitive structure depicted above and in Figures
1.2 and Table 1.1. To clarify the possibilities, it is useful to distinguish between
three Humes and two Kants. Hume believed in Newtonian science, but was in
his own political analysis a classical political theorist. Hume2 demands the
application of the experimental methods of Newtonian science to political science
and international relations, whereas the Nietzschean Hume3 radicalises Hume’s
scepticism and notion of passion. Similarly, there are also two Kants (a
Nietzschean Kant would be a contradiction in terms). Like Hume, Kant was a
Genealogy of the international problematic 37
‘reality’ in this theory is mostly empty of content and structure? It is this order
of things that my dialectical comment aims to disturb. As Derrida puts it:
Derrida (1982:329–30) argues further that there are always concepts and
meanings that are irreducible ‘to the dominant forces’ behind the international
problematic, and which have resisted that conceptual and/or non-conceptual
order. This is in fact the Foucauldian moment, entailing resistance to every
repression. Classics, such as Thucydides’ Peloponnesian Wars, can be re-read to
demonstrate how they are irreducible to the dominant understandings. Karl
Deutsch’s notion of pluralist security community, or Hayward Alker’s learning
and manifold methodological innovations, are examples of concepts and
resources that can be used for effective interventions in this theory/problemfield.
Derrida does not go far enough, however. There is also a quest for new
conceptual resources. Reality should be reclaimed, also by introducing negativity,
absence and emergence, and thereby real change. By positing reasons as causes,
we locate meaningful human activities in the differentiated, stratified and
structured world. Moreover, the intransitive and transitive dimension of research
should be distinguished. Thereby much confusion about the relationship between
subject and object of research can be clarified. By being able to identify the
internal and external relations between social/political theories and social realities,
without conflating the two, critical realism promises a new emancipatory
conception of social scientific research.
Finally, but crucially, the Habermasian aspect of a dialectical comment entails
a willingness to communicate with and listen to all. A comment would not be
dialectical if it did not engage with, and learn from, the others in the field.
Notes
1 About the concept of theory problem-field solution set, see Bhaskar 1994:10, 215–16
and 224.
2 Strictly speaking, the term ‘positivism’ is historically misleading, as it was invented by
Saint-Simon and Comte in the nineteenth century. In the eighteenth century, Hume
was quite rightly conceived of as an empiricist. With Bhaskar (1986:226), however, I
concede that ‘most of positivism is already contained [and] elegantly expounded in
the writings of Hume’, and that it is best to see positivism as a causally efficacious
movement of thought and cognitive structure, with a number of possible variations
42 Overcoming international relations
and permutations within it. Thus, in the twentieth-century economics (the positivist
social science par excellence), for instance, there has been a long-standing debate between
the rationalists, who have substituted empirical regularities with the illusionary certainty
of hypothetical mathematical-deductive connections, and empiricists, who are typically
involved in the much more messy and tricky business of statistical analysis (econometrics).
3 Two seminal accounts of these transformations are Karl Polanyi’s The Great
Transformation (1957 [1944]:33–102) and Barrington Moore’s Social Origins of Dictatorship
and Democracy (1969:3–39). England was a pioneering country, with the first phase of
transformations taking place in the seventeenth century and the second accompanying
the Industrial Revolution. According to Moore (1969:24–7), the consolidation of
capitalist property rights of the landlords and bourgeoisie went hand in hand with the
destruction of the medieval rights to land of the small peasantry. Polanyi (1957
[1944]:77) summarises the extent of transformations, enforced often by what he calls
‘inhuman methods’: ‘Eighteenth century society unconsciously resisted any attempt
at making it a mere appendage of the market. No market economy was conceivable
that did not include a market for labor; but to establish such a market, especially in
England’s rural civilization, implied no less than the wholesale destruction of the
traditional fabric of society.’
4 Polanyi (1957 [1944]:259–62) argues that England conducted something close to power-
balancing policies against the potential preponderance of any particular Continental state,
already 200 years before balance of power was established as a principle and system in
eighteenth-century Europe, and before Hume theorised it as a historical law or invariance.
Polanyi may, however, be a victim of the confusion of Hume. Hume’s claim about
balance of power as invariant conjunction confuses the practice of making allies (common
for instance in Ancient Greece, cultivated by modern England in the fear of the
Continental powers) and the empiricist understanding of the mechanical or Newtonian
‘law’ of balance of power. The idea of ‘balance’ is mechanical and presupposes familiarity
with the science of Galileo or Newton. Moreover, the term ‘states system’ occurred
only sporadically if at all before the Napoleonic period (Bull 1977:12), and the balance
of power system was not formalised before the Concert of Vienna in the nineteenth
century. Contrary to his own understanding, Hume was thus, in fact, a conceptual
innovator, and certainly not a mere passive observant of a universal ‘law’.
5 Although this is slightly out of the reach of my story, it is certainly not insignificant
that there were also quite different (pathological) reactions to these same developments.
By the time he had already renounced his academic ambitions as politically impossible,
in 1843, Karl Marx, in the face of censorship, had to move to Paris in order to continue
his journalistic activities. This was the turning point. Marx started to see radical
revolution as a necessary condition for any change in Germany. In 1845, he was
banished from Paris as a dangerous revolutionary. On the outbreak of the Revolution
of February 1848 he was banished from Belgium, and, in 1849, again from Germany,
and, in June 1849, again from Paris. After these traumatic events in his formative
years—he was thirty in 1848—he finally settled in London, where he wrote the bulk of
texts associated with ‘Marxism’. As a kind of mirror image of the conservative realpolitik
thinkers, Marx adopted the assumption that no peaceful changes are possible, only a
one-shot revolution. In 1895, [Link] (1968 [1895]:21) praised Engels by arguing
that ‘hatred for political despotism was exceedingly strong in’ Marx and Engels,
‘combined with a profound theoretical understanding of the connection between
political despotism and economic oppression’. Lenin, who was facing even more fierce
political oppression in Tsarist Russia (e.g. the killing of his brother), contended that
Marx and Engels ‘became socialists after being democrats’. However, it should be noted
that at this point Lenin, too, seemed to be also in favour of political liberties. See
Chapter 4 for remarks on the further developments, and Chapters 5 and 8 about the
political significance and far-reaching consequences of this multi-stage, pathological
learning process.
2 Learning from Alker
From quantitative peace research to
emancipatory humanism
After the catastrophe of the Second World War, Kantian themes re-emerged,
in a positivist guise, within peace and conflict studies. Kenneth Boulding,
Karl Deutsch, Johan Galtung and many others hoped that positivist social
sciences would help us to overcome the international problems posed by the
threat of violence and war. Although the mainstream in the USA—powerfully
boosted by the US financiers of the cold war—believed in the eternal wisdom
of political realism, peace researchers thought that scientific research could
demonstrate that political realists were wrong. Even better, they envisaged a
more peacefully organised world based on scientific knowledge. Like modern
medicine, peace and conflict studies can help save lives. Given the spectre of
thermonuclear war, this task assumed the urgency of saving the whole of
human civilisation.
In the early 1960s, while international relations (IR) as a discipline was
dominated by the classical political realism of George Kennan, Hans Morgenthau
and Henry Kissinger, the behaviouralist movement was reaching its climax in
the social sciences (particularly in the USA and Northern Europe). Hayward
Alker shared the belief of positivist peace researchers that it was possible to
improve political conditions by means of knowledge produced by scientific
method. He also attacked the loose journalism of foreign policy analysis, as
well as the classical speculative style of writing IR, both of which were common
in the early 1960s. Despite the tenets he shared with the mainstream of peace
research, Alker has been exceptional in one crucial respect: he learned from his
trials and errors and, step by step, changed his direction.1
By showing how one can learn, indeed develop, radically new approaches,
Alker makes us see the false necessity of the rigidities of a given academic
43
44 Overcoming international relations
character and goal. Even more importantly, Alker provides critical resources for
overcoming the international problematic. Alker’s numerous writings—some of
which were collected together and published in 1996 as Rediscoveries and
Reformulations: Humanistic Methodologies for International Studies2–in fact come very
close to critical realism (CR), and in part are explicitly inspired by CR. At the
end of this chapter, I shall discuss some ambiguities concerning ontological
idealism/realism, and point towards possible further steps in this learning process.
Alker’s scientific career began with measuring politics and analysing the United
Nations (UN). The angry idealism of Alker’s first phase had three major pillars:
the advocacy of modern, rigorous social sciences; a pluralist theory of conflicts,
according to which there are always many different actors and at least some of
the conflicts are based on misperceptions; and the belief in the potentialities of
the U N. Alker’s advocacy of rigorous, Newtonian social sciences
notwithstanding, even during this phase he always tried to take seriously the
claims of those who did not share his fundamental assumptions and concerns.
Alker saw open, critical discussions among representatives of different approaches
as a prerequisite for scientific progress. Reliance on a given scientific method—
whatever it may be assumed to be—was not enough for him.
The second pillar was his theory of conflicts—and particularly the cold war
conflict—that relied partially on the notion of misperception and partially on his
pluralistic account of social reality. Alker, together with Bruce Russett, asserted
for instance that ‘numerous uncertainties and misperceptions continue regarding
the nature of the distinct issues before the UN’. There are many different
political opinions, but they are often misperceived and their nature
misunderstood. Alker and Russett went on to ask whether ‘we can we find a
way of naturally and objectively summarising the issues before the General
Assembly without losing their specific content?’ (Alker and Russett 1965:9). It
is noteworthy that they indicated that the moral-political aim of their study was
the reduction of international tension.3 In arguing for this possibility, they built
on an underlying Grotian—although positivist and utilitarian—account of the
nature of international society, according to which rules and institutions matter:
Rules of order are often followed, in both local and international society,
because of various kinds of expected gains or losses. A reputation for morality
and law-abidingness can be useful, and the contrary reputation damaging.
(Alker and Russett 1965:146)
The third pillar of Alker’s first phase is related to the UN. The direct (even
if qualified) analogy between domestic party politics and world politics in the
General Assembly gives support to his trust in the positive potentialities of
Learning from Alker 45
the UN. ‘The world could be thought of as a political system in which the
major blocs are analoguous to two parties that compete for the favor of the
uncommitted voters’ (Alker and Russett 1965:147). Further, Alker claims,
with Russett (1965:148), that ‘by providing a forum where the parties must
participate in a continuing electoral competition for the allegiance of the
neutrals, the United Nations performs a major function in preserving the
system’s stability’. To summarise, the moral-political idea behind Alker’s
behaviouralist phase was to try to overcome—or at least ease—the tension
between the blocs of the cold war with the help of a more adequately
functioning UN and by utilising the increased scientific understanding provided
by empirically oriented modern social sciences.
The problems of the post-colonial world come more deeply into this picture
only gradually. It was while teaching methodology as a visiting professor in
Santiago, Chile, in the early 1970s that Alker most fully opened his mind to
radically different epistemological perspectives, which he found to cut across
different geo-historical locations. In Salvador Allende’s Chile, Alker had many
discussions on the techno-rational domination aspects of rigorous and expensive
standards of scientific practice. Eventually, he made the observation ‘that
mathematical social science…distorts social relations to the extent that it
participates in and mirrors our alienation under “capitalism”’(Alker 1996:186).
At least in his later rationalisation, Alker refers to his lengthy 1974 article
Are there structural models of voluntaristic structural action?’ as summarising
a major turning point (but see also Alker and Christensen 1972). Nonetheless,
he, like Herbert Simon, still used, even if not exclusively, the criteria of validity
of the statistical tradition. That 1974 article ‘evoked considerable resistance’
among the causal modelling logical empiricists of the time and did not find a
publisher in the USA (it was published in the European journal Quality and
Quantity in September 1974). Alker argued that
humanistic scholars are often right (sometimes for the wrong reasons) in
objecting to statistical models of complex social behavior; but [I claim] also
that a faith in positivist approaches to explaining voluntaristic social action is
sustainable if an appropriate transformation of those approaches takes place.
(Alker 1974:199–200)
In other words, he argued that the faith in positivist social sciences has to be
reconciled with the social theories of more ‘humanistic, voluntaristic social action’
scholars, otherwise the faith in positivism is not justified. In his second phase,
culminating in the Chile years and lasting till 1976, Alker was gradually moving
towards thoroughly informational—and also hermeneutico-dialectical—ontologies
as well as towards his new, more late-modern (or post-modern) mode of
globalism. All the time, he was trying to account carefully for all of his
epistemological, methodological and substantial moves. In methodology, he
was mostly interested in artificial intelligence, qualitative-mathematical analysis
46 Overcoming international relations
When doing more concrete research, he still wrote about the evolutionary
possibilities of the UN’s conflict management system, but was also increasingly
concerned about ecological problems and large-scale violent conflicts such as
the War of the Pacific. By 1977, he had developed the basics of a new dialectical
approach to the study of world politics, an approach that was not only reflective
but also very original, even idiosyncratic. An outstanding paper from 1977, in
which he outlined many of his new ideas, was called ‘Can the end of “power
politics” be part of the concepts with which its story is told?’ Even though it
was merely an unpublished APSA paper,4 it was the first time he made public
the most radical results of his long rethinking process.
Alker’s third phase can be termed as ‘emancipatory humanism’ He
summarises his position by asking whether we can combine ‘the evident
ability of the greatest historical writers to catch our moral and political
imaginations, and comment profoundly on the choices before us’, with the
‘hard won professional commitment to falsifiable scientific theorizing’ (Alker
1996:270). Indeed, this is what he is now doing, trying to combine history-
based, moral-political visions with falsifiable scientific theorising. In
explicating his innovative position further, he also comments on Immanuel
Wallerstein’s challenging point that the heavily narrative accounts of most
historical research seems not to lend itself to quantification:
This passage should give a good starting point for analysing Alker’s humanistic,
emancipatory approach to the study of world politics. In the following, I shall
Learning from Alker 47
These lessons can and should be read both as intepretative guidelines to Alker’s works
and as methodological and theoretical arguments that are based on Alker’s works.
constructed simply from the roll call votes of countries in the General Assembly
and from the (subjectively interpreted) situational characteristics of these
countries. As Alker learned that factor analysis, too, can have a causal
interpretation, he worked for a while on the assumption that empirical
phenomena are structured by some underlying causal models. Yet there seemed
to be something wrong with this approach, too, for the world political reality
was, at least apparently, able to transcend all attempts to specify a definite
causal model, with unchanging, lawlike co-efficients, of its causal interrelations.
This was the case despite the fact that the studies focused intensively on only
one functional area, collective security practices.
Perhaps there was something wrong with the most fundamental assumptions
concerning the nature of reality and data? Indeed, in Alker’s tenacious, self-critical
attempts to improve upon his past modelling exercises, there finally occurred an
explicit ontological shift from mechanical, Humean causality, first towards an
ontology of information-processing, characteristic of cybernetics and artificial
intelligence, and then, eventually, towards an ontology of conversational or discourse
analysis (see Alker 1986:2).
Most clearly, the ontological shift was evident in Alker’s new modes of
understanding what data are. For one thing, he began to rely on precedent logics
‘that are used like psychologies to interpret historical successes and failures’
(1976:51). The testing of hypotheses about precedent logics—what are the relevant
precedent cases and how do actors learn from them?—required a new kind of
data, namely verbal data. As a consequence of this and other similar
methodological shiftings, data was not seen any more merely in terms of
behaviour and (subjectively interpreted) situational characteristics. Eventually,
Alker generalised from his experiences—which made it rational for him to broaden
and deepen the notion of ‘data’—and concluded that ultimately our understanding
of the nature of data depends on our social ontology. According to the ontology of inter
subjectively constituted social entities, we have to have multi-perspective
descriptions of social events and characteristics:
refers here to the fact that the standard North American statistical practices
were—and still are—much too individualistic and mechanistic to account for
internal relations between entities. As Ollman puts it, positivist practices presuppose
that any social factor is always:
Alker himself is not arguing that there are merely internal relations (as
some Hegelians—and even Ollman—might do). Rather, his point is that the
identities of social entities, such as rules, actors, institutions, are relational,
that social entities are interconnected at the ontological level. For one thing,
this view makes it possible to claim, as Bhaskar (1986:307–8) does, that
‘positivism at once naturalises and normalises things and reflects in an endless
hall of mirrors the [individualistic and atemporal] self-image of Bourgeois
Man’. Methodologically, the dialectical view evokes the claims that
quantitative, compositional statistical knowledge is no more objective than
the essentially qualitative, relational knowledge and that, rather, one should
accept that the standard statistical knowledge is conceptually and ontologically
secondary to the much richer relational knowledge (see also Chapters 4 and
5). However, let us hear how Alker himself explains and clarifies his position:
When both internal and external relations affect case relationships, case-
matching or ‘precedent-seeking’ efforts become more complicated and
historically more interesting. Similarity matching and dissimilarity contrasting
of procedurally specified data stories are ‘external’ comparisons unless the
attributes used in such efforts are in some sense essential, characteristic,
and case-identifying ones. Obviously one wants to relate essential
characteristics. The analytical problem, for which statistics can only be
moderately helpful, is to develop ontologically cogent distinctions between
essential and inessential properties of the case descriptions.
(Alker 1996:352)
(with whom they more or less share some but not all formative perspectives).
Quite obviously, with the adoption of this kind of understanding, data lose
their seemingly innocent and neutral status. Consequently, one should get rid of
the reifying effects of the positivist social scientific practices. In line with Alker’s
new epistemological perspectivism and his lasting aim at constructing falsifiable
scientific theories, Alker advised, in 1977, scholars to follow the rule that
Later on, he generalised and radicalised his position, without ever advocating
incommensurabilism or an absolutely relativistic ‘anything goes’ attitude. All of us
who are advocating modest relativism know that the incommensurability thesis, for
instance, is vulnerable to the problems of self-reference.7 Alker’s point has always been
to enable better scientific practices. In 1988, he stated his own ‘first lesson’ for empiricists as
follows:
about the case inclusion and exclusion rules for a data set. They should also be
explicit about the normative bases, procedural preferences and political allegiances
that inform coding practices. To emphasise: this kind of normative and
methodological explicitness is a prerequisite for adequate data construction. An
ontological parallel is that in the construction of a data set, one should include
the reasons actors give for the actions they take (or avoid) and the precedents
they cite (or avoid) for justifying their (in)actions. Moreover, one should also include
different narrative accounts of the relevant episodes, as presented by the actors.
The hermeneutically, by searching for a justified, sense-making account of the
whole, reasons, precedents, and narratives of actors can then be analysed both
and critically, by assessing the validity claims of the actors.
Finally, there is the question of deviant cases. Galtung (1977:72–95) has argued
that the study of the deviant cases should teach us the crucial variables of an
invariance and thereby help to break that invariance. Trying to go beyond this
formulation, Alker also demands that one should take the best and the worst cases
and use them to uncover the practical ‘grammars’ of action and habit that make
such outcomes possible. These ‘grammars’ should then be seen as in part changeable,
that is, as both potentially negotiable or debatable and therefore political.
work that has emphasised simple specifications and often interpreted them
positivistically or behavioristically’ (1974:236). He argued, however, that these
‘empirically more adequate’ models of iterated or repeated games are necessarily
more complex than their predecessors. For instance, the artificial intelligence-
looking Emshoff model—in which a game has both a history and a future, and in
which actors are learning from the outcomes of their previous choices in a
probabilistic manner, according to a certain algorithm—is empirically more adequate
and a possible solution to the problematic of infinite reflective regress back to
mutual calculations. It is also a very different and much more complex solution
than the standard—but in most cases misleading—simple prescription of minimax
(guaranteed loss minimising) and maximin (secure-gain maximising) strategies.
In many real-world cases of collective goods, the assumption that PD games
have a history and a future, too, is adequate, but not in all cases. If the stakes
in a ‘game’ are extremely high, such as the loss of one’s job, years in prison,
death of a subordinate by the masters (perhaps in a concentration camp) or a
nuclear war [sic!], would not a single-choice ‘game’ be a more adequate model
than an iterated one? In his 1975 analysis of the descriptive foundations of
polimetrics, Alker turned, following Baumgartner and Burns, and Burns and
Buckley, to developing a new PD model by redefining it as a three-actor
domination system and not viewing it as an iterated game. In this way he was
able to analyse the logic as well as moral and causal limitations of ‘divide and
conquer’ or ‘divide and rule’ strategies of various rulers/masters of world history,
including those of the modern totalitarian nation-states.9
In contrast to the standard practice of presenting the PD in an abstract form,
Alker (1975:180–8) developed a three-party model in the context of the narrative;
it was probably developed after the original model with its numerical payoffs. In a
typical variant of the PD narrative, the position and power of the District Attorney
(DA) is not taken into account at all, although (s)he has the capability to get the
prisoners to do something they would not otherwise do. This capability is based
on the DA’s position in the system of legal and penalty practices, which enables
him or her to control communication. That control is crucial in this context in at
least two respects. First, the DA can use it (plus the absence of counsel) to decrease
the trust that Prisoner A has in Prisoner B. Second, he or she can use it to paint an
unconnected and rather black picture of Prisoner B, thus causing Prisoner A to feel
less loyalty to (which pre-exists contingently), solidarity with and sympathy for his
or her partner (see Alker 1975:180–8). The loyalty of prisoners might be, for
instance, due to the fact that they belong to the same community, movement or
organisation, which has particular normatively binding regulative rules of its own.
Seen in this way, the modality or technique of power on which the PD is based is
panopticism, described, modelled and explained by Michel Foucault:
This socially produced invisibility of individual actors is also the crucial facility
for the power of the DA. This asymmetrical power would be diminished if
there were regulative rules such as ‘prisoners should be allowed to talk with
their attorney before interrogation’, or if the prisoners nevertheless considered
each other as loyal partners instead of strategic calculators. Indeed, even the
associated PD narrative is an extremely abstract interpretation, for it abstracts
this ‘game’ from the more contextual rules as well as from the questions such
as: Are the prisoner’s really guilty? Why are they committed to egoist means-
ends rationality? Why are they not allowed to talk to their lawyers before
interrogation (a talk with an attorney would provide the possibility of indirect
communication)? Consequently, can it be said that the legal system in question
is democratic or authoritarian, just or unjust? On the basis of these
considerations, Alker developed a complex model for estimating the power of
the DA and the factors giving rise to that power.
Alker did not stop developing his approach to resolving PDs to this still
somewhat positivist model of three-party system of domination (his criticisms
of positivism notwithstanding, he was still looking for a model of ‘rational’
incentives and, indirectly, for mathematically definable and analysable structural
invariances of social action). In Alker and Hurwitz’s 1980 ‘student manual’
called Resolving Prisoner’s Dilemmas, there is an explication of some of the substantial
and edificatory (cf. Rorty 1979:359) reasons for moving towards thoroughly
conversational and dramaturgical social ontologies. It is not only that empirical
and theoretical reasons seemed to be pointing towards more complex models of
intentional and voluntaristic—although in a sense also structurally determined—
social action. It is also the case that there are close connections between the
world understandings of different research paradigms—with their parallels in
the actors’ world understandings—and of actors’ abilities to resolve PDs. These
connections include the following (Alker and Hurwitz 1980:80–82):
The difficulty which mankind experiences and which it is able to solve only
very gradually and very slowly is one of achieving a conceptualisation of the
self that will permit that reciprocity to emerge as something more than a
kind of appendage to an already fully constituted self. The crucial insight on
which Hegel’s position rests has to do with interdependence of our concept
of the self and our concept of the other and it involves the claim that these
conceptions move forward pari passu until they reach a point at which an
underlying identity of the one with the other is grasped.
(Olafson 1979:241)
Olafson is very careful in pointing out that the underlying identity is not the
Hegelian absolute one, nor should it be seen as a deterministic outcome of an
evolutionary history. Rather it involves the constitutive reciprocity and potentialities
of change inherent in the Habermasian undistorted and unconstrained
communicative interaction.10 With these points as a background, one should be
able to understand Alker and Hurwitz’s concluding lines in 1980 on PDs:
In his 1980 article with James Bennett and Dwain Mefford (1980), which fuses
Alker’s interests in the UN’s collective security system and PD modelling, he
goes on to analyse security as a contradiction between individually reasonable
security-seeking practices and collective security. That is, Alker scrutinises
problems of traditional state security in terms of a PD situation. By building
upon an organisational model based on the Emshoff model, in which actors are
Learning from Alker 55
learning from their previous choices, and by ending up with something that is
quite reminiscent of Deutsch’s well-known cybernetic model of foreign policy
making,11 Alker, Bennett and Mefford try to analyse not only the practical
intentionality of means-ends calculation but also the processes that define and
re-define political goals themselves.
By focusing their analysis on the cognitive processing within organisations,
Alker, Bennett and Mefford emphasise the importance of studying how
precedents or lessons from the past are initially acquired, and then subsequently
applied, in the practical activity of problem solving during the day-to-day conduct
of foreign policy. Their point is that an individual actor’s course of action is
principally determined by the prior construction of the set of possible initiatives
and consequences attributable to the parties involved. This is a cognitive process
that involves defining new situations on the basis of comparison with previously
experienced situations which exist as richly described incidents in individual
and collective memory. These incidents are typically structured as sequences of
action, as narratives, the elements of which are selected and linked together by
causal and intentional connectives to yield a plausible ordering or configuring.
From this perspective, there can be argued to be two ‘orders of constraint’ that
shape and select policy:
1 The first comprises factors that set a bound on the scope and diversity of the narratives
(or scenarios) the policy-making apparatus is capable of generating. In the
Foucauldian terminology, discourses are bound by the way they characterise
the field of possible objects, establish the way language functions with respect
to objects and position the speaking and acting subject. They are also bound
by the available set of narrative structures.
2 The second involves the mechanisms of choice that select and amalgamate the narratives
under consideration. In the Foucauldian terminology, this is a matter of the form
and way of localisation and circulation of discourse(s) within the foreign-
policy-making organisations. Power, also in the more traditional Weberian
sense—as the capacity of an actor positioned within a social relationship to
carry out his or her own will despite resistance—and organisational structures
come into the picture here.
Even though the original point of departure had been, a decade earlier, the
rather positivist study of collective security practices and iterated PDs, Alker is
here taking on board Foucauldian (Foucault 1991a:56–7; Foucault 1991b) and
Ricoeurian (Ricoeur 1984 and 1988) ideas in explaining changes and continuity
of foreign policy.
Also of great interest for an IR theorist is the result of Alker’s more recent
game-theoretical study entitled as ‘Fairy tales can come true: Narrative constructions
of prisoner’s dilemma game play’ (originally with Karen Rothkin and Roger
Hurwitz). The main claim is that the mechanisms generating reciprocity are not
the simplistic ‘tit-for-tat’ strategies discussed by Anatol Rapoport and [Link]
(1965) and, in an evolutionary fashion, by Robert Axelrod (1984).
56 Overcoming international relations
On the basis of Alker’s lessons, how should we analyse and help to solve collective
dilemmas? Acknowledging that there are many very different situations of collective
dilemmas, Alker’s methodological point is twofold. First, we should really study the
relation between the rules of play and the game, and the rules and the often
asymmetrical resources of different social contexts in which the PD game is
embedded, rather than abstract and reductionistic models of PD. Second, the focus
should be on studying the (logics of the) scripts about actors’ moral choices. The
precedential, narrative-embedded and dramaturgical lines of these scripts are up to
an extent revisable in the actual context of their recurrence—and thus potentially
political.
Recall now the response of Alker to Immanuel Wallerstein’s challenge, to the claim
that the heavily narrative accounts of most historical research do not seem to lend to
quantification. Alker argued, in accordance with the main results of his studies of the
late 1970s and early 1980s, that ‘we must broaden and deepen the universe of
scientifically relevant modelling approaches appropriate for the formal analysis of
interpretative and theoretical world histories’ (Alker 1996:271). From the early 1980s
onwards, Alker has followed his own suggestions and instructions rigorously. For
many in the mainstream of political science, IR and peace research, Alker’s two
hermeneutical-structuralist studies of this period may appear utterly incomprehensible,
or at least irrelevant to the concerns of ‘serious’ political science.12 In contrast to this
view, I would like to argue that these studies by Alker are among the most innovative
published IR works of the 1980s. In the ‘Two reinterpretations of Toynbee’s Jesus’,
Alker (originally with Lehnert and Schneider) poses the basic research questions as
follows:
In the light of both traditional and more recent scholarly approaches, what
are the appropriate, scientific ways to ascertain the meanings we and others
see and feel within the Jesus story? What motivating power or charisma
does this story, or ‘myth’, contain? What basic structure or structures, what
infectious, self-replicating, ‘viral’ qualities account for this power? Can we
replicably discover and empirically test some of the ways in which its deepest
structure has been rewritten into different ‘surface’ texts? Or do the ‘rewrites’
Learning from Alker 57
I take it to be one of the reasons for the relative ‘success’ of such studies is the
extent to which the authors or their fellow workers have combined drama-
like readability with an impression of historical and/or scientific
trustworthiness and the quasi-metaphysical, almost inevitable grandeur to
which seekers after total truths (including myself) are especially susceptible.
(Alker 1996:268)
Alker is also perfectly aware that the determinism—and, yes, positivism—of the
studies of Jay Forrester, Ferdinand Braudel, and Ernest Mandel does not ‘leave
much room for individual or group responses to the enormous moral, economic
and political issues raised by their accounts’ (Alker 1996:269):
Life is not a myth or a fairy tale with a guaranteed happy ending; neither is
it an inevitable tragedy, one that encompasses all of Western civilization or
the human species. Nor are most political or cultural leaders successfully
heroic. Should one then refrain from attempting to give meaningful
interpretations to world history? Or should one try to refrain from being
‘ideological’ in making such efforts, if it is indeed possible to do so? Can we
Learning from Alker 59
The bulk of the paper is devoted to the analysis of narrative structures. The point
of departure is the way Vladimir Propp, in his analysis of Russian fairy tales,
reduces all of his analysed tales to a single structure, or schema, with two variations
of the middle-of-the-stories events. First, the setting, or the preparatory aspect of
a story, is defined. Then, the story develops because of villainy or because of
some kind of lack; through the completion of some difficult tasks, either the
villain must be overcome or the lack must be resolved, or both. In the middle of
the story, there are two possibilities: either (1) a ‘struggle, then victory’ sequence
of episodes, with the branding of the hero, a struggle between hero and villain,
the defeat of the villain (this is the peak of the narrative, perhaps with some initial
misfortune), the pursuit of the hero, as well as unfounded claims by false heroes;
or (2) a ‘solution to a difficult task’ sequence of episodes, with unfounded claims
by false heroes (the hero still at home), a difficult task, the branding of the hero
(by a princess, often in and/or before a battle), with a preliminary or almost final
solution possibly occurring (this is the peak of the narrative, perhaps with some
initial misfortune) and the pursuit of the hero—with the hero being rescued from
pursuit. Finally, the hero is recognised, he is given a new appearance or new
possessions, and false heroes, as well as any still unpunished villains, are dealt
with. The end scene is that the hero is married (to the princess) and/or ascends
to the throne, etc. This narrative structure can be generalised as a set of rewrite
rules for a simple story grammar, in terms of STATES and EVENTS, and THEN
connectives, and CAUSE connective.
This more abstract and formal way of expressing the generative structure of
narratives is argued to be better at conveying text composition syntax than the
semantic and pragmatic dependencies among story elements that are still present
in Propp’s formalisation. Nonetheless, Alker sticks to the formal rewrite rules
when he illustratively analyses the structure of Mandel’s Long Waves of Capitalist
Development. The point of this illustration is that we should measure ‘actual
historical sequences’ and ‘think of possible world historical developments in
terms of alternative story grammars’ because this kind of thinking and analysis
‘can enlighten and orient us toward a better future of constrained, but multiple
possibilities’ (Alker 1996:302). In other words, the analysis of narrative structures
should be brought back to the analysis of world historical episodes, tendencies
60 Overcoming international relations
and structures, in order to make models of these better, to make them more
imaginary and more sensitive to different world historical possibilities.
What else has Alker to say about how to broaden and deepen scientifically
relevant modelling approaches in the post-positivist era? In a paper from 1981,
when commenting more generally on the global modelling studies evoked by
the Club of Rome’s amplification of debates occasioned by Jay Forrester’s World
Dynamics, Alker (1981b:353) says that ‘too many people identify political/
administrative cybernetics with Forrester-Meadows world modelling’. They may
also identify it with sophisticated econometric modelling that includes, for
example, error feedbacks, or with the more complicated models of systems
theories. However, this perspective on global modelling is very narrow and
problematic for a number of reasons:
Alker not only points to the relevant social theoretical contributions for enriching
the potentialities of global modelling, and admitting that the relevant literature is
often ‘heavily historical and not easily formalizable’ (Alker 1981b:370), but also
outlines a new set of problems for those global modellers whose background is in
the cybernetic tradition. He terms this the dialectics of state formation in centre-periphery
systems. It is dialectical because the works within this problematic should also deal
with identity-involving or constitutive relations between centres and peripheries,
and between national parts and systemic wholes. The emphasis on state formation
comes already from Deutsch’s cybernetic- and communication-centred studies on
nation building. The idea that all this occurs in larger centre—periphery structures
came, naturally, from the Dependencia school and World System Analysis, and it
was supported by the (still popular in 1981) studies of dependency relations and
autocentric strategies for their reversal. However, Alker wanted to include other
themes as well, and to show how all these could be systematically modelled:
Despite his inspiring suggestions, Alker has never systematically shown, by doing an
exemplary study, how one could and should conduct more appropriate global
modelling. Perhaps Ashley’s 1980 book The Political Economy of War and Peace, which is
a brilliant work in the lateral pressure paradigm, comes closest to what Alker has in
mind.14 Nevertheless, his methodological works from 1977 onwards offer many valuable
hints, concrete suggestions and even detailed, although only partial, modelling exercises
for anyone interested in improving the past global modelling approaches. In addition
to the models of narrative structures, one can find among his works a refined, artificial-
intelligence-based analysis of the inner relations of imperialistic powers and their self-
reconstituting, power-balancing systems (Alker 1996:184–206); a case for a ‘neo-
classical polimetrics’, which is grounded in political argumentation about practical
choices in particular contexts, argued itself in the context of analysing and quasi-
formal modelling of the ‘dialectical logic of Thucydides’ Melian dialogue’ (Alker
1996:23–63; see also 1984); and exercises in linguistic, computer-assisted political
discourse analysis (for instance, Alker et al. 1991a; Alker et al. 1991b). These and
other related works may not amount to an articulation of a full-scale approach to
rigorous post-positivist global modelling, but they are important arguments about
how to broaden and deepen scientifically relevant modelling approaches.
Many of the points made in the previous three lessons are immediately relevant
from the point of view of the question ‘How do you theorise history?’ Next I
will now finally scrutinise Alker’s approach to theorising world history by
exposing the ‘Dialectics of World Order’ project, initiated by Alker and developed
in transnational co-operation with Tahir Amin, Thomas Biersteker and Takashi
Inoguchi.15 The ‘Dialectics of World Order’ project is an attempt to theorise
recent world history and possible futures in the dialectical and post-positivist
terms of Alker’s third phase.
The first outcome of this project was the publication of the article called
‘Dialectical foundations of global disparities’ (Alker 1981a). In that article, four
contending, inter-penetrating systems of global order are proposed as appropriate
units for an inquiry: capitalist power balancing, Soviet socialism, corporatist
authoritarianism and collective self-reliance. The fact that one can no longer be
content with some of Alker’s hypotheses does not invalidate his approach. On
the contrary, the far-reaching and unexpected transformations of the 1980s and
62 Overcoming international relations
‘From imperial power balancing to people’s wars: Searching for order in the
twentieth century’, a 1989 article written by Alker, Thomas Biersteker, and
Takashi Inoguchi, continues Alker’s interest in collective security practices, and
applies the same kind of thinking to analysing the development and relative
(in)validity of different modern models of peace and order. Alker, Biersteker
and Inoguchi (1989:135) limit themselves to an emphasis on three ‘partially
implemented, globally oriented security-seeking programs of the past century:
the Eurocentric balance of power, Wilsonian collective security, and international
socialist transformation through people’s wars’. The difference between the
analysis of ‘world orders’ and this analysis is that, while the former was also
concerned with the production modes that are also constitutive, in the latter
case only the organising principles are analysed.
The Eurocentric power-balancing system, which emerged in the eighteenth
century, became formalised in the nineteenth century and, in a modified
form, also constituted many cold war practices (cf. Patomäki 1992a:212–
14). The organisational principles of this model include the idea that states
are considered and treated as unitary actors, reciprocally recognising,
interacting with and differentiating among themselves. The system has
maintained itself by a mixture of force, the threat of the use of force, dynasty-
supporting marriages, alliances and diplomatic means. Further, unrestrained
power was seen as a threat to all other system members, hence the prescriptive
principles of action that collective power, through the use of alliances, should
be used against any state or bloc that became inordinately powerful
(‘balancing’ policies, in accordance with the central principle of astronomy,
mechanics and economics of the time). Both hegemonical aspirants and
their equilibrating antagonists were expected to stop fighting rather than
eliminate a core-state actor.
Alker, Biersteker and Inoguchi (1989) treat collective security simply as a
power-balancing alternative. They argue, quite provocatively, that ‘Wilsonian
collective security was not diametrically opposed to power-balancing, but rather
redefined and globalized an older Eurocentric power-balancing system’ (Alker
Learning from Alker 63
et al. 1989:145). The only difference was/is that any threat to peace was/is
assumed to be of basic concern to all members of the international society.
Consequently, aggression was/is outlawed by a ‘balance’ of ‘all against one’.
However,
However, this call for making the world homogeneous meant, in practice, that
war-avoiding doctrines and theories of collective security became coupled to war-
fighting practices and doctrines such as Wilson’s total war designed to end all wars.
Further, Wilson was not the only one who offered progressive, total alternatives
to traditional diplomatic practices; the proletarian internationalism of Vladimir
Lenin, Karl Kautsky, Rosa Luxemburg and other Marxists was at that time
Wilson’s major world political rival. While for Wilson ‘peace’ meant also the
continuation of the status quo, Lenin was advocating a major, world-wide socio-
economic change. Consequently, Leninism formed the background for those
from Mao and Lin Biao in China in the 1930s to the Sandinistas of the 1980s,
who have developed and propagated the conception of ‘people’s wars’. This
conception usually includes the belief that a future classless society will eventually
guarantee an eternal peace. Mobilising societal cross-class support to a—violent,
if necessary—revolutionary struggle for socialism would/should thus create the
basis for a lasting peace and order.
The analysis of Alker, Biersteker and Inoguchi has practical-political
implications. They demand more contextually sensitive and historically reflective
doctrines of collective security, ‘some less total, more realistic alternatives’ (Alker et
al. 1989:159). I would like to go further and suggest that we should also seek
alternatives that do not define or presuppose an eternal order that would amount, in
particular circumstances, to function as the status quo. Any status quo that would
be preserved with the help of Wilsonian ‘all against one’ powerbalancing policies—
even though this is perhaps possible only as long as somebody is able to orchestrate
and legitimate the actions of ‘all’—will eventually work against the basic underlying
principle, that is, peace. As will be argued in Chapter 8, contextual peaceful changes
should be made an institutionally guaranteed, always open possibility.
In any case, to the extent that one is willing to accept at least some of the
methodological notions that Alker has developed during his third phase, the
summary of these studies should illustrate how to theorise history in an
interpretative and heuristic manner. This concludes the fourth and final lesson
from Alker’s development. It is finally time to consider some limitations and
problems of Alker’s approach.
64 Overcoming international relations
Up to this point, my story about Alker could have been read as a simple fairy tale.
However, as Alker has argued, life is not a myth or a fairy tale with a guaranteed
happy ending, nor is it an inevitable tragedy. We need more complex, more
interesting stories. On the one hand, Alker has already been recognised, even if his
arguments have been rarely followed; he was the President of the ISA in 1992–3.
On the other hand, although it certainly may still be plausible to envisage Alker as
a lonely, courageous and insightful figure, heroic in his attempts to re-think the
methodological foundations of IR and peace research, his constructive solutions to
the lack of convincing scientific, moral and political results are by no means
sufficient. Also, Alker’s stand on ontological realism/idealism is ambiguous.
An obvious limitation of Alker’s approach is the fact that he has been, after all,
mostly concerned with the problems of peace, war and (in)security, the only major
exception being ecological concerns (see Alker and Haas 1993). Those doing
international/global political economy or normative political theory may not be
able to find as many interesting things in Alker’s texts. Furthermore, in his third
phase Alker has also strongly emphasised communication, discourses and
narratives, which for some may indicate that Alker is not able to analyse hard,
‘material facts’ of life. This emphasis of Alker’s—and some contradictory
evidence16—notwithstanding, I think it is misleading to presume that Alker has
taken the side of the idealists in the debate between ‘idealists’ and ‘materialists’.
Yet, this is precisely the interpretation of Christopher Chase-Dunn (1989:31).
Chase-Dunn claims that Alker has ‘a cybernetic conception of systemic structure
in which social information becomes coded in cultural and symbolic systems’.
This, he claims, presupposes that Alker has adopted an idealist philosophy. The
accusation is that Alker may have ignored the role of work as material activity,
and also the role of material destructive powers of military systems. Alker qualifies
his metaphor or analogy in a manner that should make it clear that he is not an
ontological reductionist (‘idealist’):
One can take from Roy Bhaskar’s brilliant book, Scientific Realism and Human
Emancipation, a very clear statement of the guiding motivation of such a newer
conception [of the reality of things]: emancipatory peace research
epistemologically directs its ‘learning from the data’ toward the emancipatory
‘uncoupling (of) the present from the causality of the past’, replacing
‘depotentialising (disempowering, oppressive)’ psychological, social, and
ecological structures by ‘potentialising (empowering, enhancing)’ ones.
Conceiving of emancipation as a ‘special qualitative kind of becoming free’
that consists in the self-directed ‘transformation…from an unwanted and unneeded
to a wanted and needed source of determination’, Bhaskar argues that it ‘is both
causally presaged and logically entailed by explanatory theory, but that it
can only be effected in practice’.
(Alker 1996:354)
This might open a path towards a general methodology of iconic modelling. Iconic
models are based on the dialectical/reductive way of reasoning; concerned with
the explicit reflection on the metaphors and analogies used in formal and non-
formal social scientific explanations alike; able to incorporate the temporal,
narratively structured dimension of social worlds in them; and leave room for
both explanatory emancipation and imaginary edification. Although all evidence-
based interpretations of texts and explanations of social reality, when understood
in terms of iconic modelling, can only be trans-subjective and periodic, they
must also put forward much more general truth-claims, to be truly open to
critical dialogues with others. These ideas will be explicated in Chapter 6.
Learning from Alker 67
Concluding remarks
Notes
1 However, Galtung too has revised his ontological and epistemological assumptions;
see Galtung 1977 and 1996. For a critical realist attempt to develop these ideas further,
see Patomäki 2001a.
68 Overcoming international relations
2 Whenever possible, I refer to this book instead of the original papers, since many of
Alker’s papers were either circulated as unpublished mimeos or have been published
in places that are not easily accessible.
3 Alker wrote (1976:45): ‘My first “way of seeing” the UN political process was heavily
influenced by Dahl-Lipset-Haas type pluralism. And it was grounded in my personal
commitments. World cleavages might not be so cumulatively destructive if they did
not reinforce each other.’
4 Journal of Peace Research rejected it when Alker submitted it there! Eventually it ended
up in Alker 1996, as Chapter 5.
5 In the words of Gadamer (1977:94), the truth of hermeneutics is that it ‘allows
what is foreign and what is one’s own to merge in a new form by defending the
point of the other even if it be opposed to one’s own point of view’. It should be
noted that in the case of me interpreting Alker there are no really opposing
viewpoints, rather just differences of opinion and emphasis within a rather broadly
defined perspective.
6 In a later paper, Alker (1984:165) even points out that the technical advance in the
statistical programs of computers has been a disservice to the scientific development:
‘…it has left many with the false impression that experimentally oriented analyses of
variance, sample-based cross-tabulations, and regression analyses of non-experimental
data are—or should be—the analytical logics at the core of “polimetrics”’.
7 Neufeld (1993a:69) is thus simply wrong in his claim that Alker and Biersteker (1984)
‘accept that contending paradigms in International Relations are incommensurable’.
8 In the more technical terminology of the game theory, one says that in the Prisoner’s
Dilemma game the outcome of the individually rational choices is Pareto inferior (or
is not Pareto optimal). This means that there is an outcome in which both (or all)
players simultaneously could do better (for, moving towards a Pareto optimal state, it
would be enough for there to be one player who could do better while all others
would do so as well).
9 Note that, for Alker, totalitarianism is not a unique feature of Soviet or Chinese socialism
or of the Nazisms and Fascisms of Continental Europe (or of Latin America, Africa
and Asia, for that matter). Rather, Alker (1981a:91) is very explicit in saying that
‘more or less recessive variants of totalitarianism can be found in the state forms of
contemporary China, the Soviet Union, Germany, Japan, and the United States’.
10 Furthermore, the contextual Lebenswelt is in any communication situation as the ever-
present and taken-for-granted background that is ‘always already’ there when we act;
hence the attributes ‘undistorted’ and ‘unconstrained’ should be viewed to be
hermeneutically relative ideals, not absolute principles.
11 The ‘crude’—although rather complex—model of information flows in the foreign-
policy decision making can be found in the Appendix of the Nerves of Government
(Deutsch 1963:258–61); the basic assumptions of this model are discussed
sympatethically but reconstructively (from a more reflectivist position) in Alker
1981b.
12 Perhaps the bottom line for those not understanding Alker’s methodological, political
and moral development would be the following: ‘Our research activity has also been,
in a way, a Faustian romance: our entertaining commitment has been to unify a
Promethean science (Artificial Intelligence or computational text analysis) and the
ineffabilities of self-serving and self-transcending love’ (Alker 1996:141). For an
articulation of a similar, (late-) modernist moral vision that recasts the relation between
our ultimate ideals of love and empowerment, see Unger 1984.
13 Fortunately, this has been done by Grunberg (1990). She argues, after having tried to
show that ‘the theory of hegemonic stability fails to meet reasonable criteria of empirical
accuracy and analytic consistency’, that there are ‘mechanisms of mythic interference
with theory-building’ and that this interference can explain the rhetorical appeal of
the neo-realist hegemonic stability theory (Grunberg 1990:433–4).
Learning from Alker 69
14 On the cover of the 1980 Ashley book, there is a brief introductory and evaluative
note by Alker: ‘If Choucri and North’s Nations in Conflict was arguably the best book in
the 1970’s produced by what Zinnes calls the Scientific Study of International Politics
(SSIP), Ashley’s major work may well secure that position in the 1980’s. At the same
time that it fulfills the promise of that school, it transcends its unreflective
epistemological narrowness, confronting and redefining the major political issues of
the day. Thus it confirms the frightening prospect that Choucri and North’s lateral
pressure perspective, developed in their analysis of the roots of World War I, also
applies to the current USA-USSR-PRC superpower competition. Changing modes
of powerbalancing are accounted for at the same time that Soviet “social imperialism”
is shown to be both imitative of earlier Western expansionism and contagiously
infecting the Chinese. Ecologically sensitive redefinitions of demographic-technological-
bureaucratic “growth”, not just clever power balancing realignments, are shown to be
prerequisities for a peaceful future.’
15 This is the name of the planned book or ‘an advanced text on theories of international
relations’, which, however, will probably never come out. Be that as it may, most of
the intended chapters have been published somewhere. In addition to Alker (1981a),
Alker and Biersteker (1984) and Alker, Biersteker and Inoguchi (1989), one should
mention at least Amin (1985).
16 In Alker (1986:2) it is stated that he shares ‘the ontological commitments of both
Plato and Leibniz’. There, Alker also talks about the ‘ideal reality of linguistic and
conceptual transformations and the ideal-material reality of social talk’. Talk is real
also in the critical realist understanding, as are linguistic and conceptual
transformations, but the ontological commitments of Plato and Leibniz are much less
easily reconcilable with realism.
3 How to tell better stories
about world politics
In the 1980s, there was nearly a consensus on at least one point, namely that
international relations (IR) contained three basic perspectives: (1) political realism;
(2) liberalism (seen also with varying emphasis as idealism, or pluralism or
economic liberalism); and (3) marxism (seen also, again with varying emphasis,
as structuralism or as a socialist model). Each of these holds a characteristic
view of international politics. In spite of a century of sustained effort, it is not
unfair to say that these perspectives continue to exist only in a rather speculative
outline. Also, as far as (1) and (2) are concerned, they mostly repeat the cognitive
structure of international problematic that was articulated by Hume and Kant,
and established in the nineteenth century.
Should we be happy with this ‘pluralism’ and continue the ‘great debates’ in
the field? Or is it possible to imagine the prospect that one of these rather
sketchy perspectives will eventually emerge as the accepted truth in this area of
analysis? In recent years there have been a decreasing number of self-proclaimed
Marxist scholars around, with the notable exception of the field of international
or global political economy.1 In the 1990s, after the short visit of Marxism in
the mainstream of IR, there has been, perhaps more than ever, a tendency to
reduce all problems of IR to an almost eternal dispute between political realism
and liberalism. Many aim at a synthesis of these two perspectives, a new mixed
perspective that could transcend some of the ‘great debates’ and contribute to
intellectual progress in the field.2 Postmodernists’ protests aside (cf. Ashley and
Walker 1990; George 1989), should we, as IR scholars, have this kind of
consensus as the ultimate telos of our research?
It is true, as Rob Walker (1987, 1989) has argued, that the IR perspectives
are also historical sites for a number of theoretical and metaphysical disputes.
These (meta)theoretical debates have, among other things, been about the
relationships between history and theory or history and structure (Buzan and
Little 1994; Schroeder 1994). IR scholars have discussed, for instance, whether
we should prefer the classical approach, emphasising history and conceptual
analysis, or the more strictly scientific approach, seeking to find behavioural
invariances by strictly controlled deductive or statistical means. Despite the
apparent depth of some of these conversations, mostly they have contributed
merely to the reproduction of the international problematic.
70
Better stories about world politics 71
we can best avoid confusion if we reserve levels of analysis talk for questions
about what drives the behaviour of exogenously given actors, and agent-
structure talk for questions about what constitutes the properties of those
actors in the first place.
(Wendt 1992b:185)
Wendt claims that to construct system-level theories, one must take into account
how actors constitute themselves in interaction (cf. also Wendt 1992a). Thus,
Wendt wants to restrict the applicability of ‘levels of analysis talk’. Wendt assumes
that the actors in the international system are sovereign states. For Hollis and
72 Overcoming international relations
Smith, however, both system and state ‘levels involve questions about the nature
of agency’ (Hollis and Smith 1992:188). Indeed, at any given level of analysis,
the question of ‘what it means to be an actor’ emerges. On these grounds, they
argue that Wendt’s distinction between explaining the behaviour of the units,
on the one hand, and what, on the other, constitutes their properties, is essentially
empty. They use these points first and foremost to defend the general applicability
of the traditional levels-of-analysis thinking in IR.
After these two rounds, Carlsnaes and Smith (1994) continued the debate.
Whereas Smith (1994) tends to repeat his (and Hollis’s) earlier claims, Carlsnaes
(1994:277–87) defends Wendt’s original contention that we need to take into
account both inside (‘understanding’) and outside (‘explanation’) factors in our
analyses. Carlsnaes attacks the either/or character of the two-stories view by
distinguishing between philosophical theories of the relative weight of agential
and structural factors and empirical studies of the linkage between the two. He
combines these points in his call to the discipline ‘to integrate the explanatory
and understanding traditions within social science as well as to resolve the agency-
structure problem tout court’ (Carlsnaes 1994:282). Carlsnaes’s own contribution
to this goal is found in his argument that we should distinguish between three
explanatory levels: accounts of the ‘in order to’ intentions of foreign-policy actions;
explanations of why actors have these particular purposes (in terms of ‘because-
of accounts of the dispositions of actors); and explanations of how the structural
environment of actors (both domestic and international) affects the two levels.
He argues that if we view intentions, dispositions and external structures as a
reciprocal process of agency-structure interaction over time, we should be able
to build more adequate models for the analysis of foreign-policy behaviour (he
has explicated his own suggestions in Carlsnaes 1992 and 1993).
In what follows I shall continue and re-direct this debate. First, in line with
what Hollis and Smith as well as Wendt have indicated, I maintain that questions
concerning the existence of social entities and causal explanation are, indeed,
very closely related. However, I will go on to argue that it follows from the
critical realist notions of causality and social ontology, which I discuss tentatively
in this chapter, that the levels-of-analysis problematic cannot be plausibly
defended. It obscures more than it clarifies. Even though it is natural for a
critical scientific realist to think about levels in ontological terms, I contend that
‘international’, ‘inside-of-a-nation-state’ and ‘individuals’ are not levels in any of
the usual senses. Rather, we should talk about different kinds of interpenetrated,
relational contexts and look for depth in other directions.
Second, although Wendt states in his reply that he is now persuaded by
Hollis and Smith about the relationship between causal and interpretative
analysis, I shall defend and radicalise Wendt’s original position;
understanding and explanation are complementary and interdependent modes
of analysis. In fact, it is quite easy to show how explanation and understanding
are inter-related. Hermeneutic understanding is necessary for causal analysis,
because reasons are causes, yet that kind of analysis is not in itself enough.
Explanation presupposes holistic depth analysis of causes, including structural
Better stories about world politics 73
In his review essay, Wendt (1991:387) criticises Hollis and Smith for ‘conflating two
distinct problems: the levels of analysis problem and the agent-structure problem’.
Wendt goes on to claim that the levels-of-analysis problem ‘is a problem of
explanation: of assessing the relative importance of causal factors at different levels of
aggregation in explaining the behaviour of a given unit of analysis’ (Wendt
1991:387). Both in David Singer’s original formulation (1961) and in Wendt’s
reformulation of it, the given unit of analysis, as well as the ‘dependent variable’, is
the behaviour of the nation state—in particular its foreign policy. The explanatory
variables can be located at the level of ‘system’, ‘states’ or ‘individuals’, or viewed as
a combination of them. Wendt (1991:388) claims that Hollis and Smith have
changed the issue by taking the phenomenon to be explained to be either the
behaviour of the state actors or the behaviour of the international system, or even the
system or units themselves. By stipulating that the international system is real and
perhaps also able to behave, Hollis and Smith are effecting a shift from the original
behaviouralist research focus towards an ontology, towards the problem of emergent
collective properties and their explanation. Hollis and Smith reply as follows:
Briefly an explanation of state behaviour may or may not explain the state
away, depending on where one locates its sources—a theoretical as well as
empirical question involving ontology. But a general ontology is never handed
to the theorist on a plate, so to speak, and its justification is connected, although
indirectly, to explanatory merits. The level-of-analysis problem in international
relations cannot be tackled in isolation from the view taken of what is meant
by international system, and that is an ontological question.
(Hollis and Smith 1991:395)
What Hollis and Smith seem to be arguing here is that any attempt to explain
something in sworld politics necessarily implicates hypotheses concerning the
existence and nature of certain social entities, particularly the international system
and different ‘units’ (states, bureaucracies, individuals, etc.). Hollis and Smith are
not claiming that one cannot make different ontological assumptions in constructing
explanations at a given level-of-analysis, only that all explanations involve some
existential (or ontological) assumptions. This is a very important point.
74 Overcoming international relations
What then is Wendt’s complaint? For one, Wendt may think that Hollis and
Smith are in fact making unacceptable ontological or existential assumptions. Does
the international system behave? If it does not, perhaps it then has characteristics
that can be explained? But can we explain these properties of the international
system merely in systemic terms? Although a Waltzian would maintain a clear
separation between theories, respectively, of international politics and foreign policy,
the ‘dependent variables’, such as stability and coalition formations, seem to be
outcomes of foreign-policy making. As Wendt (1999:12) later explained, it ‘is
impossible for structures to have effects apart from the attributes and interactions of
agents’. For instance, ‘stability’ is often conceived as the opposite of violent conflicts
and war. Even in a step-by-step escalatory process, some foreign-policy actors must
decide and wage war. To avoid this conclusion, Waltz (1979:161–2) later qualified
his views by arguing that international stability means only two things, namely that
‘anarchy’ (=non-hierarchy) prevails and that the number of principal parties that
constitute the ‘system’ remains unchanged. This move makes the Waltzian ‘theory
of international polities’ either trivial or absurd. Besides coalitions, Waltz’s main
explanandum is stability. Waltz’s theory tries to predict the number of great powers
by referring, tautologically, to the number of poles in a system of power balancing.
I consider this to be the reductio ad absurdum of Waltz’s neo-realism. It also constitutes
a prima-facie case for Wendt’s contention that we should focus on foreign-policy
actions and their (perhaps constantly reproduced) outcomes, including war, peace,
state relations and regimes of co-operation.
For Wendt, then, what IR perspectives typically try to explain are the foreign
policies of states and their consequences. This is the case even though he takes
a ‘systemic view’, that is, he treats the identities and interests of states as analogical
to those of unitary, yet socially constructed, individuals who are continuously
reconstructing each other in the course of their systematically structured
interaction (Wendt 1992a; cf. Wendt 1994). Hence, given Wendt’s anti-Waltzian
ontological presuppositions and his ‘systemic’ theoretical aims, his complaint
might be simply that Singer’s original formulation of the levels-of-analysis
problem is superior to that of Hollis and Smith. This is due to the more plausible
existential hypotheses and a more clearly formulated research problem.
Yet there is something more to Wendt’s criticism, even though he does not
go very far in articulating it. Wendt (1991:388) might well be saying that he
prefers the agency-structure problematic to that of the level-of-analysis
problematic. If so, he should develop his case further. In particular, it is my
claim that, as a critical realist,4 Wendt should suspect that there is something
wrong with the whole levels-of-analysis problematic, since it is based on a
positivist (or empirical realist)5 account of science. It is not surprising that Hollis
and Smith adopt the standard positivist account of explanation and causality.6
statement conceals much dispute about the exact relation between a cause
and its effect, about the right way to define ‘cause’, and about the nature of
causality, both as a concept and in the world. So what follows is very
preliminary. But the broad idea is that events are governed by laws of nature
which apply whenever similar events occur in similar conditions. Science
progresses by learning which similarities are the key to which sequences.
(Hollis and Smith 1991:3)
In other words, Hollis and Smith agree with Singer that explanation is about
finding law-like regularities at work, and hence that explanation follows the so-
called deductive-nomological model of causal explanation.7 Hollis and Smith are
also of the opinion that explanatory theories should, above all, be parsimonious.
They criticise David Dessler for claiming that ‘the richer and more comprehensive
the underlying ontology, the better the theory’ (Dessler 1989:446). In this they
follow positivism, which has always emphasised simplicity. As Singer puts it:
Our model must have such analytical capabilities as to treat the causal
relationships in a fashion which is not only valid and thorough, but
parsimonious; this latter requirement is often overlooked, yet its implications
for research strategy are not inconsequential. It should be asserted here that
the primary purpose of theory is to explain, and when descriptive and
explanatory requirements are in conflict, the latter ought to be given priority,
even at the cost of some representational inaccuracy.
(Singer 1961:79)
These presuppositions explain why Hollis and Smith are willing to define the field
of IR in terms of the levels-of-analysis problematic. As far as the outsider’s story is
concerned, they think—with Singer—that the primary purpose of theory is to explain
phenomena in accordance with the deductive-nomological model, and in as simple
a manner as possible. They also seem to contend that this kind of explanation
amounts to assessing the relative causal importance of different levels of analysis.
Hollis and Smith do not question that empirical invariances are necessary
for causality. They should. Like in other social sciences, there are very few, if
any, uncontested invariances in IR.8 The fact that mainstream scholars go about
their business as if this were the case is no argument. As Friedrich Kratochwil
(1993:68) has pointed out, ‘activities in the profession are more often driven by
methodological fads and clientalism than by problems and sustained attention
to a research programme’. Moreover, these fads tend to remain within the highly
repetitious cognitive structure of the international problematic. The absence of
universal regularities does not, however, mean that there is no causality. Perhaps
there is something wrong with the prevalent notion of causality? Perhaps we
should qualify the Humean causality by making it more complex and contextual.
Instead of simple universal regularities, perhaps we should be searching for
more conditional regularities?
76 Overcoming international relations
Even after the end of the Cold War, some neo-realists have opted for the stubborn
Humean simplicity (cf. Mearsheimer 1990; for the criticism that neo-realism cannot
explain or even describe the end of the Cold War, see Gaddis 1992; Patomäki 1992a).
However, we know that neither the Millian (‘interactionism’) nor the Comtean
(‘reductionism’) strategy has been very successful during the past century and a half.
To import complexity and contextuality into IR is certainly a better solution than
sticking to the simplicities of failed Humean theories of the past. On the other hand,
after all these years of unsuccessful attempts of finding invariances, perhaps there is
room for more radical solutions? Perhaps empirical invariances and causality are two
totally distinct phenomena? Even more importantly, perhaps there are conceptions of
causality that explain this absence of universal empirical regularities?
It can be easily shown that the idea that causality involves empirical invariances
presupposes closed systems. Constant conjunctions of events obtain only in the absence
of outside intervention and qualitative variation (see Bhaskar 1979:160). Where
a genuine, non-changing causal mechanism has been isolated, can we say that ‘every
time {A, B, C…} occurs, X follows’. However, entities do change and systems are
more or less open except when carefully set up experimental conditions obtain.
In fact, ‘if we retain a Humean definition of causation as regular succession, we
will discover no causal laws outside astronomy, where the incapacity of other
mechanisms to deflect heavenly bodies from their courses approximates to a
natural closure’ (Collier 1994:34). In open systems, probabilistic empirical
invariances that seem to connect phenomena are often causally unrelated. This is
the well-known post hoc ergo propter hoc—fallacy.
A non-Humean, critical realist conception of causality would take as its
point of departure the idea that real causal mechanisms and complexes produce
effects in open systems. From this perspective, it is wrong to assume that causality
can be analysed in terms of simply necessary or sufficient conditions, yet this
is presupposed if one talks about empirical invariances. Causality should be
understood not only in terms of complexes, but also in terms of the subject
matter in question. The modified definition of cause as a so-called INUS-
condition (see Bhaskar 1979:165, 207 n. 23; Mackie 1976) is a leap forward
According to this definition, cause is an Insufficient but Non-redundant
element of a complex that is itself Unnecessary but Sufficient for the production
Better stories about world politics 77
Clearly the experts are not saying that the short-circuit was a necessary
condition for this house’s catching fire at this time; they know perfectly well
that a short-circuit somewhere else, or the overturning of a lighted oil stove,
or any number of other things might, if it had occurred, have set the house
on fire. Equally, they are not saying that the short-circuit was a sufficient
condition for this house’s catching fire; for if the short-circuit had occurred,
but there had been no inflammable material nearby, the fire would not have
broken out, and even given both the short-circuit and the inflammable
material, the fire would not have occurred if, say, there had been an efficient
automatic sprinkler at just the right spot.
(Mackie 1976:308)
In other words, they are saying that there were a number of conditions that were
themselves unnecessary but sufficient for the short circuit to produce the fire,
given certain natural mechanisms and their causal powers. I suggest that causality
always works like this. However, we should also make specifications in terms of
the subject matter in question. We need social ontology to specify the major features
of being we are interested in. Social ontology is an abstract model—or articulation—
of social being, not a metaphysical point of departure. Any social ontology is itself
a horizon, a hermeneutic standpoint that enables and limits the possibilities of
interpretation, and should thus itself be open to change. Richard Rorty (1991:38)
has rightly criticised the ‘metaphysical thinker’ who ‘thinks that if you can get the
right understanding of Being—the one that gets Being right—then you are home’.
In contrast to the claims of traditional metaphysicians, there is no safe ‘permanent
home’, just home-like stopping points. Social ontology can be criticised both on
the grounds of philosophical arguments showing the conditions for the possibility
of X and/or in terms of explanatory and other (dis)merits of different ontological
assumptions and existential hypotheses.
Critical realism is based on a combination of ontological realism,
epistemological relativism and judgemental rationalism. In the critical realist
view, social relations located in time and space form the relatively intransitive
dimension of the social sciences that, even in the case of research on the
contemporary era, to a large extent exist independently of the particular
knowledge that particular researchers may have of them. Discovery and cognitive
or discursive development, which include ontological theories and hypotheses,
belong to the epistemological dimension, which is relativistic and transitive.
Finally, the idea of judgemental rationality implies that it is possible to give
good, even if contextsensitive, temporal and contestable, reasons for their validity
of a judgement about truth (and truthfulness, rightfulness, good, etc.).
78 Overcoming international relations
Thus far I have made an argument for a realist conception of science. In such
conceptions, the metaphor of ‘level’ is widely used. This might suggest that
there is room, after all, for levels in such an approach to IR.11 However, levels
in these realist senses cannot be used to defend IR levels of analysis. There are,
it is true, emergent levels and society is such an emergent layer. We can also
talk about levels and depth within the emergent social layer. However, depth in
these senses has little to do with the ‘level-of-analysis’ problematic in IR theory.
Real emergent powers and properties do not seem to be pre-organised neatly in
accordance with the territorial logic of states.
With reference to depth we can, first, talk about temporal depth, in a sense of
the emergence and sedimentation of practices and social relations, which, if
systematically connected, may give rise to emergent powers and properties.
The genealogy of the international problematic, in Chapter 1, is a partial analysis
of such emergence, focusing on the preconceptions of practices and the
consequent problematic, but also discussing the emergent powers and properties
of capitalist market society and modern states. Second, there is the stratification
of human agency and action (see Giddens 1979:54; Bhaskar 1986:164). This
will be discussed in more detail in Chapter 6. Suffice it to say here that
(unconscious) motivation of, and real reasons for, action may be distorted in
self-understanding and public discourses, giving rise to various, sedimented
and institutionalised rationalisations. These distortions may be connected to
(systematic) unintended consequences of actions and practices.
Like motivation and reasons, knowledge in general is stratified. A distinction
can here be made between analytically and qualitatively layered depth, defined
as follows (see Bunge 1963:39–44).
Among possible levels of social realities are layers of practical and discursive
knowledge, and the stratification of agency and action. There are also other
realist conceptions of levels, some of them verging on positivism. These have
been summarised by Mario Bunge (1960; 1963: Ch. 3). Levels can be conceived
in terms of degrees, as when one talks about degrees of population density,
degrees of integration, adaptation, learning or orientation, or degrees of
importance. In this sense, IR levels do not have any meaning, and, in any case,
it would here be more appropriate to use the term ‘degree’ rather than ‘level’.
Levels can also be conceived with reference to complexity. Prima facie, it is plausible
to assume that international or world politics is more complex than national or
local politics. However, as the emphasis on the importance of simplicity and
parsimoniousness shows, this is not what Waltz or Singer had in mind when they
introduced this problematic; Waltz in fact takes the requirement of parsimony and
simplicity to its extremes. Thus, although it may be reasonable to talk about degrees
of complexity and even call them ‘levels’ (although I prefer not to do so), this is
something that goes radically against the underlying tenets of Singer, Waltz and
their followers. If it is complexity that is referred to, then the quest for accuracy and
depth should imply pursuit for highly complex iconic models of these realities.
Levels are often conceived of as layers, i.e. as superposed strata arranged
according to the order of their emergence in time, or in terms of their logical
precedence. A layer or stratum is a section of reality characterised by emergent
qualities. This is the essence of a common realist meaning of level. However,
‘international system’, as distinct from states, is not a layer in this sense. Modern
sovereign states and the European institutions of international (or inter-state)
society emerged at the same time. “‘International relations” is coeval with the
origins of nation-states’ (Giddens 1987b:4). Nation-states and IR are internally—
even if ambiguously—related to each other (see Bull 1977:36–7; Camilleri 1990;
Giddens 1987:257–8, 281–2; Koskenniemi 1989:55–98, 192–3). Neither would
be what it is without standing in relationship to the other. Sovereign states and
international relations emerged from the complex re-organisation of political
space in Europe. IR did not emerge from a lower ‘stratum’ of pre-existing
Better stories about world politics 83
sovereign states but by way of conceptual and practical innovations, and this
made it possible to re-organise social relations and practices in seventeenth- and
eighteenth-century Europe in terms of sovereign states and diplomatic relations
between them. Sovereign states and IR were forged from the pre- and co-existing
materials. The conceptual innovations were built on the emergent conception
of capitalist man and metaphors drawn from the new sciences. Both the state
and IR were conceived in terms of the modern problem of order, which appeared
to have been intensified by the emergent powers of modern organisations.
It is possible to conceive of levels also as ranks. Ranks belong to asymmetric
dependence relations, that is, to hierarchies. Staircase pyramids exemplify such
hierarchies. States—even allegedly democratic states—are often conceived in terms
of hierarchy. Thus, the head of a state or a foreign minister is ‘above the ordinary
citizens or (wo)men on the street’. Consequently, perhaps we could talk about
two levels at least in this context (and add some other levels in between as
well). However, what about ‘international’? For a Waltzian, at least, this kind of
conceptualisation of levels would create a dilemma. It is, after all, the absence
of hierarchies that makes the international system anarchic for him. How could
anything be in this sense above states? If we, by analogy, extend the notion of
‘rank’ to include causal relations (the higher ‘levels as ranks’ causing the features
and actions of the lower ‘levels’), we would, first of all, dilute the meaning of a
‘rank’ considerably, and second, re-open the question as to what it is that produces
these causal effects. This question can be phrased in terms of the legitimate
realist question about emergent powers and properties of social systems.
In the case of an ideal-typical hierarchical organisation, it may seem obvious
what or who is responsible for the effects: the top-rank executives, officials or
managers. Waltz, for instance, can indeed be read as saying that (the leaders of)
great powers are such members of the hierarchy within the international system.
However, this would imply the admission that there is a formalised legal hierarchy
within the international system and that the international system is hence state-
like, not an anarchy. Thus, this understanding of IR levels would contradict the
basic thrust of the Waltz-Singer argument. I think, therefore, that any study of
world-wide asymmetric power and dependency relations would require an
ontology that is drastically different from that of Waltz.
Of all realist notions of levels, only two would seem possibly relevant to a
reconceptualisation of something akin to IR levels, namely hierarchical power
relations and the emergent powers and properties of systems. Social systems
have no essences independent of the contexts of action; they change together
with those relational contexts. Social systems can, nonetheless, have emergent
powers and properties. They must be analysable in terms of actions and
positioned practices occurring in concrete spatial settings. For instance, since
the seventeenth century, the rules and principles of international society have
defined, with increased depth and scope, not only the nature and rights of
sovereignty but also enabled capitalist world markets, through universal private
property rights and the ‘stability of possessions’ (cf. Bull 1977:5). It included
also the creation of labour markets and industrial organisations, at first in
84 Overcoming international relations
The IR levels would preclude open mindedly studying levels in these two possible
senses. As will be explained in more theoretical terms in Chapter 4, social
contexts and systems are not only open but are also, among other things,
overlapping and inter-penetrated in time and space. In parts, they can also be
intra-related, i.e. there are essential relations connecting parts of different systems
or wholes. Consider, for instance, the example of the Bretton Woods institutions
located in Washington DC, next door to the White House, linking, in the early
2000s, the US government and the treasuries and central banks of most states
on the planet, also through the discursive practices of orthodox economics. The
US government may, in parts, be essentially intra-related to the practices and
actors of Wall Street. And so on. Emergent powers and properties should not
be organised, in one’s mystifying imagination, to accord with the simple IR
levels just to yield the convenient possibility of continuing to reproduce the
international problematic. They should be analysed as they really are.
In sum, the scientific realist reconstruction of the traditional IR levels seems a
non-starter. Even the conventional practices of international society do not follow,
in terms of spatial settings of their (re)production, the neat territorial logic of horizontal
inside/outside distinction that is so often transformed into, and then reified as, the
vertical level-of-analysis problematic. Many, if not most, relevant components of
contemporary causal complexes may not be easily locatable in territorial statist terms.
The vertical metaphor of ‘above’ hardly adds to the explanatory power of the
territorial ‘outside’. More concrete and specific claims have to be made.
Responding to critics
Throwing out the language does not necessarily clear things up. In place of
levels, Patomäki would have us ‘talk about different kinds of interpenetrated
contexts’. If contexts penetrate each other, it is hard to know what makes
them different in kind. To say that ‘we are faced with qualitatively different
kinds of contexts of social action’ is hardly a clarification.
(Onuf 1998:195)
86 Overcoming international relations
Onuf wants to ‘clear things up’. However, why should we restrain ourselves to
a mere clarification of the meaning of ‘IR levels’? If these levels are ontologically
and explanatorily unsound, should we not criticise the prevailing practices of
organising the field in terms of them? Onuf’s argument that it is difficult to
identify social contexts is no less shallow Contexts can be identified if social
practices and relations can be identified and their relevance for a given social
agency and action discerned. I am sure Onuf is not claiming that relations and
practices are unidentifiable, or that their relevance to agency and action cannot
be determined.12 What is his point then? Revealingly, it seems to be about
respect for conventions. Onuf (1998:218) defends IR levels in terms of
convenience of ‘treating the primary units [states] as unproblematic’. This verges
on the dogmatic. The closure is completed by ‘in our culture, as in the field of
International Relations, we would have difficulty getting along without the
language of levels’ (Onuf 1998:218).
Wendt published his magnum opus Social Theory of International Politics in 1999.
Wendt applies scientific realism to defend and articulate a sociological version of
state-systems theory. On prudential grounds, Wendt ignores arguments against the
levels of analysis and the related dichotomy of inside/outside. Wendt (1999:13)
argues that systems theory required that ‘domestic or unit and systemic levels can be
separated’.
Wendt does not deny the force of these arguments. He ignores them. The legal
organisation of political authority distinguishes between the domestic and
international spheres. This is partly due ‘to the international institution of state
sovereignty, in which states recognize each other as having exclusive political
authority within separate territories’ (Wendt 1999:13). The inside rules are
thus different from the outside rules. Wendt thinks that this opens up the
possibility for a states system theory. The purpose of the states system theory is
to explain why and how states, conceived as analogical to persons, interact in a
systemic setting, and with what outcomes.
Although aware of the fact that ‘in not problematizing something we are
temporarily naturalizing or reifying it’ (Wendt 1999:89), Wendt nonetheless
points out that if we want to find out how the states system works, ‘we will
have to take the existence of states as given’ (Wendt 1999:246). This is the crux
of the matter. Not unlike Onuf, Wendt appeals to the convenience of taking the
states as given, in order to continue with the business of international theorising.
Better stories about world politics 87
Reasons for a particular action in a given context are the causes of the
meaningful activities of the agent in question, leading in turn to various outcomes.
Another way of putting this is that the agent’s reasons are a necessary condition
for the particular bodily movements—including those needed for speaking and
writing—that occur in purposive action, in the straightforward sense that, had
the agent not possessed them, these actions would not have occurred in the first
place (unless over-determined). According to Harré and Secord (1972:86–7),
human powers of this kind include the following: the power to initiate change;
the capacity of being aware of things beyond oneself and knowing what they
are; and the power of speech. Furthermore, agents must know what they are
about to do, what they are doing and what they have done, in the sense of
being capable of describing their actions. Without these kinds of causal powers
of embodied social actors, there can be no social practices and institutions.
This solution seems so obvious that it may well give rise to immediate scepticism.
In Finnish, for instance, there is a word (syy) that refers to both causes and reasons
without distinguishing between the two. Can a scientific ontology be grounded in
such a commonsense notion? Although I cannot enter into a debate on this issue
here (but see Bhaskar 1979:102–46; Harré and Secord 1973), let me just very
briefly discuss some of the objections usually raised against this claim. In addition
to the obsolete behaviouralist insistence on the relevance of the concepts of stimulus
and response,16 there are basically two kinds of objections of relevance here, the
one philosophical and the other logical. The philosophical objection is that, if we
conceive of reasons as causes, we are either postulating temporally later state of
affairs X as teleological causes for an earlier Y, or we are committing a vicious
regress argument of a sort that ‘the causes for reasons R must be other reasons
R1, which in turn are caused by R2 etc., ad infinitum’. However, as Bhaskar has
noted, teleological explanations can be ‘most naturally construed as a species of
ordinary causal explanation, where the cause is, in the case of reasons, some
antecedent state of mind’ (Bhaskar 1979:107). Similarly, the question of whether
the reason R was the cause of A does not depend on an account, explication or
explanation of the process through which reason R has been (socially) generated
and constituted. Bhaskar thus notes the following:
This does not, of course, mean that actors do not have reasons for their political
identities—on the contrary. It simply means that a commitment to social
democracy (seen as a state or disposition) can, in certain circumstances, in itself
Better stories about world politics 89
be a sufficient reason for voting for the SDP. In this example, a causal explanation
of a particular voting act is given in terms of this sufficient reason R, that is, in
terms of a commitment to social democracy. Naturally, the commitment itself
can also be explained, in which case we would have to move from a surface
account to a deeper account.
Finally, there is the logical connection argument, according to which (1)
causes must be logically distinct from effects, yet (2) reasons are logically
(conceptually, internally) connected to the actions they explain. Hence, it is
claimed, reasons cannot be the causes of actions (see Winch 1971 [1958]; von
Wright 1971). Although this tradition of (analytical) hermeneutics is right in
emphasising the existence of conceptual and internal relations in social worlds,
the problem is its total denial of causality in these worlds. The consequence of
this denial is the Kantian dualism between the causally deterministic natural
world, conceived in Humean terms, and the world of human (and perhaps
superhuman and divine) reason. This dualism makes it impossible, for instance,
to grasp the effects of human reason (through technology etc.) on nature or, vice
versa, the effects of the bodily mechanisms on human reasons or reasoning.
Moreover, logic connects statements, not events, actions and the like. Even if
there exists an internal connection between the statements ‘causing x’ and ‘x’,
sunburn is not a logical consequence of the sentence ‘prolonged exposure to the
sun results in sunburn’ (Bhaskar 1979:109–10). Statements do not cause sunburn.
In an analogous manner, neither thinking, talking or writing the statement
‘switching the computer off’, nor the sentence in itself, can switch off the
computer. A corporeal and reasoning human being has to do something for this
effect to occur. In other words, his or her reasons for doing something must be
causally efficient. This is the ground for being able to distinguish between an
agent’s knowledge about what he or she is about to do, what he or she is
actually doing, and what he or she has done, i.e. distinguishing between an
agent’s own understanding and what he or she is actually, in a causally efficacious
manner, doing.
Let us at this point return to the claim made by Hollis and Smith, namely
that the ‘understanding’ and ‘explanation’ cannot be synthesised into one
approach, and hence that one must perforce choose between the two. However,
reasons are causes, that is, non-redundant and existentially necessary elements
of any causal complex. The first stage in any explanation is thus to find these
causally efficient reasons. Regardless, what is the relationship between the reasons
and intentions of a person and inter-subjective meaning? In trying to give an
answer to this query, Habermas’s theory of understanding is, in my view, of
great help.
In order to understand something, Habermas (1987 [1981]:18–19) writes,
one has to make the assumption that there are inter-subjective standards in the light
of which human actors can decide whether they are following appropriate
discursive or practical rules in their interaction with others. Every (speech) act
presupposes a number of rules. Habermas (1984:112–13) notes that the social
scientist cannot treat human utterances as mere facts. He or she must be able
90 Overcoming international relations
In other words, language mediates the constitution of social ‘data’ or ‘facts’ in a double
way—as the linguistic and theoretical framework of the social scientist, and as the
language of acting subject-objects in their constitutive understanding of their
institutions and situations (Apel 1984:75–6). There is also interaction between these
two spheres, whether weak and indirect or strong and, perhaps, immediate. Thus,
meanings cannot be treated as mere facts in the positivist sense of the term, although
they are constitutive of the causal complexes capable of producing outcomes. The basic
thrust of my argument is, thus, that it is always necessary to understand the meanings
that actors attribute to their actions and practices if we are to acquire knowledge of the
causal complexes of social worlds. The consequence of this is to undermine the claim
that ‘sometimes one type of account makes more sense; sometimes another’. Both are
needed, always.
Better stories about world politics 91
In his 1987 article, Wendt (1987:362) states that explanations are basically
answers to certain kinds of questions. Hence, the adequacy of an explanation
depends on the question posed. Wendt goes on to distinguish between two
types of questions: ‘How is action X possible?’ and ‘What made X actual rather
than Y?’ For him, structural analysis explains the ‘possible’, while historical
analysis explains the ‘actual’. Hollis and Smith are critical of this distinction,
using the Gulf War as an example:
Suppose one sets about explaining the occurrence of the 1991 Gulf War in
this spirit. How was it possible for war to be declared? This question might
invite an account of the rules which constitute the conditions needing to be
satisfied for it to be true that a state of war exists; or it might invite an
account of the recent history of the United Nations and the Middle East—
historical conditions which made this war possible; or some mixture of both.
These are very different sorts of account, as is clear when we enquire how
exactly the mixture is to blend theoretical analysis with historical trends. To
be brief, Wendt’s ‘how’? question encapsulates several running disputes about
explaining or understanding how a war could happen.
(Hollis and Smith 1991:406)
This criticism is convincing only in so far as one has accepted the criterion of parsimony
and the quest for deductive-nomological explanations. I hope that I have been able to
give some idea of the complexities involved in any geohistorical explanation, and thus
to give some plausibility to the alternative idea that the richness and depth of an
explanation is of higher value than its parsimonious nature. Consequently, it is not a
problem to pose different ‘how’ questions in one and the same study.
Indeed, we can distinguish between different ‘How is action X possible?’
questions. When we are concerned with the identity of X, we need an account
of relevant constitutive rules. This analysis freezes, so to speak, the social world
in question. Then we can move towards the world of historical, causally effective
interaction and ask: What are the INUS-components which made that action
possible? At this level we are analysing processual complexes in open systems,
and we cannot find general necessary conditions of X. Instead, we are interested
in the actual constellation of conditions that made X contingently possible. Other
constellations could have made it contingently possible as well. After this we
can ask: What made X actual rather than Y? When posing this question, we
are interested in reasons, justifications and the like, organised as discourses, as
well as in the interactive actions of various actors. Finally, we can pose this
genealogical query: How was X’s identity and the relevant constitutive rules
produced in the course of historical interaction?
This tentative list of questions is a plea for complexity and nuances, and for
patient, reflexively interpretative data-collection and model-building. For some
92 Overcoming international relations
it may sound like the more or less well-established practice of historical research.
However, it is much more than that. It is a plea for admitting the plurality of
possible interpretations and opening up the possibility for a genuine dialogue.
This plea is coupled with an appeal for causal in-depth explanations in terms of
relational complexes as they really are. It is not enough to stay at the level of
prima-facie reasons for actions; we must also analyse the conceptual and causal
relations of social action in their inexhaustible complexity, fullness and depth.
By systematic and iconic modelling it is possible to form pictures of the wholes
in question and to give deeper explanations by answering the questions ‘How
possible? and ‘Why actual’? Aspects of models can be generalised, given that
certain (deep) structures and relational mechanisms and/or complexes can be
found to be causally efficacious across many open-systematic contexts.
The prospect of avoiding reification opens up entirely novel possibilities concerning
the aims of story-telling about foreign policies and, much more generally, world politics.
It makes emancipation and edification possible. Interpretative in-depth explanations
try to make phenomena more understandable and intelligible. They have to enter the
processes of assessing truth-claims that take place within social practices. By virtue of
being holistic and aiming at explanatory power, these interpretations, based also on
truth-judgements, can be critical of actors’ understandings. Social relations that are
shown to render, in some important senses, anomalistic, false, reificatory, illusory and/
or superficial discursive formations should come to be criticised in being explained.
Novel interpretations of the social worlds and their inherent possibilities can
contribute to the process of re-signification of practices and thereby make possible
better, that is, more empowered, more ethical and more virtuous ways of being and
action. To enable this, we have to go beyond the ‘great’ debates of the international
problematic. It is time to transcend these limits by getting rid of at least parts of the
conceptual and metaphysical ballast weighing down the field. Instead of an eternal
debate or attempt at a synthesis between the sketchy perspectives called political
realism and idealism—amounting to the mere reproduction of the international
problematic—we can enable the possibility of genuinely novel interpretative possibilities.
By claiming that reasons are causes, and that, more generally, causality can be
understood in terms of INUS-components in open (and intra- and inter-related)
social complexes and systems, I have tried to open up a space for the study of
world politics in a more emancipatory and edificatory manner. Perhaps it is now
possible to overcome some of the limits of the contemporary political imagination,
as well as to improve on the way scientific historical accounts approach value
questions, structural constraints and human choice possibilities. However, first we
have to articulate the CR ontology and research methodology in more detail.
Notes
1 In standard textbooks (e.g. Gill and Law 1988; Gilpin 1987), Marxism is still taken
seriously as the third perspective; and (post-)Gramscian historical materialism (e.g. Cox
1987; Gill 1993) is often conceived as the nearly hegemonic movement within IPE.
Better stories about world politics 93
2 Wæver (1994a:257) has been careful to acknowledge that the debate has changed:
‘During the 1980’s, realism became neorealism and liberalism became neo-liberal
institutionalism. Both underwent a self-limiting redefinition towards an anti-
philosophical, theoretical minimalism, and they became thereby increasingly
compatible. A dominant neo-neo synthesis became thereby the research programme.’
This interpretation not only confirms my point about the state of the art but also
nicely illustrates the phase of Western cultural development within which the adjective
new is used rather liberally. As Honnef (1994:14) has pointed out in analysing the
trends and fashions of contemporary art, the inflationary use of the term new never
actually appears on its own but always as a prefix (neo-) or as an adjective qualifying
a trend that already exists. The same goes for the fashions within IR.
3 In addition to the Wendt and Hollis and Smith debate, major IR contributions include
Wendt (1987, 1992a, 1994 and 1999), Dessler (1989), Carlsnaes (1992, 1993), Wæver
(1994a:260–71), Doty (1997) and Wight (1999, 2001).
4 However, there are different kinds of (critical) scientific realisms. For some, the emphasis
is on the word ‘realism’. Thus, Wendt (1991:397) writes, in distinguishing himself
from empiricism, that a ‘related dispute concerns the requirements of scientific
explanation, where empiricists seek to subsume phenomena under lawlike regularities,
whereas realists seek to identify the underlying causal mechanisms that generate the
phenomena’. In an e-mail conversation in 1992–3, Wendt often repeated the point
that he is a scientific realist precisely and only in this sense. As will emerge later on, in
my interpretation critical realist social sciences would sometimes do better without
the metaphor of mechanism. More generally, I define critical realism in terms of a
well-articulated ontological realism, epistemological relativism and judgemental
rationalism, and emphasise the role of CR theory of emancipation.
5 Scientific realist interpretation of a Humean phenomenology of constant conjunctions
is in no way a step forward from empiricism/positivism.
6 In fairness, it should be noted that in their reply to Wendt, Hollis and Smith (1991:397)
admit that there are other conceptions of causality. However, they never discuss or
take on board these other conceptions.
7 Even though they later, in Chapter 9, move in different directions, Hollis to a more
hermeneutic position and Smith to a scientific realist stand, they never distance
themselves from a Humean view of causality.
8 The best candidate would be the democratic peace theory, according to which liberal-
democratic states do not fight each other. As currently formulated, this theory is
reliant on the identification of what appears to be a strong and enduring constant
conjunction: liberal-democracies do not fight each other. That is, the argument is that
X (liberal-democratic regimes) explains the absence of Y (war, organised violence between
two states). The assumption must be that Y is a normal thing to occur, and that X is
the mechanism or thing that explains why it does not occur in this special situation
(or more strongly, prevents it from happening). The existence of a well-functioning,
on-switched lamp explains the absence of darkness in a room in the night, but would
be quite irrelevant in a bright sunshine. In a situation of constant sunshine, it is
darkness that should be explained (if it were to occur).
Moreover, as Barkaw and Laffey (1999:414–15) have argued, the correlation is
based, in part, on statist categories. Thus, for instance, the US covert actions against
democratically elected governments in the Third World resulted in significant
violence. However, they are not counted, and thus cannot refute the democratic
peace proposition. From a CR perspective, it is also the very idea of a constant
conjunction that is problematic. Constant conjunctions only obtain in closed systems.
However, the world of international politics has been, and is, anything but closed.
This suggests that it cannot simply be factors internal to democratic nation-states
that are responsible for this apparent constant conjunction. Even if a mechanism
(perhaps of a Kantian form) or, rather, a manifold structured complex were identified,
94 Overcoming international relations
it would still only be transfactually operative in an open system. This would mean
that there was something about the other relevant complexes and processes that
was also playing a causal role.
Hence, an adequate analysis must also focus on these other relations and processes,
forming a picture of the relevant parts of the geo-historical whole in question. Here,
the explanans and, up to an extent, also the explanandum of the theory changes. What is
the overall and possibly changing causal complex, which has tended to make the
twentieth-century relations between capitalist, Western-minded liberal-democracies
peaceful? Which of these components are changing in the early twenty-first century?
In effect, a CR analysis would suggest that if there is anything to explain, it must be
explained in terms of real tendencies and real working of spatio-temporal causal
complexes in open and therefore changing systems. See also Patomäki and Wight
2000; and Chapter 8 for an articulation of a rather different (partial) explanation in
terms of a pluralist security community.
9 Typically, main activities reproducing these institutions are located in the
governmental complexes or diplomatic settings of capital cities of different countries,
or in the diplomatic settings of the headquarters of various international
organisations. Spatially, these can be either inside or outside of any particular
territorial state, but always inside of at least one state (and usually many
simultaneously, given the modern communication technologies and the need of
diplomats to consult their governments).
10 Changing social practices may give rise to emergent spatio-temporalities. Long-distance
calls or computerised systems of information flows have disembedded, relatively
speaking, space from time. Immediate contacts have become possible over long
distances. A truly global multinational enterprise—which, although rare, can be taken
as an ideal case—presupposes the disembedding of space from place and relies in its
practices on the capability of disembedding space from time (cf. Bhaskar 1994:70).
These emergent spatio-temporalities may also enter into the discursive practices of
political identity—and interest—formation.
11 When I asked Bhaskar (personal communication, London, 22 July 1995) about the
plausibility of IR levels, his immediate reaction was: ‘Although I do not know the
subject matter very well, it seems, at least prima facie, plausible.’ Also Wendt, in his
comment on an earlier version of this chapter, asked whether it could be possible to
think about IR levels in terms of layered mechanisms. Thus, we could try to explain,
at least partially, higher-level laws in terms of layered mechanisms. We could also try
to explain, at least partially, higher-level laws in terms of mechanisms of lower levels.
For instance, Wæver (1994a:264) interprets IR levels in these terms.
12 The methodology of iconic modelling is devised for the purpose of identifying causal
complexes and social contexts; it will be explicated in detail in Chapter 5.
13 These new possibilities include: identity-constituting moves; learning-processes;
development of (dis)trust; altercasting; development of a sense of common fate; and
the formation of collective identities (see Wendt 1999: Chapters 6 and 7). In some
contexts, given particular research problems, Wendt’s ‘social theory of international
polities’ can thus provide highly useful and interesting hypotheses for concrete iconic
models.
14 Thus, in Wendt’s theory, structures are reduced to the ideas ‘states hold’. There is not
an unequivocal ‘logic of anarchy’, as in Waltz’s neo-realism, but three ‘cultures of
anarchy’—named after Hobbes, Locke and Kant—and processes of moving from one
to the other. This may be a richer and more nuanced or realist theory than that of
Waltz’s. It is nonetheless a theory of a closed list of interpretative and transformative
possibilities. These possibilities are derived from both philosophical theories and the
conventional international problematic rather than from any sustained empirical
research of concrete problems. Theory in this sense not only precludes transformative
Better stories about world politics 95
Explicating a critical
realist methodology
4 Critical realist
social ontology
The nature of objects determines their cognitive possibilities. This critical realist
(CR) idea, drawn from Aristotle, has also increasingly gained ground in
international relations (IR). Martin Hollis and Steve Smith have argued that
any attempt to explain anything in world politics necessarily implies hypotheses
about the existence and nature of entities. They contend that the justification of
ontology is connected to its explanatory merits. David Dessler (1989:446)
maintains that ‘the richer and more comprehensive the underlying ontology,
the better the theory’. Alexander Wendt (1999) seems to disagree with this
criterion, but has nonetheless written a remarkable book on the ontology of the
states system. Open-mindedly, Hayward Alker has learned, through trial and
error in his careful empirical studies, that ultimately our understanding of the
nature of data depends on our social ontology. Better social ontology is thus the
key to constructing more adequate sets of empirical evidence.
Individualism claims that ultimately there are only individuals or their minds,
preferences and decisions. The roots of individualism are in the capitalist
conception of man. This conception has never gone unchallenged. The
methodological debates between individualism and structuralism started some
time in the nineteenth century. These included the famous Methodenstreit in
economics; debates about Marx and Marxism; and manifold debates in sociology,
symbolised by the classic works of Auguste Comte and Emile Durkheim, among
others. In various guises, structuralism has claimed that there are emergent
social wholes that determine individuals and their behaviour or actions.
Anthony Giddens’s theory of structuration and Roy Bhaskar’s
transformational model of social activity (TMSA) are important attempts to
transcend this debate in systematic ontological terms. Giddens and Bhaskar
have many things in common. Both rely on a form of scientific realism, which
has incorporated basic insights of hermeneutics—as represented by Ludwig
Wittgenstein, Peter Winch and Hans-Georg Gadamer—and the critical theory
of the Frankfurt School. There are also differences between these two theories.
It has been claimed that Giddens’s theory is ultimately hermeneutic or voluntarist
in its implications (see, for example, Gallinicos 1985:133–66; Wendt 1987:335–
70), and that Bhaskar’s conception of social reality is a good example of Marxist
and structuralist thought (Shatzki 1988:239–60).
99
100 Explicating a critical realist methodology
In this chapter, I shall try to show some of the weaknesses and inadequacies
of both theories. My aims are, however, reconstructive. I try to clarify the
subject matter and elaborate further on the CR concepts of ‘action’, ‘structure’
and ‘power’. A central ontological theme is the nature of, and the relationshiph
between, internal and external relations. First, I develop the point that to explain
the (re)production of internal relations, we also need a theory of practical and
communicative action. Second, I argue that an adequate theory of external
relations requires an iconic model of different modes of social action. One of
these modes, it is argued, is indeed communicative action. Third, I attempt to
detach Giddens’s concept of power—which connects action essentially to power
as transformative capacity—from its Weberian undertone and incorporate into it
an important insight of Michel Foucault.
There is a far-reaching tension between Giddens’s and Bhaskar’s views on the
concept of structure. I criticise Bhaskar’s naturalist metaphors and analogies, but,
contra Giddens, maintain that a relational conception of structure is ontologically
more sound than Giddens’s ‘rules and resources’. As a final point, I elucidate
how the dialectics of emergent powers and properties of open historical systems
should be conceptualised and studied. Instead of the futile IR levels, I articulate
an alternative account of systemic emergence and stratification. In conclusion, I
collect the different strands of my argument together with the help of the notion
of a layered, relational causal complex. This ontological notion is central to the
whole of this book, to my explication of a CR research methodology.
Science employs two basic criteria for the ascription of reality to a posited
object: a perceptual criterion and a causal one. The causal criterion turns on the
capacity of the entity whose existence is in doubt to bring about changes in
material things.
(Bhaskar 1979:15–16)
The discovery of these entities always includes the formation of an iconic model,
which is basically concerned with qualitative analysis and picturesque description
of causal mechanisms, which are responsible for the phenomena we observe.
The formation of an iconic model must be done under plausibility control
(Harré 1970: Ch. 2).
On the other hand, philosophical reflection can reveal that reality must be
stratified, differentiated and structured if scientific activities are to be intelligible,
Critical realist social ontology 101
This definition implies, among other things, that there may be many different
sufficient conditions capable of producing the same result. Theodore Shatzki
(1988:24) doubts whether social structures can be treated as causes, because ‘talk
of causal efficacy, causal power, and being a generative structure requires that
anything that possesses these properties be something that actually brings about
changes’. There is some truth in Shatzki’s doubt, as we shall see. Nevertheless,
Bhaskar claims only that the dual character of society means that:
102 Explicating a critical realist methodology
Society is both the ever-present condition (material cause) and the continually
reproduced outcome of human agency. And praxis is both work, that is
conscious production, and (normally unconscious) reproduction of the
condition of the production, that is society. One could refer to the former as
the duality of structure, and the latter as the duality of praxis.
(Bhaskar 1979:43–4)
If social structures are always a necessary condition for any sufficient condition for
social action, as the ontological duality of structure implies, then they are also
INUS-conditions (def. 1), and hence causes by definition. Thus, social structures are
real. On the other hand, the problem with the magnetic-field analogy—and this may
have misled Bhaskar in some crucial points—is that it is quite reasonable to say that
a magnetic field sometimes produces or brings about changes (alone), whereas social
structures are never able to do so (alone), because social structures do not have
action-independent causal effects. Or to use Aristotelian terminology, social
structures are mere material causes, whereas magnetic fields can also be efficient
causes.
Are social structures always a necessary condition for action? In the TMSA
the ‘point of contact’ between human agency and social structures is the mediating
system of positions (places, functions, tasks, duties, commitments, rights, etc.)
occupied (filled, assumed, enacted, etc.) by individuals, and of the practices
(activities, etc.) in which, by virtue of their occupancy of these positions (and
vice versa), they engage. These positions and practices can be individuated
only relationally (Bhaskar 1979:51). To this must be added that social structures
exist only by virtue of the activities they govern, enable and constrain, and that
they are materially present only in persons and the results of their actions
(Bhaskar 1986:130–1).
In what sense are these kinds of structures always a necessary and pre-
existing condition for action? A distinction must first be made between internal
and external relations. These concern substantial relations of connection and
interaction, not formal relations of similarity or dissimilarity.1
Def. 2: A Relation RAB is internal if and only if A would not be what it essentially
is unless B is related to it in the way that it is.
(Bhaskar 1979:54)
Whereas external relations are conceptually contingent, internal relations are conceptually
necessary. Both can be found in society When Bhaskar claims that action (in
general) requires social structures as a necessary condition, it seems that he has
in mind only structures of internal relations. This claim stems from the
hermeneutic character of the social world: social world is meaningful; social
actors are intentional and intelligible; and their actions are always rule-enabled.
To have identity, actors and actions must be constituted by rules. There is a
difference between constitutive and regulative rules. The meaning of a particular
Critical realist social ontology 103
action derives from the relevant constitutive rules, which in turn are related to
other constitutive and regulative rules. Social phenomena retain their identity
under regulative rule violation, but not under constitutive rule violation. Consider
the example of the social practice of trial by jury as presented by John Greenwood
(Greenwood 1987:171–204). The constitutive rules of this institution would
include something like the following (Greenwood 1987:188):
Because of these definitions or constitutive rules, the trial, juror and judge are
internally related to each other. In this sense social structures are the necessary and
ever-present condition of all social action. Only they make meaningful social action
possible.
It is important to be very careful about differences between ontological and
epistemological matters. We—as researchers—may define trial, juror and judge as above,
but then we are making existential hypotheses; we can then ask: ‘Is the world really like
this?’ (ontology). The attempt to answer this question will involve what Giddens calls
the ‘double hermeneutics’ of social sciences (epistemology): a theoretically reflective
scientist tries to investigate and interpret a world that is itself meaningful (Giddens
1977:12).
Epistemologically, as Sayer (1992:56) notes, the conceptual relations we can
plausibly posit between entities are theory-dependent. For example, whether it
is conceptually inconsistent to describe an individual actor both as a capitalist
and a proletarian depends on what we take to be the possible nature of the
person, means of production, etc. It is in fact a theoretical question whether we
should accept the categories of capitalist and proletarian at all. Ontologically,
the existence of internal relations in the social worlds—outside academia—depends
on the explicit or implicit ‘theories’ of relevant lay actors and thus it is not
possible to infer them purely theoretically, or objectively. This is probably the
reason why John Shotter (1983) characterises social processes as intrinsically
vague and incomplete. Bhaskar comments on Shotter’s point only briefly:
The spheres of ambivalence and contradiction are also real (ontology), not
merely indispensable categories for social analysis (epistemology). I do not,
however, mean the real and ‘objective’ dialectical contradictions inherent in
class relations, as Marx at least sometimes did—although in general the conception
of dialectical causality is important (cf. Bunge 1959:160–5)—but rather the
dialectical disputations and contradictions in the ‘shading, soft-edged, open-
textured conceptual world of Wittgenstein and Waismann’.
There is usually no universal and definitive consensus among lay actors about
the constitutive rules of institutions, or about the definitions of position-practice
systems. Hence the ‘essential internal relations’ (def. 2) in the social world are
always both ‘vague and incomplete’. Consider the following quotation from Winch:
If social relations between men exist only in and through their ideas, then,
since the relations between ideas are internal relations, social relations must
be a species of internal relation, too. This brings me into conflict with a
widely accepted principle of Hume’s: There is no object, which implies the
existence of any other if we consider these objects in themselves and never
look beyond the ideas we form of them.
(Winch 1971 [1958]:128)
The point of critical realism is, contra Winch, that there are both internal and external
relations—and that Hume seriously mis-specified the nature of external relations.
However, what exactly is the nature of internal relations? How do internal relations
as ‘ideas’ come into being? How can there be shared ‘ideas’ and internal social relations
in the first place? The crucial theoretical problem is thus: In what way are the constitutive
rules and social meanings of action produced, mediated, reproduced and changed?
How is the mutual understanding of social (constitutive) rules historically possible?
Giddens sees the first question as being at the core of the problem of order.
Innovatively, Giddens defines the problem of order in terms of realist spatio-
temporal causality rather than an analogy to the Humean problem of induction.
‘The “problem of order” is the problem of how it comes about that social
systems [practices, relations] transcend time and space’ (Giddens 1983:78). In
other words, what mechanisms are capable of producing the outcome, namely
that the ontologically critical internal relations, practices and systems transcend
time, place and, also, the biological existence of human beings?
According to Habermas (1984 [1981]:274–5), the actions of several actors
are linked to one another by means of the mechanism of reaching understanding.
This is the way actions are interlaced in social spaces and historical times.
Critical realist social ontology 105
However, Habermas still works with the Hobbesian (Humean) problem of order
he has inherited from Talcott Parsons. Rather than ‘co-ordination of actions’,
the real problem is to explain the endurance and (re)production of ontologically
critical internal relations of social practices. However, Habermas (1987:295)
speaks also of ‘communicatively structured lifeworlds that reproduce themselves
via the palpable medium of action oriented to mutual agreement’, i.e. via
communicative action.
Reaching understanding seems to be the condition for human powers in
general. It is in this sense that we should talk about communicative action as
the (re)productive mechanism.
Both human powers and internal social relations are dependent on language
and systems of signs.
The social being is meaningful, and internal relations of social being must be
located at the level of understanding. However, there must also be lower-level
mechanisms capable of (re)producing language and ‘ideas’, or systems of signs.
These mechanisms include the genetically programmed, bodily powers of human
beings to learn practical skills and, thereby, work with and through meanings.
Practical bodily skills include walking, or making bodily gestures, or using the
human voice to make signs or handling a pen or knife and fork. Possession of
language requires practical skills, and practical skills are subject to natural laws
and determinations. ‘We never cease, and never can cease, to sustain relations
with all three orders of reality—natural, practical and social’ (Archer 2000:9).
However, in order to work with meanings one has to learn, via practical skills,
to understand meanings and follow social rules, both constitutive and regulative.
From the very early infancy onwards, embodied actors have the power to resist,
or take exception to, these rules. Later, with maturity, they can reflect upon,
criticise and ultimately change these rules.
The conception of structures of internal relations thus presupposes a theory—
or an iconic model—of both practical and communicative action. Hence, (internal)
social relations are intimately connected to action. This indicates that social
sciences cannot specialise in studying (internal) relations in isolation of
interactions, although Bhaskar (1979:56) seems to claim so. Social sciences have
two simultaneous sides: the analysis of action and the analysis of structures.
True, actions and structures are different kinds of things, irreducible to each
other, although ontologically dependent. They can be separated momentarily
for the analytical purpose of studying continuity and change, and the temporal
dialectics between agency and structure (cf. Archer 1995:168–70 passim). My
106 Explicating a critical realist methodology
point is, however, that historical social sciences must explain outcomes, which
occur only in shifting and changing open systems. Therefore, any explanatory
iconic model of concrete outcomes has to articulate a unified account of the complex of
relevant material and efficient causes at work in a well-specified area of time and space.
Internal social relations are typically both vague and ambiguous. They are
vague because lay-actors have partially incoherent and somewhat wavering
practical and discursive theories of the social world in question. Internal relations
cannot be more exact and clear than the meanings circulated, stored and
transformed in the practical social world, in social practices. Internal relations
are ambiguous because actors hold conflicting theories about social institutions
and practices, and these conflicting views are connected to differentiated horizons
and horizon-constituted interests in human societies.
In other words, in a complex world, as contrasted to, say, a more
homogeneous and unchanging medieval village, there is neither any a priori
guarantee of the sameness of the meanings, nor any singular causal mechanism
capable of producing the same meanings for all actors (despite the existence,
since the eighteenth and nineteenth centuries, of centralised national systems of
education devised also for the purpose of homogenising language and meanings).
Because there can be no social reality without meanings and internal relations,
the thesis concerning ambiguousness can also be read as a thesis about the
social ontology of discordance (Connolly 1987:10–16). Because the mechanisms
producing meanings are essentially communicative, and because of the ubiquity
of ontological discordance, there tend to be constant dialectical disputations
and contradictions in the ‘shading, soft-edged, open-textured conceptual world
of Wittgenstein and Waismann’.
Dialectical contradictions, in the sense specified by Nicholas Rescher (1977:62–
3), do thus exist, and they generate, among other things, open disputations and
political argumentations. These conflicts are a central element of the political
aspect of the social world (see, for example, Alker 1988:805–20). Consider
Figure 4.1. The disputation begins with an assertion of a thesis (P) with one
grounding (Q), and a categorical assertion! Q (‘it is the case that Q’). The opponent
responds in this case with a provisoed denial ‘if both Q & R then ~P, and I
suspect that Q & R’. The first assertion in this kind of debate may, for instance,
be that ‘a trial should be defined, as I do, as a public examination of evidence…,
because otherwise there would be no guarantee of the fairness of trial’, and the
opponent might respond that ‘you may be partially right, but there are also, I
suppose, many occasions where publicity would only do harm to fairness, and
hence I can see no reason to define the trial in terms of publicity’. In the
dialectical logic the denial of ~P is not just to assert P, for there is also refinement
(amplification, improvement, introduction of new and deeper grounds) in the
transition from Q to (Q & R & S), for instance (Rescher 1977:66).
Critical realist social ontology 107
Consider the previous example of the social practice of trial by jury, which
might include regulative rules something like the following (Greenwood
1987:188):
1 ‘Jurors should not let their own prejudices influence their judgement.’
2 ‘Counsel should not make emotional appeals to the jury.’
3 ‘Judges should not intimidate witnesses.’
108 Explicating a critical realist methodology
For example, the second rule regulates the external relation between counsel and
jury engaged in the practice of trial by jury. Of course, they may be also
internally related. Generally, internal and external relations do not exclude
each other. Giddens claims that ‘these two senses of right and wrong always
intersect in the actual constitution of social practices’ (Giddens 1979:82). Onuf
goes even further and asserts that all rules are both constitutive and regulative:
‘a constructivist always sees this distinction as arbitrary and unhelpful’ (Onuf
1989:85). However, this contention would imply that a violation of a rule
such as ‘counsel should not make emotional appeals to the jury’ (2, above)
would change the meaning (identity) of trial, counsel, jury or legal argument,
which is clearly not normally the case. Hence, the two senses of right and
wrong may well intersect in the actual constitution of social practices, and yet
the distinction between constitutive and regulative rules is not only important
but also ontologically necessary.
Suppose that counsels, jurors and judges believe that these (customary,
moral or legal) regulative rules exist effectively. This does not imply that they
follow these rules. How, then, can structures of external social relations generate
action and changes? Let us first examine how Bhaskar conceptualises these
kinds of structures. Bhaskar (1979:51) uses the term ‘position’ in connection
with the concepts of ‘places’, ‘functions’, ‘rules’, ‘tasks’, ‘duties’, ‘rights’, etc.,
but he unfortunately does not specify their relationships. ‘Places’ and ‘functions’
seem to be near to the concept of ‘position’, and rules partially constitute
positions. These concepts do not, however, illuminate the matter very much.
Strikingly, terms like ‘tasks’, ‘duties’ and ‘rights’ seem to point to the
Durkheimian direction in social theory. According to Emile Durkheim (1969
[1938]:246–7), ‘social facts’ consist of ways of acting, thinking and feeling,
external to the individual, and endowed with the power of emotional, moral or
legal coercion. This characterisation notwithstanding, Durkheim is not able
to show that ‘social facts’ necessarily force individuals to conform to external
and/or regulative social rules. This holds true for Bhaskar’s TMSA as well:
regulative rules alone cannot produce changes. Nor are they a necessary
condition for action in general, for the relation between regulative rules and
action is by definition contingent (although, of course, a regulative rule is
typically an I NU S-condition for a particular action). Because of this
contingency, the analogy to a magnetic field does not hold at all in the case of
structures of external social relations, and thus it seems that this analogy
carries a misleading picture of social relations.
Any theory or analysis of external social relations requires a theory of social
action. A theory of social action should include taxonomy of homoeomorphic
iconic models of action. A homoeomorphic iconic model is one where the
subject of the model is the same as its source of analogy (picture, metaphor).
Teleiomorphic models are a sub-category of homoeomorphic models, and they
are constructed by means of selective idealisations (some of the properties of
the source-subject are made, according to some scale of values, more perfect
than the source-subject’s properties) and abstractions (some properties of the
Critical realist social ontology 109
source-subject are chosen by some relevancy criteria) (see Harré 1970: Ch. 2,
esp. pp. 41–2). Weber’s ideal-types (Weber 1969) are a case in point, except
that Weber does not distinguish clearly between idealisation and abstraction.
According to Weber, an ideal-type:
it. This illustrates that while positioned practices always contain external relations
which are real, their existence and causes of existence are conditional. Their
causal effectiveness always depends on other necessary conditions, too (see def.
1), especially on the causally effective reasons and motivation for action. Models
of action are essentially models about the prevalent rules and principles of practical
reasoning. We are thus back to studying constitutive rules and internal relations
of different historical social worlds.
Consequently, models of social action must be falsifiable by means of double-
hermeneutic empirical research. We may assume that certain modes of action
are institutionalised into positioned-practices systems. For example, it is very
common to argue—or presuppose—that teleological/strategic action is internally
related to the capitalist market economy system. However, this is not
automatically or ahistorically so. For instance, the neo-liberalism of Thatcher
and Reagan can be seen as a political project, which attempts to make actors
comply with the strategic model of action in all of their everyday practices, thus
extending and intensifying the transformations already partially accomplished
by eighteenth- and nineteenth-century economic liberalism. Yet this
transformation cannot be completed. The transcendence of social systems in
time makes communicative and other non-strategic modes of action necessary.
In fact, these modes can be found even in capitalist practices.
It is possible to distinguish between different kinds of strategic action:
difference maximising, hasty egoism, far-sighted egoism and altruism, depending
on how actors understand their (external) relation to other actors (see Taylor
1976; Axelrod 1984; Frochlich and Oppenheimer 1984). These types occur in
a capitalist market society. Second, different types of strategic action can be
seen as orientations, which are based on deeper grounds, theories and motives (Hollis
1988:140–1). Thus, forms of strategic action may be essentially connected to
normative or expressive logic of action.
A capitalist market society consists not only of market transactions but also of
social relations inside hierarchical organisations, such as multinational corporations,
or between, say, privately owned organisations and state compartments.
Organisations facilitate co-operation among actors. This facilitation of co-operation
is to a varying extent due to the local degree of hierarchical surveillance and
control, but also because organisations create enduring contexts for social action.
In organisations there can develop a trustful atmosphere where the mode of
action changes, to some extent, towards that of normatively regulated action, at
least in the ‘high trust positions’ (Giddens 1987a:157–62). Also, communicative
action in organisations has better prerequisites for success than in the context of
markets (cf. Williamson 1975:37–9; also Perrow 1986). Furthermore, Galbraith
(1967) has shown how the actions of the technocratic staff of large firms can also
be interpreted in terms of the dramaturgical model.
So we should not be blinded by our definitions and analysis of the internal
relations between modes of action and position-practice systems, but at least allow
sufficient ‘flexibility and a certain amount of cross-checking of observation or
reflection under one group of concepts by another’ (Sayer 1984:54). For example,
112 Explicating a critical realist methodology
Def. 4: First, the notion of action has reference to the activities of an agent. Second, it is
a necessary feature of action that, at any point in time, the agent ‘could have done
otherwise’. Third, action involves a stream of actual or contemplated causal interventions
of corporeal beings in the ongoing process of events-in-the-world, thus producing various
outcomes.
Actors more or less know what they are doing when they do it and can often
be quite clear in articulating it. But it does not follow that the broader
consequences of these local actions are coordinated, …[for actors] ‘don’t
know what they do does’.
(Dreyfus andRabinow 1982:187)
I think these examples, and the general Foucauldian point, indicate that there are
good reasons to widen the definition of power to include the longer-term
existential (ontological) effects of power. As will be seen, this wider definition can
liberate from the theoretical puzzles that Giddens has been struggling with. Thus
the simple def. 5:
114 Explicating a critical realist methodology
Social structure
the initial conditions in any concrete social explanation must always include or
tacitly presuppose reference to some or other social relation (however the
generative structures invoked are themselves best conceived). And it is, I suggest,
in the (explanation of the) differentiation and stratification, production and
reproduction, mutation and transformation, continual remoulding and incessant
shifting, of the relatively enduring relation presupposed by particular social
forms and structures that sociology’s distinctive theoretical interest lies.
(Bhaskar 1979:51–2)
Critical realist social ontology 115
This suggests that Bhaskar makes an epistemological case for a special discipline
called ‘sociology’. (This argument may include ontological components, inasmuch
as it is grounded on the reality of internal relations.) Bhaskar, however,
distinguishes between generative structures and social relations, which therefore
must be different things. Since he too apparently views generative structures in
terms of ‘rules and resources’, then the claim that sociology’s distinctive
theoretical interest lies in the sphere of social relations seems to be somewhat
undermined. However, consider the following:
The social sciences abstract from human agency to study the structure of
reproduced outcomes, the enduring practices and their relations; the social
psychological sciences abstract from these reproduced outcomes to focus on
the rules governing the mobilisation of resources by agents in their interaction
with one another and nature. If the sphere of the former is social structure,
that of the latter is social (and socio-natural) interaction. They may be linked
by the study of society as such, where society is conceived as the system of
relations between the positions and practices (or positioned-practices) which
agents reproduce or transform….
(Bhaskar 1986:124)
After this, Bhaskar refers to Karl Marx, who wrote in the Grundrisse that ‘society
does not consist of individuals, but expresses the sum of interrelations, the
relations within which the individuals stand’. Bhaskar seems to argue that the
emergent properties at the level of society are social relations, that these relations
form structures, or mechanisms, which generate changes, and that these
mechanisms are analogous to natural mechanisms like a magnetic field. Giddens,
on the other hand, lays the emphasis on rules and resources, which constitute
and/or generate social relations through actions. For Giddens, social relations
seem to be by-products of the action, rules and resources nexus.
I have already argued that the analogy between a magnetic field and structures
of social relations does not hold. First, social rules and resources usually only
enable or allow certain possibilities and disable, disallow or exclude certain
others (Dessler 1989:453). Moreover, if it is true that rules and resources
constitute and/or conjunctively generate social relations, then rules and resources
are ontologically prior to external social relations, as Giddens seems to be arguing.
Consider two examples. Let RAB be (1) a relation between a wife (A) and a
husband (B), and (2) a relation between a modern statesman (e.g. a prime
minister, a foreign minister or a president A) and a national electorate (B). Both
relations can be analysed in terms of position-practice systems (a marriage;
liberal-democratic elections and the imagined community of a ‘nation’ in the
nation-state system), rules (constitutive and regulative), resources (connected to
powers or transformative capabilities) and related modes of action.
116 Explicating a critical realist methodology
Without these substantial qualities the relations (1) and (2) are empty, abstract
in almost the same way as formal deductive logic is. Moreover, these relations
cannot be understood or explained without explaining relevant action, rules
and resources, which are necessary for these relations to exist. Finally, as a
direct consequence of the dual character of action and structures (which implies
the need for a theory of action), there is a necessary—although perhaps loose—
connection between individual actors’ reasons and the meaning of social actions
and relations, although Bhaskar argues that:
separable, and real powers and capabilities cannot be reduced to actual instances
of interaction. Hence, I would like to propose the following definition:
Def. 6: The term ‘social structure’ refers to internal and external relations of a
positioned practice; these relations are implicated and/or generated by the
components of the relevant causal complex.
The internal and external relations are constituted and generated by the relatively
enduring nexus of rules, resources and (re)productive actions. Constitutive rules
make action possible and also regulate it. Constitutive rules imply internal
relations, which form wholes with all kinds of logical relations between the
parts. They are also action-dependent in a sense that actors have the power to
reflect upon, criticise, deviate from and change the constitutive rules. External
relations are the outcome of action generated by (1) rules; (2) resources as
embodied capabilities and position-endowed facilities; and (3) intentional reasons
for actions. External relations are contingent. They are (re)produced in and
through social practices, with the possibility of ‘doing otherwise’ by the actors
as ever present. Intentional reasons, even if idiosyncratic, are embedded in the
contexts of interaction as well as in the background discourses (and related
practical skills and capabilities). Also, the logics of action are constituted by
historical rules and principles. In concrete empirical research, to uncover these
layers and components requires a lot of painstaking, even meticulous, work.
Def. 6 gives, clearly, a relational definition of structure. Is it also possible to
define social structure in compositional terms (e.g. ‘age structure’, ‘occupational
structure’, ‘structure of industry’)? Although this is certainly possible, and in
some contexts also legitimate, two essential things must be kept in mind. First,
from the causal analysis point of view it is necessary to examine substantial
relations of connection and interaction, but not formal relations of similarity
and dissimilarity. Thus, in order to perform causal analysis, compositions either
should be interpreted in relational terms, translated into them or explained by
means of relational conceptions. Second, all statistical information is the product
of a two-stage interpretation process. The first stage is the official process of
definition prescribing the criteria for inclusion in the category. The second stage
is the registration of instances, which is done by persons of varying competence
and variously biased (Maclver 1964:137; see also Alker 1996:332–54). Hence,
(1) the relatively quantitative compositional knowledge is no more objective
than the essentially qualitative relational knowledge; rather, (2) it is causally
and, therefore, ontologically secondary to the conceptually much richer relational
knowledge.
With def. 6, we both can see society as a relational entity (somewhat contra
Giddens) and at the same time deny that social structures are ontologically or
118 Explicating a critical realist methodology
This is a relational definition, but one that does not deny that social activities
occur only in tim e and space. All collectivities are social systems, but
there are other wholes, which can be identified as systems as well, such as
international society or capitalist world markets. Social systems have no essences
independent of the contexts of action; they change together with those relational
contexts. Social systems can, however, have emergent powers and properties.
The problem is to avoid reification and mystification of systemic powers and
properties. They must be analysable in terms of actions and positioned practices
occurring in concrete spatial settings, as already outlined in Chapter 3.
The IR levels would preclude studying open-mindedly levels in these two
possible senses. Social systems are not only open, they are also overlapping
and, in parts, intra-related. States, for instance, are themselves open systems,
intra-and inter-related to various other systems consisting of, say, practices of
capitalist world economy, international institutions and systems of governance
of global political economy. In spatial terms, local developments within states,
as well as parts of the state apparatuses themselves, can be causally and
existentially both intra- and inter-related to a number of collective actors or
systems. Emergent powers and properties, as well as hierarchical power relations,
should be studied open-mindedly from this kind of realist perspective. Emergent
powers and properties should not be organised, in one’s mystifying imagination,
to accord with the simple IR levels just to yield the convenient possibility of
continuing to reproduce the international problematic. To repeat once more the
argument against armchair philosophising, the dialectics of these layered,
relational and open wholes can only be described and explained by means of
detailed and systematic empirical studies of concrete practices and relations.
Critical realist social ontology 119
The notions of context and complex form the core of CR methodology. According
to def. 1, cause is an insufficient but necessary part of a condition that is itself
unnecessary but sufficient for the production of a result, i.e. the INUS-condition.
There are five necessary social components in any causal complex (K) capable
of producing events, episodes, tendencies and the like, namely: actors (AR);
regulative and constitutive rules (RU); resources as competencies and facilities
(RE); relational and positioned practices (PRA); and meaningful action (AN).
Also, actors and their characteristic logics of action are historically constituted.
Together, these related and interdependent components, and the emergent powers
and properties of consequent social systems, create the sufficient but unnecessary
complex for the production of a result. Hence, there is never a single cause but
instead always a causal complex K={AR, RU, RE, PRA, AN}, perhaps including
emergent properties and powers. This formulation does not disclose the layers
of social beings and systems, or the unintended consequences and the spatio-
temporality of (interactions. These have to be studied as well. Nonetheless, the
notion of causal complex provides a useful way of systematising the elements
that will have to be studied in any given context.
A result X may be produced both by K1 and K(i > 1), i.e. there may be many
different sufficient conditions capable of producing the same result. On the
other hand, action is one of the elements, and actors can do things otherwise.
A context of action is relative to particular actor j, AR j and to AR j ’s and other
actors’ interpretations. As Giddens (1983:78) puts it, ‘context’ is always the
contextuality of action. A context must also be relative to a particular moment
in the continuous flow of time. Thus, CONTEXT=Kt _ {AN j t}, i.e. the causal
complex at moment t with the meaningful action of AR j at the moment t
excluded. This comes close to the difference between material and efficient
causes. CONTEXT forms the complex of material causes for AN j t,whereas
the real reasons for this action form the efficient causes of the actions of this
positioned agent ARj . His or her actions, in turn, are part of the CONTEXT
of other actors. Reasons for action are layered and may be circulated in
institutionalised practices where they can have a taken-for-granted nature, and
where many of the deeper assumptions—and the required practical skills—often
go unnoticed. Yet there is always a variety of possible actions in a given
context. This is just a consequence of def. 4, according to which it is a
necessary feature of action that, at any point in time, the agent ‘could have
done otherwise’. Two cases can be distinguished:
particular can action be inconsistent in a context? Action can break the constitutive
or regulative rules of the context. This may be due to a mistake (Winch 1971
[1958]:32), or it may be due to intentional deviance. In both cases, there may also
be some innovative implications, intended or unintended, learning-inducing or
pathological, leading to a closure. When innovative implications are unintended
they will reveal their innovative character only later in the flow of time, though
this is often the case for intended innovations as well. In a ny case, a careful
examination of the CONTEXT of AN j t and the nature of AN j t will disclose the
novelty and originality of that act(ion) (cf. Skinner 1988:79–118).
An individual actor can move from one context to another. It is possible that
an actor can act simultaneously in two or more different contexts, or at least
contexts can penetrate each other. An individual actor exists as a bodily being,
with embodied competencies. He or she is capable of remembering his or her
own past and reflecting, planning and ‘doing otherwise’. Through his or her
memory and embodied practical skills and capabilities, he or she is in fact also
a personified and personal ‘intersection’ of different contexts. For instance, an
individual actor can bring the generative rules and capabilities he or she has
acquired in one context to bear in another context. This may turn out to be,
say, inappropriate, curious, amusing, exotic, stubborn, immoral, rebellious or
creative. The ‘intersection’ is personified because the actor’s complicated and
(also always) ambiguous identity, as well as his or her stock of resources and
expectations concerning rule-following, is formed during his or her life path
through different relational contexts. Also, the ‘intersection’ is personal because
it is a unique composition, and it may include innovative features.
Freedom of human action is obviously not limited to the individual freedom to
‘act otherwise’. It is also a matter of changing realities: the constitutive and
regulative rules; the social relations; the generative structures and complexes. To
effect changes is typically a collective endeavour. The forging of transformative
social identity is a complicated and fragile process. If this process is successful, it
yields the parties emergent powers and properties—in the context of unintended
consequences of action. The realist grounds for transformative actions will be the
topic of Chapter 6.
Conclusion
The primary purpose of this chapter has been the explication of CR social
ontology by discussing the concepts of ‘action’, ‘structure’, ‘power’ and ‘system’.
The result is just an open-ended articulation, a flexible starting point for
systematic social scientific explanations. It is not a philosophical foundation.
Rather, it is—and should remain—open to further argumentation, experience,
criticism and change. That what is should be analysed and explained by social
scientists and lay actors, not by metaphysicians.
The main objects of criticism of this chapter included Bhaskar’s naturalist
metaphors and his vacillating notion of social structure. I used a re-interpreted
Critical realist social ontology 121
Notes
1 See Sayer (1992:82). A ‘good’ example of analysis of internal and external relations
without this distinction can be found in Elster (1978:22–5).
2 Habermas has, for sure, attempted to escape from the phenomenological philosophy
of subject/consciousness. Habermas (1987:295) speaks of ‘communicatively structured
lifeworlds that reproduce themselves via the palpable medium of action oriented to
mutual agreement’, i.e. via communicative action. Thus ‘lifeworlds’ are not, in
Habermas’s theory, situated in subject/consciousness. However, there are problems
in Habermas’s theory. First, he argues that the task of reconstructive and critically
Kantian philosophy is to articulate the always already presupposed normative validity
claims of speech-acts, and then he differentiates lifeworlds in accordance with these
validity claims. Second, Habermas grounds his theory of action on the theory of
differentiated lifeworlds. Finally, Habermas bases his theory concerning ‘system’ and
‘lifeworld’ on his theory of social action. The result is a philosophical-ontological
theory of modern societies, with mutually exclusive spheres. This theory exceeds the
reasonable limits of philosophical ontology. In contrast, an iconic model is proposed
in terms of fallible, corrigible and controllable existential hypotheses.
3 Habermas (1984 [1981]:86), speaks of ‘interpretation, which refers to negotiating
definitions of the situation which admit of consensus’. My re-formulation of this
indicates that I would like to substitute ‘a model of contextually structured and situated
communication, which presupposes merely the possibility of consensus together with
some normative prerequisites’ for the Habermasian ‘model of unconstrained consensus
formation in a communication community standing under cooperative constraints’.
The problem with the Habermasian model is that it is partially based on the notion of
ideal communication community. Furthermore, it curiously implies that language
and communication are used merely for ‘consensus formation’ or ‘negotiations’.
However, de facto this is not the case, and it is not the only normative ideal of language
use. Habermas (1987:326) seems to be of a different opinion, for he writes: ‘To the
extent that the lifeworld fulfills the resource function, it has the character of an intuitive,
unshakeably certain, and holistic knowledge, which cannot be made problematic at
122 Explicating a critical realist methodology
will—and in this respect it does not represent “knowledge” in any strict sense of the
word. This amalgam of background assumptions, solidarities, and skills bred through
socialization constitutes a conservative counterweight against the risk of dissent inherent
in processes of reaching understanding that work through validity claims.’
4 About the first, original nineteenth-century round of the pathological learning process,
articulated by Karl Marx and Friedrich Engels, which eventually led to Communist
dictatorship in Russia, see Chapter 1, note 5.
5 Double hermeneutics
of iconic modelling
123
124 Explicating a critical realist methodology
In the last section of this chapter, I shall discuss the role of time and space in
explanations. Following Andrew Sayer (2000:106–154), I contend that explication
of spatiality is usually relevant, but not always the key issue. However, temporality
is fundamental to explanations and actions alike. Drawing on Paul Ricoeur’s
theory of narratives, I conclude this chapter by clarifying the double hermeneutic
logic of temporality. This argument anticipates the logic of explanatory
emancipation.
genres of drama (tragedy, comedy, fairy tale) can also provide explicit sources
(possibly or usually in addition to science and technology). Paramorphs are
usually constructed to model processes. Paramorphs have two aspects: relations
to source and to subject. The relation of a paramorphic model to its subject
may be of three different kinds. In partially analogous models, the initial and
final states of a model and subject may be exacdy alike, while the processes by
which they were reached may differ. A complete paramorphic analogy models
all relevant details of its subject. In a partial paramorphic homologue the model
and subject would have an identical manner of working but only analogous
inputs and outputs. A singly connected paramorphic model draws on the
principles of one source, but a model can also be multiply connected to many
different sources (Harré 1970:44–5).
Examples of paramorphic models in social sciences include social physics,
sociological functionalism, evolutionary models and cybernetics. All of them
have been used in international relations (IR). Arms races have been modelled
in the Richardson—Rapoport tradition in accordance with the schemes of
Newtonian physics.1 World-System Analysis, functionalist theories of co-operation
and Waltz’s neo-realism have used sociological functionalism and evolutionary
models, albeit in very different ways.2 Traces of cybernetics can be found already
in David Mitrany, but it was Karl Deutsch who explicitly brought cybernetic
models to the field. It is noteworthy that, of all these schools of thought and
thinkers, only Deutsch has been really clear about the fact that he is constructing
models whose subject matter and sources differ. When the subject matter and
sources differ, models can be constructed only by way of analogy.
The field of the subject of the model can also be roughly the same as the
source on which the model is based. These models are called homoeomorphs.
Homoeomorphs can be divided into three subcategories. Only teleiomorphs
are interesting from the point of view of social sciences. The term ‘teleiomorph’
is intended to suggest that teleiomorphs are, in some respect or respects,
improvements over their subjects. Two kinds of teleiomorphs can be
distinguished, depending on just how they are related to their source-subject
(which may be a particular concrete entity, representing the type of all members
of some class of entities, or it may be some kind of an average). There are
128 Explicating a critical realist methodology
idealisations and there are abstractions. A pure idealisation has all the relevant
characteristics of its source subject, but at least some of them are, according to
some scale of value, more ‘perfect’ than the source-subjects’ properties.
Abstraction, in turn, can be analysed as follows. If the source-subject has the
relevant properties pl…pn, then an abstraction has the properties pj…pk (1<j<k<n);
that is, fewer than its source subject (Harré 1970:41–2).3 Max Weber’s ideal
types are a case in point. For Weber, ideal types offer an ideal picture of events,
types of action and social relationships by an ‘analytical accentuation’ of certain
elements of reality. In more Marxist terms, one can also talk about abstraction
as isolation in thought of a partial aspect of an object. It is important to keep in
mind what we abstract from, and whether the abstraction preserves essential
properties of the object (Sayer 1992:86–7).
However, teleiomorphs also involve analogies, metaphors and metonyms. In
the case of teleiomorphs, the typical analogies are geo-historical rather than
analogies to the (subject matter of) natural and technical sciences. Many of the
metaphors used may well go unnoticed. There is no simple way of ‘accentuating
reality’. Consider a few familiar examples of apparently literal ways of talking
about world economy and politics:
• ‘Germany is in the centre and Tanzania in the periphery of the world economy.’
• ‘There are three great powers in the world.’
• ‘The power of the USA was in decline in the 1960s and 1970s.’
• ‘Nixon bombed Hanoi.’
• ‘Finland felt that she had to be more active with respect to her policy in the
United Nations.’
• ‘Unilever has raised its prices of washing liquids again.’
should not use these metaphors in describing the world uncritically, at least not
without being as explicit as possible about such a usage. Rather, we should
grasp and analyse metaphorically structured constitutive rules and definitions
for what they are, as constitutive, meaningful and intentional (even if contested)
features of the real world we are studying. Even if constitutive, they may
nonetheless be highly misleading as descriptive claims.
An iconic model stands in for real causal complexes and mechanisms. Actors
must have some familiarity with the social complex and related (or consequent)
mechanisms; their competencies and knowledge are the condition for social relations
to exist. However, we can assume that they are partially ignorant of at least some
layers and aspects of the worlds, in which they exist and act, and they may also
have false understandings of some or many of these realities. An iconic model
includes the following kinds of hypotheses (modified from Harré 1970:55–6):
This scheme of iconic modelling clarifies the nature of many social scientific
practices. Yet it can also critically re-direct them and lead to more systematic
and open-minded practices. It can help to overcome many of the standard
dichotomies—such as the one between ‘explanation’ and ‘understanding’—which
mislead more than enlighten.
Richard Rorty has, however, rightly criticised ‘the metaphysical thinker’ who
‘thinks that if you can get the right understanding of Being—the one that gets Being
right—then you are home’ (Rorty 1991:38). Thus, we also need constant self-
Double hermeneutics of iconic modelling 133
Empirical evidence
Assuming that after a critical scrutiny of its presuppositions, the research problem
has been sustained, even if only in a qualified form; and that at least a few
interesting candidates for endorsement as components of possible iconic models
have been specified; the next job is to assess and refine these candidates. In an
open-ended, realist research, empirical evidence must play a critical role in the
assessment and refinement of candidates. Although quantitative evidence is
indispensable, the reasons of actors and the meanings of actions and practices
can only be studied by qualitative means.
Quantitative data
Qualitative evidence
evidence is primary. Internal social relations exist as shared (even if often vague
and ambiguous) concepts. Internal relations are the necessary material causes
of social action. They are implicated in the causally efficacious reasons for
action, and involved in the outcomes of any generative social complex. Moreover,
regulative rules, as well as the external conditions and circumstances of action,
have to be given meaning by the actors, otherwise they could not enter into
their reasoning processes. Hence, to model a layered causal complex adequately,
one has to study the meanings of relevant actions and practices.
This is the second level problem of hermeneutics in social sciences, concerning
the movement between the researcher, his or her explanatory models and the lay
meanings. The mode of access to lay meanings is not external observation but
communication (cf. Habermas 1984 [1981]:102–13). If we study past worlds and
the relevant people are already dead, communication cannot be mutual. Rather, it
is about interpreting texts or text-like traces of the past. These texts might have
been constitutive of the meanings under study (first-hand evidence) or they may
make references to them (second-hand sources). If we study contemporary worlds,
and the relevant people are still mostly alive, mutual communication is possible.
Participatory ‘observation’, interviews, discussions and other forms of mutual
communication can complement the study of texts and text-like traces.
Despite the detachment of the researcher and his or her object of study in
terms of time, space and causal responsibility, the subject—object relationship is
complicated even when the object of study is historical. There may be (asymmetric)
internal relations between the being of the researcher and the constitutive meanings
and concepts of past objects. The researcher also has fore-knowledge and
anticipations about these meanings and related validity claims. How is it possible,
then, to achieve better understandings in a conscious and controlled manner? In
Hans Georg Gadamer’s view, it is actually impossible to find a method that would
guarantee objectivity in the traditional scientist sense. There is no interpretation
totally free of presuppositions and prejudices. While the interpreter may free
himself or herself from this or that situation, he or she cannot free himself or
herself from his or her own facticity, from the oncological condition of always
already having a finite temporal situation as the horizon, as the standpoint from
which the presumed possibilities and anticipations stem.6 It is this horizon that
should be put systematically at risk when it is confronted by new qualitative
evidence or experience (practical or communicative).
There are thus two sides to collecting and analysing qualitative evidence.
The onto-ethical relation of the researcher towards the unfamiliar or alien
meanings, actors and practices, and the methodological problem of classifying
types of evidence and analysing them systematically for the purpose of building
a well-endorsed iconic model. The onto-ethical relationship must be based on
virtues of openness and learning. ‘Historical consciousness is aware of its own
otherness’ (Gadamer 1975:306). The process of understanding meanings must
follow the logic of hermeneutic circle and occur through constant fusion of horizons
(see Gadamer 1975 [1965]:306–7 passim). The available evidence should be
classified in terms of what relationship they bear to the actions and contexts
Double hermeneutics of iconic modelling 137
Human beings are temporal and spatial beings. We cannot go back in time;
whatever we do is an intervention in irreversible causal processes; physically,
we cannot be in two places simultaneously; and the specific area of space we fill
in—‘this chair’—cannot be occupied by anybody else simultaneously. Bodily action
always involves a stream of actual or contemplated causal (and thus material)
interventions in the ongoing geo-historical processes, thus producing various
outcomes. Causal powers can also be activated or mediated across both spatial
and temporal distance, ‘provided that there are intervening media through which
they can exert their influence’ (Sayer 2000:112).
To what extent is it necessary to spell out the spatial and temporal location (or
distance) and organisation of the components of causal complexes, including actors
and actions? Interest in studying social space seems to coincide with the crisis of the
short-lived twentieth-century claim that territorial nation-states de-limit the space of
most social relations and processes. What is crucial is whether spatial constellations
and organisations make any difference to how concrete practices, powers and causally
efficacious complexes work. It cannot be decided a priori that spatial location, distance
and organisation are particularly important aspects of causal complexes. It has to be
left to the process of iconic modelling to determine how and why space matters.
Temporality seems essential to any social interaction and process. All
components of causal complexes are geo-historical, except the natural powers
and mechanisms that make society possible (and they are subject to long-term
processes of change too). Moreover, actors cannot change the past or intervene
in future processes they do not anticipate. In social interaction, the timing of
action is often crucial and, in some social contexts, it is also meticulously
organised. It is at least as important that in making something actual (present),
138 Explicating a critical realist methodology
actors unite reasoning about the past and possible futures in terms of narratives.
A social scientist, in his or her turn, tells stories and may thereby re-signify
practices. Also, the timing and media of his or her intervention makes a difference.
Space
In the early twentieth century, the science of geo-politics emerged with the
exhaustion of the geographical space for Western colonialists and explorers. It aimed
at viewing the planet earth as a whole and stipulating positivist ‘laws’ about the
statist or imperial governance of the limited global space (Tuathail 1996). However,
geo-politics has been an exception rather than the rule. In the good part of the
mainstream of twentieth-century economics, politics and sociology, it has been
assumed that economy and society mean or at least correspond to the nation-state.
Often, space has been abstracted away with little or no concern about the
consequences (Sayer 2000:108–9).
Henri Lefebvre, David Harvey and others have complained that the tendency
to abstract away from space—or to make simplistic assumptions about it—has kept
social scientists ignorant about the deep conditions of the social objects they are
studying. This ignorance has also caused blindness about the possibility of (the
ongoing) profound changes. For Lefebvre (1991; 1974:11), space has an active
operational and instrumental role in the organisation of social systems, including
production and exchange. Hegemonic forces or classes also make systematic use
of space in their attempt to gain or secure preferred practices and relations.
Following Lefebvre, Harvey (1990:201–39) argues that deep cultural conceptions
of space and time are necessarily created through material practices and processes
of production, which must be organised spatially. Command of time and space
are also important resources of power. In turn, practical organisations and cultural
conceptions of time and space create senses of belonging and identity, and also
render particular cultural and political projects worthwhile.
The term ‘geo’ comes from a Greek word for earth. Social processes are geo-
historical in the sense that, until very recently, human powers and social actions
have always occurred somewhere on the surface of the planet earth. However,
actions and their contexts can be independent of particular spatial settings.
Embodied competencies, available facilities, rules, actor identities, practices and
modes of action can remain unchanged across various locations of actors and
their facilities (Harvey 1990:118). Facilities such as long-distance mobile phone
calls, the Internet and trans-continental flights have further contributed to the
(relative) detachment of action from location or place, at least for those endowed
with sufficient resources (cf. Bhaskar 1994:70).
Space has no causal powers independent of the objects constituting it. It is
one of the problems of theories of ‘centre’ and ‘periphery’ in the capitalist
world economy that they offer a generalised spatial location as the crucial
component of explanation of under-development, failing to specify the relevant
causal powers (or their absence) and related geo-historical processes adequately
Double hermeneutics of iconic modelling 139
Time
All social phenomena, all social interaction, anything that exists in society, is
temporal. Time is not a simple succession of abstract ‘nows’. Time is involved
in social phenomena in a very complex manner. An adequate iconic model
should be able to capture the relevant aspects of temporality. The historical
nature of social being includes the geo-historical processes of formation of all
components of causal complexes. Hence the need to write genealogies of these
components and the fields and problematics they give rise to. Moreover, social
interaction occurs in time. Adequate iconic models have to account for the
irreversible temporal sequences of episodes and processes.
Temporality is also an essential part of social action in a deeper sense.
According to Paul Ricoeur (1984:68–84), there is a unity of having-been, coming-
towards and making-present, since these are thought and acted upon together by
the actors. This is the temporality of practical experience and action. The making-
present of practical action stems from the anticipation of possibilities of
transformative action producing outcomes on the basis of understanding of that
which has-been (history). The horizon of action is thus inherently temporal.
Moreover, particularly the appeal of more total narratives may be reinforced by
140 Explicating a critical realist methodology
awareness of one’s own mortality. Many political projects derive their motivation
from the sense they render to our/their lives. This opens up the way to the
questions posed by Alker:
What motivating power or charisma does this story, or ‘myth’, contain? What
basic structure or structures, what infectious, self-replicating, ‘viral’ qualities
account for this power? Can we replicably discover and empirically test some
of the ways in which its deepest structure has been rewritten into different
‘surface’ texts?
(Alker 1996:105)
A research project may thus also begin with a pre-understanding of some terminal
consequences—actual or potential—which are assumed to be important. The choice
of terminal consequences must itself be a responsible choice. In principle it is
possible that the same material of temporal actors, rules and resources, practices,
interactions, circumstances and intended and unintended outcomes can be
ordered according to different actual or potential terminal consequences. All
these temporal interpretations can be true with regard to the causal powers and
sequences upon which they are elaborated. There may thus be many stories to
tell on the basis of the same material.
Ricoeur (1984:70–82) argues that narrative has its full meaning only when
it is restored back to the time of action and human practices. Mimesis3 marks
the intersection between the world of social scientific modelling and the world
of the hearer and the reader. Although there may already be redundancy of
meaning in the social world itself, there is always potentiality for new and
better stories such as is assumed to be the case in psychoanalysis (by both
parties) or trial (by the judge). The main point of mimesis3 is that it re-signifies
human practices in their temporal dimension, to the extent that explaining and
narrating is to remake action. Ricoeur calls this process iconic augmentation of practices.
Iconic augmentation depends upon the symbolic structuring of the prefigurated
time. The practical question is: How plausible is the new and purportedly more
true and better explanatory story when compared to the prefigurated one(s)?
More structurally, it is also a question of whether the new stories have access to
the world of prefigurated practices.
Conclusion
It is not easy to build adequate social scientific explanations. The real difficulty
does not lie in any technical or formal details. It is all too easy to stick to the
ampliative approach and imagine that one can substitute complex thinking and
research with a few technical operations of deductive or theoretical reasoning,
or statistical analysis. In the reductive approach to iconic modelling, it is rather
the multiplicity of possibilities, layers and components that have to be taken
into account, which forms the real difficulty.
This chapter has given some methodical guidelines about how to construct
an explanatory iconic model. First there is a question and an X (some ‘terminal
consequences’) to be explained. The presuppositions of this question, and the
description of the X or ‘terminal consequences’, have to be scrutinised critically.
The next task is to specify available interpretative hypotheses and stories, and
possibly envisage new ones; and determine sources of available evidence, whether
quantitative or qualitative. Are there many interesting candidates for
endorsement? Is evidence abundantly available? If not, one has to use all possible
clues to construct a plausible iconic model and narrative. If yes, different
hypotheses have to be assessed systematically against available evidence, and,
eventually, a judgement about the truth of these hypotheses has to be made.
142 Explicating a critical realist methodology
The components of a layered, relational causal complex and all the relevant
material and efficient causes must be specified in sufficient detail. The researcher
has to decide the extent to which spatial location, distance and organisation are
important aspects of the causal complex. Moreover, she will also have to thematise
the temporal structure of the explanation at the levels of mimesis1 and mimesis2.
In every step of the double hermeneutics of iconic modelling, there is, first, a
movement between a wide variety of theoretical concepts, and, second, a
movement between the researcher, her explanatory models and the lay meanings.
In mimesis3, the explanatory story assumes its full meaning when it is restored
back to the time of action and human practices.
However, mimesis3 is not the only practical implication of social scientific
explanations. In Chapter 6, I shall explicate the logic of explanatory emancipation
and its relation to legitimate transformative actions. Social sciences also concern
normative issues.
Notes
1 In his highly acclaimed book Fights, Games and Debates, Rapoport (1960) argues that,
although social physics is a good starting point, the theory of games implies a partial
critique of that model, while the notion of debate implies a more thorough critique.
Thus, social physics can yield at best only a very partial insight into social processes.
2 For an explication of the metaphoric and analogical basis of these theories, and for an
analysis of the (lack of) explanatory power of these theories in the context of explaining
the end of the cold war, see Patomäki 1992a.
3 To speak of the characteristics of an entity in this manner implies that there are relevancy
criteria, which, however, are always partially relative to the task.
4 The notion of thematising comes from Heidegger, but I have modified and re-
interpreted it. Heidegger attempts to do justice to the possibility of positivist or scientific
realist natural sciences with the help of this notion; I am using it in developing
methodological tools for the social sciences based on critical realism. See Heidegger
(1990 [1962]):414–15, and Dreyfus 1991:82–3.
5 Peace research has also been elevated by this vision. See Chapter 2 for lessons of
Alker’s development; and Patomäki 200la for a reconstructive analysis of the
development of Galtungian peace research.
6 This idea is summarised by David [Link] in the ‘Editor’s introduction’ for Gadamer
1977:xlvii. The definition of ‘horizon’ can be found in Gadamer 1975 [1965]:269.
6 The normative logic
of emancipation
Truth and social scientific explanations have normative and political implications.
In contexts in which the transformation of politics into violence is more than a
mere abstract and remote possibility, the politics of truth is far from being innocent.
Grand narratives about the end of war have justified wars and violence. For instance,
the aim of people’s wars has typically been defined, as Mao Zedong said, by a
historical movement from capitalism to socialism: ‘Once man has eliminated
capitalism, he will attain the era of perpetual peace, and there will be no more need
for war’ (Alker et al. 1989:152). This view is based on a theory of society and a
story of history as a whole. It presupposes judgements about the truth of that
theory and the story. Similarly, the USA, led by Woodrow Wilson, took part in the
intervention against the Soviet Union in 1918. In accordance with the Kantian
liberalist theory of world politics, Wilson saw the anti-democratic Bolsheviks as a
general threat to peace and security, besides rendering more difficult the defeat of
Germany in the ‘war to end all wars’. Hence, in Kantian thought, wars may seem
justified in terms of making the world more peaceful and orderly. Modern
emancipatory projects have also confronted each other on the battlefield.
‘Isn’t the most general of political problems the problem of truth?’ asks
Michel Foucault (1991b:82). Indeed, there is a close connection between politics
and truth—truth understood socio-historically as a system of ordered procedures
for the production, regulation and operation of statements (Foucault 1986
[1971]:74). Truth has effects of power. Under suitable conditions, a particular
truth can produce, via actions, parts of social reality. Truth also has the effect of
including and excluding, legitimating and de-legitimating potential or actual
components of reality, including military actions. Foucault is not alone in seeing
a very close connection between politics or its failures and truth. Any notion of
truth implies criteria, standards and measures, which have social and political
consequences. Strict standards and measures have also lost much of their
credibility, because intellectuals would like to freeze and dismantle the guillotine,
as well as other modern machines of violence stockpiled all over the fragile
planet (see, for example, Schürmann 1984:175–6). Hannah Arendt (1973:82)
articulated this sentiment three decades ago: ‘At least we can learn from Rousseau
and Robespierre that absolute goodness [truth] is hardly any less dangerous
than absolute evil [untruth and deceit].’
143
144 Explicating a critical realist methodology
Epistemological relativism
Epistemological relativism means that all beliefs are socially produced. Knowledge
is transient, and neither truth-values nor criteria of rationality exist outside geo-
historical processes. Epistemological relativism should be distinguished from
the incorrect thesis of judgemental relativism, which asserts that all beliefs (or
statements) are equally valid, in the sense that there can be no rational grounds
for preferring one to another (Bhaskar 1979:73). This distinction is necessary.
Any kind of action presupposes some criteria:
it is clear that if one is to act at all there must be grounds for preferring one
belief (about some domain) to another; and that such activity is typically
codifiable in the form of systems of rules, implicitly or explicitly followed.
(Bhaskar 1979:74)
Anything does not go. To take a simple example (from Putnam 1989:54): if
someone really believed he could fly and were foolish enough to pick a conceptual
system that confirmed this, and then acted upon it by jumping out of a window,
he would, if lucky enough to survive, see the weakness of the conceptual system
at once. Social relations are similarly real, because they have causal effects via
actions. For instance, breaking regulative rules—say by failing to complete a census
form or by hauling your boss over the coals—tends to have definite consequences.
Bhaskar (1986:24) argues that judgemental rationality depends upon the
recognition of ontological realism and epistemological relativism. Explicit denial
of this condition would lead to a theory/practice contradiction (cf. also Habermas
1979:2 and 66–8). Action presupposes reality, which is independent of beliefs of
The normative logic of emancipation 145
any particular actor, otherwise the whole idea of action would be unintelligible.
People could be jumping out of windows if they just believed that they could fly.
Yet relativism regarding explanations prevails. This is also a condition for rational
research and discussions. If there were absolutely true or self-evident claims or
statements, there would be no room for research or debates about their validity.
Moreover, because a choice between theories or explanatory models is a matter
of geo-historically situated human judgements, the choice also depends upon the
ethical responsibility of the cognitive agents concerned (Bhaskar 1986:24–5).
Duality of truth?
It is not possible to know the world except under certain particular, geo-
historically, transient descriptions, as Bhaskar (1986:99) argues. What, then, is
truth in Bhaskar’s theory? It seems to relegate the notion of correspondence
between epistemic statements and ontic objects to the status of a mere metaphor
for the goal of an adequating practice, in which ‘cognitive matter is fashioned
into a matching representation of a non-cognitive object’. However, Bhaskar
goes on to explain his notion of truth as follows:
Bhaskar maintains that his theory of truth can sustain the duality of the epistemic
and ontic aspects of truth instead of reducing these two aspects to the ‘oneness’
implicit in the (conventional) correspondence theories of truth.2 The self-
proclaimed pragmatist Richard Rorty (1987), for example, would strongly object
to Bhaskar’s views. For the pragmatist, Rorty argues, the notion of ‘truth’ as
something ‘objective’ is just a confusion between the following:
The pragmatist agrees with (1) but cannot make sense of (2). According to
Rorty, the realist tries to explain (2) with a further idea:
146 Explicating a critical realist methodology
The pragmatist, Rorty (1987:41) says, can only retort that many centuries of
attempts to explain what that correspondence is have failed. What is Bhaskar’s
position here? There is no disagreement between Rorty and Bhaskar as far as
(1) is concerned, but there is certainly disagreement in respect of (2), for this is
one of the basic claims of Bhaskar’s transcendental realism. In effect, (3) is the
‘metaphysically realist notion of truth’—truth as ‘correspondence to the facts’,
or something of that kind. Hilary Putnam (1982:4), among many others, denies
this on the grounds that we do not have notions of the ‘existence’ of things or
of the ‘truth’ of statements that are independent of our conceptual constructions,
procedures and practices. Also, Bhaskar says clearly that it is not possible to
know the world except under particular geo-historically transient descriptions.
Where, then, does the difference between Bhaskar and the critics lie?
Bhaskar is not arguing for a metaphysically realist notion of truth (3). ‘The
supposition of the possibility of a (logocentric) correspondence, if interpreted as
a logical thesis, is meaningless or at the very best unverifiable’ (Bhaskar
1979:201). Rather, he maintains that there is ‘a difference which makes a
difference between (a) “it works because it’s true” and (b) “it’s true because it
works“’ (Bhaskar 1991:7). In open systems, where invariant regularities do not
occur, only (a) makes sense. Applications require knowledge about transfactually
efficacious mechanisms or structures. Therefore Bhaskar, unlike Rorty, accepts
claim (2). Bhaskar’s argument is largely based on the presuppositions of the
natural scientists’ historical practice of experimentation:
Bhaskar argues explicitly against Rorty, in that from the argument ‘there is no
way to know a thing except under a particular description’ one cannot infer
that ‘there is no way to know that thing exists and acts independently of that
description’. In fact, one must know that the world has existed prior to science,
The normative logic of emancipation 147
and one can know that it exists and acts independently of science, because this
is a practical presupposition of science (Bhaskar 1989:152). Philosophy, as a
fallible mode of investigation of the necessary presuppositions of geo-historical
practices, can tell us that it is a condition of the possibility of scientific activities
that the real world is stratified, differentiated and structured, and that change
and emergence are possible, independently of our knowledge concerning its
specific content (Bhaskar 1979:8). Hence, the claim (2) that there is something
out there in addition to pragmatically known true statements, or in addition to
what is called ‘the truth about the world’, which is not only possible but also
necessary for scientific practices (Rorty does not reject scientific practices or
their fallible results). Therefore, we can also speak about the manner in which
‘cognitive matter is fashioned into a matching representation of a non-cognitive
object’, and we can also make judgements on this (mis-)match.
So far, so good. Bhaskar nevertheless argues that there is also an ontological
aspect of truth. In Plato etc., Bhaskar (1994:63–4) has further distinguished between
truth as ontological (‘achievable when referential detachment occurs’) and as
alethic (the non-propositional truth of or reason for objects). What are these
aspects of truth? What do they add to Bhaskar’s scientific realism? If ‘many
centuries of attempts to explain what correspondence between statement/theory
and facts is have failed’, is it possible to explain what ontological and alethic
aspects of truth are (unless they amount merely to a restatement of ontological
realism)?
According to Rom Harré, a statement using the word ‘true’ is typically
used merely (1) to refer to a statement, and (2) to pass a well-justified judgement
on it. In some contexts what matters is whether a statement ‘corresponds’ to
the objects of knowledge, while in other contexts our assessment may depend
more upon whether the statement under review coheres with other statements
(see Harré 1970:190). In both cases, intersubjective evidence and reasons for
a judgement are required. This would seem to indicate that truth is merely an
evidence-based human judgement about descriptions and explanatory
statements, concerning how the differentiated, layered, structured and changing
world really is.
However, truth must be seen as partially independent of justification here
and now: truth is something more than the actually available evidence. In the
sense of ontological realism, there are independently existing conditions for
truths, i.e. the way the world really is (or has been). Thus, concrete truth-
judgements should never be seen as total or unambiguous.3 Rather, the emergence
of being into ‘unconcealment’ is best seen as the covering up of ‘truth in its
inexhaustible fullness’ (cf. Palmer 1969:245). Because of the condition of
relativism, reality can never be reduced to our current knowledge of it. We
should accept that even when we are right—given the available evidence—in
making a judgement that something is true, there are always more things left to
be learned: perhaps a finer network of relations and deeper layers of reality;
perhaps new perspectives and conceptions that would make it possible to see
new aspects of reality; perhaps new evidence, which would allow us to revise
148 Explicating a critical realist methodology
‘if one starts making an exposition of a theory more and more precise in a definite
direction, what is gained in preciseness is lost in neutrality and comparability,
and vice versa’. However, as Thomas Kuhn (1970:202) has noticed in responding
to his critics, new languages and theoretical systems can be learned, and the
meaning of the statements belonging to a particular ‘system’ can be translated—
though only contingently and without a resort to a neutral ‘third’ language. In
this sense, human beings have the power to break through languages and
theoretical systems. Furthermore, although Naess is right that theory-comparison
and choice is problematic, it does not follow ‘that there are no good reasons for
being persuaded. Nor does it even imply that the reasons for choice are different
from those usually listed by the philosophers of science: accuracy, simplicity,
fruitfulness, and the like.’ What is suggested, however, is that ‘there is no algorithm
for theory-choice, no systemic decision procedure which, properly applied, must
lead each individual in the group to the same decision’ (Kuhn 1970:199–200).
There are also differences between ‘comprehensive views’, or ‘systems’, in the
sense that they may be more or less open to different possibilities of making a
theory/model precise. The more precise a statement is, the easier it is to assess it
against available evidence. Thus, we need a plurality of initial hypothesis drawn
from different theories; constant hermeneutic mediations between theoretical
concepts; and the reductive approach to iconic modelling. Moreover, some
comprehensive views are also capable of admitting that there are holes, silences,
contingent incommensurabilities, ambivalences and ambiguities; and that the
open texture of all discourses is, in fact, a rich source for potential development
(Bhaskar 1989:155). By admitting this much, as I think CR does, a comprehensive
discourse can remain open to change and learning. Openness and learning enable
more adequate, yet geo-historically situated, truth-judgements.
the possible emergent powers of social system; and so on. It may well be that
there are many historical episodes, practices and/or systems that can be grasped
in terms of a particular iconic model. It is also possible to form higher-order
models Mn with wider range but with lesser accuracy than the concrete models
An–1 Bn–1 Cn–1, etc., which in some relevant respects are essentially similar.
The hermeneutic assumption of relative privilege of an agent’s own account
of his or her reasons depends upon the causal efficacy of his or her practical
competencies and reasons—for otherwise there would be no reason to prefer her
account to that of an outsider. However, causal efficacy establishes an ontic
distance between the agent’s competencies and reasons and any particular
expression or statement of them. Moreover, the description of even our own
sensations, competencies, beliefs and emotions is profoundly affected by a host
of conceptual choices. The ontic distance and interpretative openness mean
that any description of the agent’s real beliefs must be corrigible. (Bhaskar
1986:164). Also, a social scientific explanation can reveal that beliefs of actors
are in some regards false or illusory. An explanatory model can explain why
they must be false or illusory. A rationalisation of action does not necessarily
correspond to the real causes of the action, nor is it necessarily able to grasp the
wider consequences of action. Yet that very rationalisation can be necessary for
the (re)production of particular social relations and practices. The stratification
model of human action and agency captures this idea.
According to this model, illustrated in Figure 6.1, agency and action are
conditioned (i.e. caused materially) by social structures, which are, dialectically,
also the continuously (re)produced outcome of human agency and actions. The
consequences of actions include thus (re)production of particular structures.
This consequence can provide motivation M(r) both for reasons r and particular
rationalisation R(r) of these reasons; motivation M(r) may also be partially
unconscious; and rationalisation R(r) may conceal, mystify or reify the
consequence, which, at this level, is likely to be unintended.
Bhaskar (1979:74) distinguishes three ways in which interference between a
subject’s interest in an object and his or her knowledge of it could operate. It
could operate consciously (as in lying); it could operate semi-consciously, as in
the wishful thinking of the incurable optimist or the special pleading of a pressure
group; or it could operate unconsciously, whether or not it may subsequently
become accessible to consciousness. The conclusions of the unconscious mode
of interference can be either rationalisations of motivation, or they can constitute
mystifications or reifications of social structure (ideologies). This is the ontological
ground for the possibility of explanatory critique. Explanatory critique is directly
connected to emancipatory political change:
the object that renders illusory (or superficial) beliefs necessary comes…, at
least in the absence of any overriding considerations, to be criticized in being
explained; so that the point now becomes, ceteris paribus, to change it.
(Bhaskar 1979:67)
152 Explicating a critical realist methodology
The force of explanatory criticism comes from the positive value of truth. In
the following, I shall explain in detail the necessary connections between
explanation and criticism, and take Bhaskar’s argument a few steps forward in
political theoretical terms.
will and power may easily lead…[even] to self-destructive learning, for they
may imply the overvaluation of the past against the present and the future,
the overvaluation of the experiences acquired in a limited environment against
the vastness of the universe around us; and the overvaluation of present
expectations against all possibilities of surprise, discovery, and change.
(Deutsch 1963:248)
The normative logic of emancipation 153
Deutsch argues that pathological learning processes often occur in the context
of fear, stress and conflict. Background discourses, which naturalise, reify and
mystify social relations, can stimulate pathological learning. Layered interests
can also be related to aspects of structures such as the occupation of social
positions (‘power’ in the conservative sense) or resources for transformative
actions (‘power’ in the radical sense of political mobilisation). Once real, whether
fully conscious or not, interests can intervene in the formation of agency by
providing motivation M(r) both for reasons r and particular rationalisation R(r)
of them. Illusion can also become institutionalised and appear for a given actor
as a natural part of assuming a position in practices and organisations.
According to Bhaskar (1979:80), one is justified in characterising a discursive
formation (DF) as ‘ideological’ if both (1) some relevant elements of DF are
false; that is, one possesses a superior explanation T1 for the phenomena, and
(2) one possesses an explanation T2 of the falsity of the beliefs of DF in question
and why they are held. Where different theories are required for the satisfaction
of (1) and (2), they must at least be consistent with each other. (1) is the critical,
(2) the explanatory condition. (1) suffices for criticism and all intellectual criticism
satisfies this condition; (2) is required if the criticism is to be explanatory and
directed at emancipatory social change (Bhaskar 1986:179). Schematically, the
logic of explanatory emancipation can be summarised as follows:
Bhaskar (1986:178) claims that his version of arg. 1 is naturalist, not moral, in
character. ‘The notion of false consciousness employed here simply involves in the
first instance that of disjuncture, mismatch or lack of correspondence (representative
adequacy) between belief and object.’ Bhaskar’s main target is the positivist fact/
value-distinction:
Might it not be objected, however, that the fact/value distinction only breaks
down in this way, because one is committed to the prior valuation that truth
154 Explicating a critical realist methodology
is a good, so that one is not deriving a value judgement from entirely factual
(natural) premises? But that truth is a good (‘ceteris paribus’) is not only a
condition of moral discourse, it is a condition of any discourse at all.
Commitment to truth and consistency apply to factual as much to value
discourse; and so cannot be seized upon as a concealed (value) premise to
rescue the autonomy of values from factual discourse, without destroying
the distinction between the two, the distinction that it is the point of the
objections to uphold.
(Bhaskar 1979:81)
In social sciences, this hesitancy may be reinforced by the fact that there are no
neutral criteria of truth to which to appeal. Epistemic criteria differ and are
dependent on the wider system to which they belong. Theories can be made
precise in many ways, and the hermeneutic movement between theoretical concepts
is always precarious. Moreover, in social sciences, subject and object tend to be
internally and externally related. These relations are also a matter of ethical concern.
Derrida’s concept of undecidability addresses these ethical problems. Undecidability
pertains that any judgement about truth or other normative issues has to assume
ethico-political responsibility. ‘A decision can only come into being in a space that
The normative logic of emancipation 155
exceeds the calculable program that would destroy all responsibility by transforming
it into a programmable effect of determinate causes’ (1988:116). For Derrida
(1978:79–153), this ethico-political responsibility is also a question of violence.
Determinate decisions about truth of statements, models and stories can be made
on rational and pragmatic grounds. However, by talking about undecidability,
Derrida (1978:83; see also 1988:112 and 150–3) calls for responsibility of decision
and non-violent ethical relationships with the others.
This explication also makes the criticised actors equal partners in a sustained
dialogue. It also counters Arendt’s (1973 [1965]:78) scepticism about politics based
on revealing sources of unconscious motivation: ‘Since [this demand to reveal
innermost motivation] actually demands the impossible, transforms all actors into
hypocrites; the moment the display of motives begins, hypocrisy begins to poison
all human relations.’ Indeed, is it possible to put forward critical arg. 1 without
destroying the trust, which is necessary for all communication? I think this is a
potential source of violence in CR, and therefore it has to be addressed explicitly.
Any social theory has to be aware of the violence inherent in its own categories,
and of the violence potentially justified in terms of its categories. Arg. 2 tackles
this problem.
What if public communicative action is not possible because of political
repression and violence, or has no success, perhaps because DF ignores criticism
The normative logic of emancipation 157
or remains deaf and closed for other reasons? Should we conclude a positive
evaluation of strategic action directed at removing or changing relevant parts of
the complex Ki? Only a conditional conclusion to this effect is possible. Rational
judgements cannot, and the consequences of rational judgements should not,
contradict the notion of good reasons and openness to further critical debate.
Hence we could continue arg. 2 as follows:
Arg. 2 (9) Either despite the public communicative actions, TDF (and
DF) remains unchanged, or critical communicative actions
are not politically possible with respect to the relevant
political situation.
Does (10) of arg. 2 exclude the use of any violent means to achieve the desired
end? I agree with Derrida (1988:112) that violence may ‘remain in fact (almost)
ineradicable’. The aim is, nonetheless, to argue for a peace culture of social
sciences and for reflective non-violence. Reflective analysis of violence and its
conditions in one’s own categories are among ‘the least violent gestures’ (Derrida
1988:112). The openness of social systems points towards the same conclusion
(10). In open systems, within which social actors possess generic powers,
qualitative changes and emergence are possible, but predictions are not. Changes
may also concern one’s own geo-historical, open-ended being. Moreover, in
open systems, unintended consequences of action are ubiquitous.
The requirement (10) can be read as a moral obligation to limit strategic action to
such modes that are conducive to preserving or building a sphere for public politics
(see Patomäki 2001a). Even in precarious and dangerous political situations, this may
be more feasible than it appears. Periods of relative stability, and widespread
reifications and mystifications of social structures, may make us forget the basic insight
of the theory of non-violent political action. Power is ‘fragile, always dependent for its
strength and existence upon a replenishment of its sources by the co-operation of a
multitude of institutions and people—co-operation which may or may not continue’
(Sharp 1984:8).
(1993:286, 297) has introduced the term ‘concrete Utopia’.7 A concrete Utopia
does not currently exist, but should be politically possible to achieve, and feasible
as an alternative way of organising social practices and relations. The realisation
of a concrete Utopia involves practical wisdom, lessons drawn from past or
contemporary models, counterfactual reasoning about the possible effects of an
altered context, as well as thought-experiments about the consequences of the
transformed practices and systems (See Sayer 2000:160–5).
Practical wisdom is an Aristotelian intellectual virtue, a state that allows an
actor who attains it to be able to ascertain what is ethico-politically good, and
then to deliberate about how best to reach that good. This calls for knowledge
about the concrete political situation, and a systematic reflection on the conditions
of action. The more difficult part may concern, however, ascertaining what
would be a better, yet feasible set of practices or parts of a system. Lessons
drawn from the past or present practices may provide ‘drafts’ or models for
transformative projects, whether successful or not (Sayer 2000:162–3). The
lack of success of a particular practice or project could have been dependent on
the particular geo-historical context in which it originally occurred. What would
have been the possible effects of a somewhat different context? Is it possible to
build components of a complex that would make something similar more feasible
in the future? Roberto Unger emphasises the action- and context-dependency
of political projects:
Some of our projects may fail and by their failure may cast a critical light on
the descriptive and explanatory assumptions that informed them. But we
can never be entirely sure that these same projects might not succeed under
slightly different circumstances or with the help of slightly different methods.
(Unger 1987b:323)
Notes
1 Galtung defines ‘cultural violence’ as those aspects of culture, the symbolic sphere of
our existence—exemplified by religion and ideology, language and art, empirical science
and formal science (logic, mathematics)—that can be used to justify or legitimise
violence. In this sense, even peace research may be a form of cultural violence. Galtung
goes on to argue ‘that a major task of peace research, and the peace movement in
general, is that never-ending search for a peace culture…’ (see Galtung 1990:291).
2 Bhaskar 1986:100–1. Consider, for instance, Popper’s theory of truth, which is largely
based on Tarski’s account of the meaning of truth. According to Popper, it is ‘decisive
to realise that knowing what truth means, or under what conditions a statement is
called true, is not the same as, and must be clearly distinguished from, possessing a
means of deciding—a “criterion” for deciding—whether a given statement is true or
false’. In this theory, the meaning of truth is that ‘an assertion, proposition, statement,
or belief, is true if, and only if, it corresponds to the facts’. However, there are no
general criteria of truth, and hence ‘science is fallible, because science is human’, and
because ‘we may err’. Nevertheless, we can say ‘that a statement “a” gets nearer to the
truth than a statement “b” if and only if its truth content has increased without an
increase in its falsity content’. Moreover, ‘it should be very clearly understood that
the idea of one statement “a” getting nearer to the truth than another statement “b” in
no way interferes with the idea that every statement is either true or false, and there
is no third possibility’, for ‘it only takes account of the fact that there may be a lot of
truth in a false statement’. Finally, although Popper argues that in principle nothing is
exempt from criticism, he also maintains that ‘at least at the moment it seems to me
unlikely that criticism of the logical theory of truth and the theory of getting nearer to
truth will succeed’ (see Popper 1971:369–96).
This is a conventional version of the correspondence theory of truth, which, like the
other versions, is untenable. The problem with Tarski’s account of the meaning of truth
is that, as far as it is seen as philosophically neutral, it is tautological, trivial and non-
informative. As such, it cannot be used as a philosophical theory of truth. However,
Popper’s theory of getting nearer to truth, which utilises Tarski’s account, is neither
neutral nor trivial. The problem is that it presupposes that the truth is already known,
for, unless the truth is already known, how would it be possible to tell something about
the truth-content of an assertion, statement, belief and the like? In other words, in fact
Popper, if he is to keep his notions of truth and ‘verisimilitude’, slips back towards
absolutism, which he is so eager to refute, at least as far as Plato, Hegel and Marx are
concerned, (see Harré 1986:41–5; Margolis 1986:89–96). Hence, the consequence is
that there is no longer any difference between the best available ‘scientific’ theory or
interpretation of the world—the one decided to be the yardstick of truth, or the description
of the ‘facts’—and the world. This is the implicit ‘oneness’ of being and knowledge of it
of all traditional accounts of the correspondence theories of truth.
3 As Galtung notes, if the logical tool used is bivalent logic with its strict line between
valid and invalid, then no interference can be made with ambiguous truth-values for
the antecedents or the interference, and ‘truth’ becomes absolutist in nature (Galtung
1988:162–75; Galtung 1990:301). That this has been seen as a limitation is also clear
from the effort to develop the language of polyvalent logic with more than two truth-
values, by adding ‘indeterminate’, ‘undefined’ or ‘possible’ truth-value(s) to the
standard two-values system (see Haack 1974: particularly 30–5). Haack argues for
162 Explicating a critical realist methodology
‘deviant logics’ on the grounds that the positivist and logicist analytic/synthetic
distinction should be rejected: logic is not about ‘pure form’, as in Kant’s view, but
rather about more or less substantial rules of thought, and ‘no factual statement is
immune from revision’. However, if the classical laws of thought used in deriving
theorems in polyvalent logic are still based on bivalent logic, and/or if the ‘third
possibilities’ are of the nature ‘not yet decided to be true or false…some of the more
plausible interpretations of the intermediate values of many-valued systems are such
that “true” and “false” remain mutually exclusive and jointly exhaustive’ (Haack
1974:61), then it is hard to see that any real exception has been made to the following
‘classical’ rules:
Many philosophers use the expressions tertium non datur and ‘principle of bivalence’
interchangeably, or take for granted that these principles are equivalent. However, it
is, in principle, possible to deny ‘bivalence’ and still hold that (~p v p) is a theorem of
the logical system. Nevertheless, if tertium non datur is a theorem of a system, and if
Tarski’s principle that ‘A’ is true if ‘A’ holds for that system, then the principle of
bivalence must also hold (see Haack 1974:67–8) (This seems to suggest, moreover,
that it has been very commonplace to assume Tarski’s principle implicitly, as a self-
evident truth). Moreover, although ‘tertium non datur’ does not exclude the possibility
that (p & ~p), it is thought, almost without exception, that ‘from a contradiction
anything is derivable, so that such a logic [which accepts (p & ~p)] would be useless
for the purpose of discriminating valid from invalid inferences’ (Haack 1974:36).
Now, if logic and mathematics are viewed as a formal bivalent game with one basic
rule, that a theorem T and its negation ~T cannot both be valid, then there may be
violent social and political consequences in the sense analysed in this chapter: truths
come to be seen as absolute and non-compromised (even though the possibility of
errors is admitted, in principle). Indeed, the praxis of scientific theory-choice and
criteria-choice requires and presupposes ‘alternative’ logical systems. Moreover, bivalent
logic is also inadequate in assessing the validity of a descriptive and picturesque model
concerning an aspect and part of the differentiated, structured and stratified reality
with its complex and ‘inexhaustible fullness’. From this point of view, it is easy to
agree with Joseph Margolis, who argues that relativism is a thesis according to which
‘it is methodologically advisable to retreat from insisting on a strong bipolar model of
truth and falsity’ (see Margolis 1986:111).
4 Modified from Giddens 1979:54; Bhaskar 1986:128.
5 These points rely on Habermas 1984 [1981]: Ch. 1. Habermas argues (1984 [1981]:
8) that ‘rationality has less to do with the possession of knowledge than with how
speaking and acting subjects acquire and use knowledge’. Further, he bases ‘the
rationality of an expression on its criticizability’ (Habermas 1984 [1981]:10). Further,
‘corresponding to the openness of rational expressions to being explained, there is,
on the side of persons who behave rationally, a willingness to expose themselves to
criticism and, if necessary, to participate properly in argumentation. In view of their
criticizability, rational expressions also admit of improvement; we can correct failed
attempts if we can successfully identify our mistakes. The concept of “grounding” is
interwoven with that of “learning”. Argumentation plays an important role in the
learning processes as well…. The medium in which these negative experiences (failed
attempts) can be productively assimilated is “theoretical discourse” that is, the form
of argumentation in which controversial truth claims are thematized’ (Habermas 1984
[1981]:18–19).
The normative logic of emancipation 163
6 In fact, the later Bhaskar seems to oscillate between the speculative illusion of a
Hegelian philosopher and the romantic, radical democrat illusion about a free,
eudaemonic society, within which free, concretely singular individuals are capable
of emancipation that stems directly from ‘an inner urge that flows universally from
the logic of elementary absence (lack, need, want or desire)’ (Bhaskar 1993:299).
Both illusions would make CR social sciences redundant and therefore also negate
the original purpose of CR, which was to ground and show a place for critical
social sciences.
7 The oscillation mentioned in note 6 applies also to Bhaskar’s version of the idea of
concrete Utopia. The following explication is therefore based more on Sayer, Unger
and my own analysis than on Bhaskar.
Part III
Visions of world
politics
7 Modelling Thucydides’
Melos episode
To explicate the research program of Realism, I begin with two classic works,
one ancient, the other modern: The Peloponnessian War, by Thucydides, and
Politics Among Nations, by Morgenthau. The three most fundamental Realist
assumptions are evident in these books: that the most important actors in
world politics are territorially organised entities (city-states or modern states);
that state behavior can be explained rationally; and that states seek power
and calculate their interests in terms of power, relative to the nature of the
international system they face…. In reconstructing state calculations,
Thucydides and Morgenthau both assume that states will act to protect their
power positions, perhaps even to the point of seeking to maximize their
167
168 Visions of world politics
Keohane refers to one of the most contested passages in The Peloponnessian War (I:
23), according to which the real cause of the Peloponnesian war was ‘the growth of
the power of Athens, and the alarm which this inspired in Lacedaemon’. This
‘made war inevitable’. This was probably written two decades after its textual
context, as a post-war afterthought. Keohane takes it as evidence of Thucydides’
neo-realism.5 On this reading, it is also obvious to assume that the Melos massacre
is an indication of a universal invariance. This is the law of the stronger: great
powers naturally impose their will; weaker powers accommodate this or are crushed.
Athens acted inevitably as any great power would do, yet the Melians did not
submit. Hence, the Melian men were killed, and the women and children enslaved.
Readings of Thucydides legitimise or criticise contemporary practices. This is
why Thucydides remains important. I shall argue that available evidence points to a
very different explanation than that offered by followers of Hume. A crucial cause of
the massacre was that Athens as a collective actor changed in the course of the war.
The Athenians’ claim ‘that it is a necessary law of nature to rule whatever one can’
(V:105) is not a statement about a Newtonian law but an attempt to justify their
imperialism and, genealogically, a causal result of the pathological processes that
restructured the opinions and actions of Athenian citizens. The development that led
to these changes is part of Thucydides’ overall explanatory and moral narrative
about the fate of Athens and, also, Hellas. The Peloponnesian war was, to use
contemporary categories, also a civil war. Thucydides seems to have viewed history
as an open process, produced by actors capable of arguing and acting otherwise. A
few passages point towards a more necessitarian story, and they suggest that the
Peloponnesian war is best conceived as a tragedy. Methodical iconic modelling can
help to explicate relevant components of the causal complex and also to reflect
systematically upon different possible temporal structures of the explanation.
A rational-choice model
As is well known, the Melos episode—which occurred during the Nicias Peace
period 416 BC, in the middle of the 27-year-long Peloponnesian war—consists
of a long dialogue and a short statement of the subsequent events. The subsequent
statement explains laconically that Melos was attacked, all adult men were
killed and other Melians were enslaved by the Athenians (Thucydides V:84–
116). Why did the massacre happen? What caused it?
At the outset, it is enough to have interesting possibilities and somewhat plausible
candidates for endorsement. The first possibility is to articulate the neo-realist
logic in terms of a rational-choice model. The Melos massacre can be seen as an
outcome of strategic choices of two state-actors, Athens and Melos. Actually, this
Modelling Thucydides’ Melos episode 169
According to Latsis, the temptations for the social scientist to employ such a
method are considerable, for it makes the application of mathematical techniques
easier—and the utilisation of mathematical tools is often equated with the quality
of being ‘scientific’. It also renders his field of study relatively autonomous and
hence immune to criticism from outside.6 Some general, regulative, philosophical
principles of this programme are (Latsis 1972:208–9):
How could we analyse the Melos episode in terms of the situational determinist
research programme? There are a few selected passages in Thucydides that are
usually taken up and presented to justify a political realist or a rational-choice
reading. For instance, ‘my habit has been to make the speakers say what was in
my opinion demanded of them by the various occasions, of course adhering as
closely as possible to the general sense of what they really said’ (I:22). This
passage is often combined with the view that Thucydides believed that brute
power politics ruled relations between states. It is assumed that Thucydides
made a distinction between relations within, and between, states. In the
170 Visions of world politics
The ‘what was called for by each situation’ must therefore be based on
calculations of available force. Consequently, rhetorical moves stem from the
requirements of the power political situation. Or as Keohane puts the idea,
‘Thucydides seeks to go beneath the surface of events to the power realities that
are fundamental to state action.’
The Athenians sent envoys to Melos in 416 BC, at the time of a temporary
peace between Athens and Sparta, in order to persuade the Melians to be
subordinated under the power of the Athenian empire. The Athenians considered
the terms of submission to be moderate and reasonable, namely ‘alliance on a
tribute-paying basis and liberty to enjoy your own property’ (V:111). They
argued, moreover, that it would be in the interests of both parties if Melos
agreed. From this perspective, it seems that the following arguments reveal the
essence of the Melian dialogue:
MELIANS: And how, pray, could it turn out as good for us to serve as for you to rule?
ATHENIANS: Because you would have the advantage of submitting before
suffering the worst, and we should gain by not destroying.
(Thucydides V:92–3)
Here, actors express their preferences about possible states of affairs in these
statements. The only options open to Athens seemed to be either to destroy
Melos or to rule it. It would be much easier and more profitable to rule than to
destroy. For Melos, it would obviously be better to surrender than to be destroyed.
The statement by the Athenians can be read first of all as a statement of facts,
but simultaneously also as an ultimatum. From these, we can derive the strategic
Modelling Thucydides’ Melos episode 171
game-tree presented in Figure 7.1. (For the notation and general technique, see,
for instance, Ordershook 1986.)
Melos faces a choice between resisting and surrendering, while Athens may,
depending on the choice of Melos, face the choice between conquering Melos
with force and letting Melos be independent. For both parlies the preference
order is (a>b>c). Had Melos had perfect information about the situation, the
only real exit option would have been to ‘surrender’. Melos did not surrender,
however. Consequently, either Melos was irrational, or the model specified is
not a good one. Thus, further complications of the model are needed (given
that the rational-choice approach is not questioned, only the specific model).
There must have been uncertainty as to Athens‘s real capacities and intentions.
It can be assumed that when Melos was forced to make its choice, it did not
know what Athens was going to choose if Melos itself chose resistance. To
indicate this, an information set is drawn into the picture. Melians may have
given a high probability (p) to the Athenians doing nothing if they themselves
just resisted. Consequently, ‘resist’ is a rational choice for Melos if:
In other words, if the expected value of ‘resist’ was higher than the value of
‘surrender’, the strategically rational choice would have been to ‘resist’. Because
this actually occurred, surely this must have been the case. The massacre is, therefore,
best explained by an unfortunate misjudgement on the side of the Melians.
The rational-choice model predicts, after the event, the actual outcome. Is this
an explanation of the Melos episode? It is always possible to construct a
172 Visions of world politics
rational-choice model that predicts any outcome that has already occurred. If
‘prediction’ in this sense is the only criterion, then anything goes—at least as far
as it is articulated in rational-choice terms. In the case of genuine predictions,
the same immunity to criticism can also be effected with a ceteris paribus clausul.7
The question is whether the rational-choice model provides us with a true
explanation. If other hypotheses or evidence is given no chance, if there is no
accuracy and depth, and if there is no illuminatory power, can there be any
real explanation either? Of all possible hypotheses, does the evidence support
only the rational-choice model? What other hypotheses could have been
explicated and assessed? Driven by the scholarly virtue of truth, should not
we test other hypotheses and check the available evidence? Have all relevant
components of the causal complex been brought to the fore; all relevant aspects
of the story been told? Are we satisfied with the depth achieved in the model?
Any episode of social interaction can be re-described as a utility-maximising
decision-making problem of multiple interdependent actors. The first question
is: Is this re-description adequate? What does this re-description explain, if
anything? When rationality is understood as learning and openness to evidence,
how ‘rational’ is the model (7.1) as an explanation? For instance, historical
reasons cast a shadow of doubt on the accuracy of applying probabilistic, utility-
based choice-theoretical terms to explaining episodes in ancient Greece. There
is also direct textual evidence in Peloponnesian Wars that contradicts the rational-
choice model. By a careful analysis of the evidence, it is possible to move
towards deeper layers of social reality and also achieve depth epistemologically.
By specifying and writing a genealogy of the relevant key components, a different
story about the Melos episode can be told.
The conception of strategic rationality was alien to the Greeks. They were
not familiar with the Humean notion of causality or calculative, utility-maximising
action, which in seventeenth-century Europe was modelled on book-keeping
and capital accounting. For ancient Greeks, causes were rather defined in terms
of possibility, potentiality and teleological nature of entities rather than in terms
of empirical invariance.8 Thus, their strategic reasoning must have been different
from the modern economistic cost—benefit analysis.
Moreover, to understand the evidence Thucydides provides about the real
causes of events and processes, it is important to grasp what Thucydides was
doing when he was writing the History. It is not at all clear that Thucydides had
any idea of writing ‘scientific’ history in a modern sense. Although he seems to
have sought, quite methodically, accuracy and true descriptions, and although
he criticised the ancient poets for making up stories, he worked within the
genre of serious arts, particularly tragedy.
possessed, in common with his contemporaries at Athens, the cast of mind induced
by an early education consisting almost exclusively in the study of poets.
(Cornford 1965 [1907]:ix–x)
Thucydides did not, for instance, make references to his sources or have his book
scrutinised by a scholarly community. He says only that his tried to check different
accounts to the extent possible, but this was difficult because ‘different eyewitnesses
give different accounts of the same events’ (I:22). More importantly, I take Francis
Cornford’s point to indicate that at the level of mimesis2 (see Chapter 5),
Thucydides worked with the poetic narrative structures available to him, given
his education. Moreover, like the authors of great tragedies, it seems that
Thucydides was writing moral philosophy as concrete lessons that he meant to
be drawn from his story, which he claimed to be true (see Farrar 1988:126).
The Melians are a colony of Lacedaemon that would not submit to the
Athenians like the other islanders, and at first remained neutral and took no
part in the struggle, but afterwards upon the Athenians using violence and
plundering their territory, assumed an attitude of open hostility. Cleomedes,
son of Lycomedes, and Tisias, son of Tisimachus, the generals, encamping
in their territory with the above armament, before doing any harm to their
land, sent envoys to negotiate. These the Melians did not bring before the
people, but made them state the object of their mission to the magistrates
and the few; upon which the Athenian envoys spoke as follows.
(Thucydides V:84)
In Table 7.1, the first seven substantial moves of the Melian dialogue are
reproduced and in some cases summarised. For a brief introduction to Nicholas
Rescher’s (1977) formalised dialectics, see Figure 4.1 and the discussion thereof.
The basic idea is that the proponent takes the initiative and bears the burden of
proof. The opponent has three or four possible countermoves, including cautious
denial, provisoed denial and weak distinction or exception. The Melian dialogue
can be seen as a dialectical disputation—initiated by the Athenians—about whether
it is in the interest of the Melians to submit to the Athenian empire. In Table
7.2, the seven substantial moves of the Melos dialogue are presented formally
by using this notation. Flexible Rescherian rules of rational argumentation seem
to fit to this political debate quite well, even though the survival of the other
party is at stake. However, this formalisation is in itself nothing more than a
systematic reconstruction of the argumentation. Could the dialogue also help to
reveal deeper layers of the reality and truth about the Melos massacre?
The dialogue begins with what a situational determinist would consider as
the essence of the whole episode (moves 1–3). However, this is only the surface
level. The preferences and the prima-facie cases are stated here, but the reasons
for these (dis)positions are not yet given. Moreover, as the debate moves towards
deeper discursive levels, relevant components of the causal complex are disclosed,
such as partially shared practices, relevant actors and the dynamics of their
positioning. The debate also indicates that there are two different, yet equally
constitutive, perspectives about reality, the Athenian and the Melian. Reality is
not the same thing as the Athenian interpretation of it. Rather this reality involves
Modelling Thucydides’ Melos episode 175
at least two different and partially contradictory interpretations (and soon more
differences will be revealed).
Move 4 shifts the debate to qualitatively novel levels. The Melians ask:
‘Why could we not remain neutrals, friends instead of enemies, but allies of
neither side?’ The Athenian envoys answer in move 5: ‘Your hostility cannot
hurt us, but your friendship will be an argument to our subjects of our weakness,
and your enmity of our power.’ Finally, in their move 7, the Athenians make
public a number of still deeper theories and rules:
1 A constitutive rule, which establishes Athenians as rulers: Athenians define and interpret
themselves essentially as sea rulers and seem to think that others do so as well
176 Visions of world politics
(‘we rule the sea and you are islanders’); this establishes an internal relation
between Athenians and all islanders.
2 A ‘realpolitik’ view about the dangers of any signs of Athenian ‘weakness’ as an imperial
ruler, coupled with the claim that morality is irrelevant: [Athens’s subjects think that]
‘those still independent do so because they are strong, and if we fail to attack
them it is because we are afraid’; [others, and subjects in particular] ‘see no
difference between right and wrong’; [it is therefore particularly important
that] ‘you should not escape’.
(1) and (2) explain the Athenian position, as well as their original claim. However,
to explain the Melian position in a similar way, it is necessary to analyse the
Melian dialogue further. Graphical argumentation analysis techniques have a
number of advantages over Rescherian formalisation. First, a graphical tableau
is relatively easy to read, because the natural language expressions are preserved.
Second, graphical techniques make it possible both to analyse the relations of
substantiation, grounding and criticism, and to present them in a visually
illustrative manner. Further, it is also possible to indicate how actors often make
each others’ implicit assumptions explicit (or at least claim to do that). In Figure
7.2, the rest of the Melian dialogue is presented from move 5 onwards.
There is a systematic pattern in this argumentation. Arguments M6, M10 and
M12 are all about equity, courage, honour, bravery and justice. M12 also supports
M10 while simultaneously attacking A11. M10 (‘does not this mean that you are
strengthening your enemies?’), in turn, partially supports MM10 (the Melian
statement that ‘we who are still free would show ourselves great cowards and
weaklings if we failed to face everything that comes rather than submit to
slavery’). This is because M10 indicates that other people, too, can act out of
ethical considerations. This gives reasons to believe that the Athenians are taking
a great risk by acting so aggressively. This claim contradicts the Athenian realpolitik
view about the dangers of being ‘weak’ ((2), above). Evidently, the crux of
Thucydides’ story is that, in the end, the Melians proved to be right. Athens’s
aggressive imperial actions had adverse, counterfinal consequences and
contributed to its final defeat.
Athenians never denied that the Melians were right, in principle, about
matters of justice and honour. Their arguments concerned whether there was
space for questions of right and wrong. ‘You should be sensible, this is not
Figure 7.2 A graphical presentation of the rest of the Melian dialogue
Sources: Thucydides 1900:V:84–116; cf. Alker 1988b.
178 Visions of world politics
So far as the favour of gods is concerned, we think we have as much right to that
as you have…. Our opinion of the gods and our knowledge of men lead us to
conclude that it is a general and necessary law of nature to rule whatever one can.
This is not a law that we made ourselves, nor were we first to act upon it when it
was made. We found it already in existence, and we shall leave it to exist forever
among those who come after us. We are merely acting in accordance with it, and we
know that you or anybody else with the same power as ours would be acting in
precisely the same way.
(Thucydides V:105; my emphasis)
The Athenians acted upon a realpolitik theory (2). Now we can see that this theory
included naturalisation (reification) of social worlds: ‘it is a general and necessary
law of nature to rule whatever one can’. This alleged law is by no means Newtonian.
It is nonetheless necessitarian and used explicitly to justify actions. The Athenians
think that this ‘natural law’ makes their actions right, also in the eyes of gods.10
The Athenians’ realpolitik theory is a set of rules of interpretation and judgement
that they followed. At that time, it was a deep element of the Greek social reality.
Whereas the Melians were making ethical and moral arguments, the Athenians
tried to persuade them that this was not sensible, that there is not even a slightest
sign of hope that these considerations would help them. However, the Melians,
like many others, never accepted this claim. The Melians were ‘not prepared to
give up in a short moment the liberty which our city has enjoyed from its
foundation for 700 years’ (Thucydides V:112).
In the light of this analysis of evidence, what were the causes of the occupation and
massacre? Two at least tentatively plausible counterfactual possibilities can be
constructed:
CF1 is the onlys counterfactual possibility that can be seen from the perspective
of political realism and the rational-choice model. Political realism is compatible
with the Athenian view. Political realism sees it as the only possibility and
therefore contributes to legitimising, long after the event, the Athenian imperialism
and acts of violence. Many have seen the second counterfactual (CF2) as more
important, either for prudential or moral reasons. However, there are multiple
contradictory perspectives in reality, and all of them are real and also products
of particular geo-historical processes. It is my argument that from a causal
explanation point of view, CF2 is decisive.
According to Thucydides’ narrative about the Peloponnesian war, the Athenians
acted quite differently earlier in the war. Pericles—the famous leader of the golden time
of the Athenian rule—warned, for reasons of honour and prudence, against adopting
a policy of expansionist imperialism and aggression. Hence, a different kind of
Athenian policy had already been actual, not only a possibility. Moreover, even at the
time of the Melos episode, in 416 BC, Athenian imperial expansionism was contested
within Athens.
Thucydides distinguished between two types of causes, between aitia and prophasis.
Aitia refers to the rationalisation of action, and prophasis to the underlying causes,
to the causes which are ‘behind’ or ‘under’ the level of rationalisations and
concrete events (see, for instance, Edmunds 1975:172–3). For political realists,
this means that ‘Thucydides seeks to go beneath the surface of events to the
power realities that are fundamental to state action’ (as Keohane put it). However,
there are other possibilities. For a critical realist, there are a number of things in
social reality that do not belong to the level of aitia, without there being any
fixed set of empirical invariances, which would determine the aitia.
From a CR perspective, the ‘going beyond’ of rationalisation can be done at
least in three directions. The first possibility is to move deeper into the discursive
formations and meanings, possibly towards explicating the implied internal
relations between social entities, possibly by moving analytically towards
unconscious levels (e.g. by analysing the mythical structuration of meanings,
perhaps in relation to the interplay between partially unintended consequences
of action and real reasons for a particular rationalisation). The second possibility
is to explain the concrete possibilities open to positioned actors—particularly the
possibilities taken seriously by the actors. These can be analysed in terms of the
relevant rule-constituted and rule-governed contexts of positioned practices and
by writing genealogies of the key elements of these contexts. The third possibility
is to analyse causal or existential, often unintended, consequences of action,
such as effects of power; the—often coincidental—timing of the occurrence of
these consequences; as well as the external relations of systemic
(inter)dependence, which may give rise to emergent powers and properties, and
which are (re)produced in the context of unintended consequences of action.
180 Visions of world politics
It was the Spartans who first began to dress simply and in accordance with
our modern taste, with the rich leading a life that was as much as possible
like the life of the ordinary people. They, too, were the first to play games
naked, to take off their clothes openly, and to rub themselves down with
olive oil after their exercise. In ancient times even at the Olympic Games the
athletes used to wear coverings for their loins, and indeed this practice was
Modelling Thucydides’ Melos episode 181
still in existence not very many years ago…. Indeed, one could point to a
number of other instances where the manners of the ancient Hellenic world
are very similar to the manners of foreigners today.
(Thucydides I:6)
The Olympic Games were an institution that brought the whole Hellenic world
regularly together and reinforced shared assumptions of Hellenic identity and
common ‘modern’ manners. From the Trojan War onwards, Hellas had also
been united many times against external enemies. Besides external enemies,
Hellenic co-operation was based on the partially shared linguistic, cultural and
political identity (e.g. Thucydides I:23 distinguishes between Hellenic and foreign
armies). Very importantly, the Hellenic unity was also based on pacification of
land and sea, which enabled trade and accumulation of wealth (Thucydides
I:4–9). According to Thucydides’ genealogy, ‘as Hellas became more powerful
182 Visions of world politics
and as the importance of acquiring money became more and more evident,
tyrannies were established in nearly all the cities, revenues increased, shipbuilding
flourished, and ambition turned to sea-power’ (Thucydides I:13). After the
Spartans had ‘put down tyranny in the rest of Greece’, the final obstacle to the
rise of Hellas was removed (Thucydides I:18). Soon after the Persian wars,
however, Hellas was divided into two competing camps:
‘Not many years after the end of tyrannies in Hellas the battle of Marathon
was fought between the Persians and the Athenians. Ten years later the foreign
enemy returned with his vast armada for the conquest of Hellas, and at this moment
of peril the Spartans, since they were the leading power, were in command of
the allied Hellenic forces. In face of the invasion the Athenians decided to abandon
their city; they broke up their homes, took to their ships, and became a
people of sailors. It was by a common effort that the foreign invasion was
repelled; but not longer afterwards the Hellenes…split into two divisions, one
group following Athens and the other Sparta.
(Thucydides I:18; my emphasis)
By the time of the Melian dialogue, the Athenians were losing the legitimacy of
their leadership. They had originally acquired this leadership in the Persian
wars. Ethical notions (mutually agreed regulative rules) played a decreasing
role. The Athenians attempted to replace legitimate leadership with brute force.
Party strife and inter-polis violence were thus making the shared practices and
common Greek identity less important in determining the course of actions.
However, the Nician peace treaty between Athens and Sparta, which was
beginning to break down in 416 BC, included rules regulating aggression between
the camps. The most important constitutive rule, depicted in Table 7.3, was the
self-definition of Athens as the sea ruler. Trade and imperial fees gave Athens
many resources; it was considered the greatest Hellenic power economically and
militarily. Moreover, at this moment, it was neither engaged in war with Sparta
(the Lacedaemonians), nor was its effective transformative capacity yet being
weakened due to the growing lack of legitimacy.
In Melos, the Athenian envoys represented Athenian citizenry. The constitutive
rule they acted upon was ‘we are Athenians’. The decision to adopt a policy
towards Melos was made in Athens. The envoys were normatively tied to the
given instructions. Nevertheless, they must have had an amount of autonomy,
also because of long communication distances. They were also committed to
the rules of argumentation. Debates, which in most poleis were usually public,
organised political life throughout Hellas. However, Melians pointed out that
the Athenian envoys may not been following the rules of dialogue:
MELIANS: No one can object to each of us putting forward our own views in a
calm atmosphere. That is perfectly reasonable. What is scarcely consistent
with such a proposal is the present threat, indeed, the certainty of you making
Modelling Thucydides’ Melos episode 183
war on us. We see that you have come prepared to judge the argument
yourselves, and that the likely end of it will be either war, if we prove that we
are in the right, and so refuse to surrender, or else slavery.
ATHENIANS: If you are going to spend the time in enumerating your suspicions
about the future, or if you have met here for any other reason except to look
the facts in the face and on the basis of these facts to consider how you can
save your city from destruction, there is no point in our going on with the
discussion. If, however, you will do as we suggest, then we will speak on.
(Thucydides V:85–6; my emphasis)
Melians relied on the logic of normative and expressive action. They made points
about the requirements of equity and justice. However, Athenians were not
interested in these requirements; Melos was just one of the small islands with a
scarce population, oligarchic governance and limited military facilities. Melians
began to make expressive arguments: they are free and independent; and if they
have any sense of honour and bravery they are not going to submit, for it would
change the very nature of what they are. No matter what the consequences, they
wanted to stay independent, with great honour. In Hellas, the notion of honour
was particularly important (see Donnelly 1991: especially 32–7). This makes it
understandable why the Athenians were compelled to answer to the Melians in a
manner that implied the acceptance of the relevance of honour: ‘You will see that
there is nothing disgraceful in giving way to the greatest city in Hellas when she
is offering you such reasonable terms’. The Melians did not agree.
In the last instance, when a reference to heaven and fortune had proved futile,
the Melians pointed out that Sparta (the Lacedaemonians) might come to help
them. Melos did not belong to the Lacedaemonian alliance, and the Athenian army
was already encamped on the soil of Melos. The Spartans had the reputation of
acting slowly and cautiously, and their actions were restricted by the normative
requirements of the Nician Peace. The Athenian argument that ‘where danger is
concerned the Spartans are not, as a rule, very venturesome’ must have thus seemed
quite plausible. In ancient strategic terms, it is unlikely that the Melians made any
misjudgement about their hopeless situation (cf. the rational-choice model of Figure
7.1). It was for reasons of identity-constituting independence, courage and honour
that they refused to submit to the rule of the Athenian empire.
The Athenians initiated the Melian episode and also committed the massacre and
enslavement. Causally, the Athenians were responsible for the outcome. Although
the Melians could have acted differently, from a causal explanation point of view,
it is the aggressive Athenian imperialism that should be explained. In accordance
with CF2, it is possible to isolate from the causal complex the Athenian
aggressiveness as a particularly relevant INUS-component, and write its genealogy.
184 Visions of world politics
For Pericles had said that Athens would be victorious if she bided her time
and took care of her navy, if she avoided trying to add to the empire during the course
of war, and if she did nothing to risk the safety of the city itself. But his
successors did exactly the opposite, and in other matters which apparently
had no connection with the war, private ambition and private profit led to policies
which were bad both for the Athenians themselves and for their allies. Such
policies, when successful, only brought credit and advantage to individuals,
and when they failed, the whole war potential of the state was impaired.
(Thucydides II:65; my emphasis)
Had Athens continued to ‘avoid trying to add to the empire’, or to observe the
principles or justice and honour, no Melos episode could have occurred. Why
did the policy of Athens change? There must be two aspects to this explanation.
First, at the collective level, the adoption of a particular policy can be explained
by giving an account of the decisions in terms of debates and voting in Athens.
These were conducted in accordance with certain agreed procedures. Second, a
change in the structure of the ‘seriously taken’ discourse can be explained in
terms of learning, power and interests in a changing context of action.
In the fourth year of the war, the Peloponnesians (Sparta and confederates)
invaded Attica (Athens and its environment). Immediately after this invasion, all
of Lesbos (except Methymna), led by the Mitylenes, revolted against the Athenians
and tried to ally with the Peloponnesians. After long and bitter military struggles,
the Mitylenes finally surrendered. ‘In the fury of the moment’, as Thucydides
puts it, the Athenians were ‘determined to put to death not only the prisoners at
Athens, but the whole adult male population of Mitylene, and to make slaves of
the women and children’ (Thucydides III:36). However, the next day ‘brought
repentance with it and reflection on the horrid cruelty of a decree, which
condemned a whole city to the fate merited only by the guilty’ (Thucydides
III:36). The case was re-opened for debate and vote. Cleon spoke for the original
decision, arguing in a manner similar to that of Athenian envoys in Melos:
Fears or plots being unknown to you in your daily relations with each other,
you feel just the same with regard to your allies, and never reflect that the
Modelling Thucydides’ Melos episode 185
Cleon’s interpretation of the Athenian situation and possibilities was not accepted,
at least not by the majority of citizens. Diodotus, representing a Periclean point
of view, argued that the ‘two things most opposed to good counsel are haste
and passion; haste usually goes hand in hand with folly, passion with coarseness
and narrowness of mind’ (Thucydides III:42). Diodotus also protested against
using fear and intimidation as rhetorical devices. Instead, in his view, fair and
calm arguments were needed. Diodotus shared the opinion that the Mitylenians
were guilty of an unjust revolt. He argued, however, that the Cleonian policy
would be catastrophic in the long run. Many citizens accepted this argument.
In a nearly equal vote, the Athenians decided to save the majority of Mitylenians.
During the years following the Mitylenian revolt, the Cleonian perspective
became predominant in Athens. Athens as the ‘sea ruler’ was becoming increasingly
expansionist. A quick look at Cleon’s arguments can help to explain how and
why this occurred. In Cleon’s view, the world outside the city of Athens was full
of ‘fears’ and ‘plots’. All concessions and compassions were ‘full of danger’ to
Athenians. All Athens’s subjects were ‘disaffected conspirators’. Even partially
independent states like Mitylene were potential aggressors and allies of the enemy,
the Peloponnesians. Thus, Cleon concluded, ‘obedience is ensured by force and
fear, not by loyalty’. Gradually, the Athenians came to believe that Cleon was
right. It seems that continuous strife, fear and violence made Cleon’s points more
plausible. According to Thucydides, ambitious individual leaders—or leader-
candidates—also used these fears as a rhetorical (populist) power resource. The
result was the ‘realpotititk’ theory ((2), above), according to which ‘right is only in
question between equals in power, while the strong do what they can and the
weak suffer what they must’ (Thucydides V:89).
Power was at the core of this change. In its more narrow sense, power as
domination can be taken to mean, as Karl Deutsch (1963) does, that one does
not have to give in, because one has facilities and capabilities to force others to
186 Visions of world politics
comply. Power in this narrow sense is the priority of output over intake, the
ability to talk instead of listening. In a sense, it is also the ability to afford not
to learn and change what one is. ‘When carried to extremes, such narrow
power becomes blind, and the person or organization becomes insensitive to
the present, and is driven, like bullet or tornado, wholly by its past’ (Deutsch
1963:111). This kind of actor has a will in a sense that comes quite near to the
Nietzschean sense of a will-to-power.
The experiences of the long war and the plague caused the Athenians,
metaphorically speaking, to auto-program themselves to adopt the will of a ‘sea
power’, and to attempt to have an ability to speak and enforce their will but not
to listen. This was also because they did not trust others (including their subjects)
any more. According to this Cleonian perspective, if weaker parties are not
willing to listen to and to obey, it is necessary not only to force them to do so
but also to ‘teach others by a striking example that the penalty of rebellion [and
disobedience] is death’. This was the logic of the aggressive Athenian imperialism,
which eventually proved counterproductive and led to the defeat of Athens.
Thucydides thus tells a story, according to which the Melos massacre was not
necessary. With a wiser, Periclean policy Athens could have refrained from
imperialist expansionism as well from the policy of teaching lessons by means
of the threat of death to others (CF2). However, there are also traits of
necessitarianism in Thucydides’ History. In one of the most contested passages
of the History, Thucydides argues that ‘the real cause of the [Peloponnesian war
was…[t]he growth of the power of Athens, and the alarm which this inspired in
Lacedaemon’. These, he argues, ‘made war inevitable’ (Thucydides I:23).
Thucydides himself was an Athenian general exiled after his strategic failure
in 424 BC, in the seventh year of the war. Thucydides was committed to the
Athenian position of the early war and he makes it quite clear that he agreed
with the policy line of Pericles. Afterwards he was in a position of a critical,
cosmopolitan outsider. He also communicated with the representatives of the
former enemy. For Thucydides, the Athenian policy possibilities ranged from
the prudent statemenship of Pericles to the aggressive excesses of Cleon and
others. (Roughly the same set of policy possibilities existed on the Spartan
side.) If this horizon of possibilities is combined with Thucydides’ proto-
sociological theory of the growth of the population, trade, navy and alliance
revenues of Athens,15 the war really was ‘inevitable’ because no counterfactuals
can be seen from this horizon.
As Thucydides describes it, the Peloponnesians were worried about the growth
of Athenian power. Many believed that most of Hellas would sympathise with
those trying to dissolve the Athenian empire, which had developed out of the
Delian League (an alliance against Persia). Before the final outbreak of the war,
Modelling Thucydides’ Melos episode 187
‘both the Athenians and the Peloponnesians had already grounds of complaint
against each other’ (Thucydides I:66). In this situation, there were debates in
both camps about what to do. In Sparta, most shared the view that Athens was
already acting aggressively and oppressing large parts of Hellas. Therefore, war
should be declared without delay. Even the Athenians present in Sparta at that
time did not deny that there was some truth in these claims:
We did not gain this empire by force. It came to us at a time when you were
willing to fight on to the end against the Persians. At this time our allies came
to us of their own accord and begged us to lead them. It was the actual course
of events which first compelled us to increase our power to its present extent….
So it is with us. We have done nothing extraordinary, nothing contrary to
human nature in accepting an empire when it was offered to us and then in
refusing to give it up. Three very powerful motives prevent us from doing so—
security, honour, and self-interest. And we were not the first to act this way. Far
from it. It has always been a rule that the weak should be subject to the strong;
and, besides, we consider that we are worthy of our power.
(Thucydides I:75–6)
The main arguments of these Athenians were that they have done nothing
‘contrary to human nature’, and that they ‘are worthy of their power’. In Sparta,
the viewpoint of the King Archimadus prevailed. Accordingly, Sparta sent a
mission to Athens to complain about the grievances and continued war
preparations but did not (yet) declare war. It is noteworthy that Thucydides
does not present a full debate with different points of view about Athens’s
decision. Thucydides admits that ‘many speakers came forward and opinions
were expressed on both sides, some maintaining that war was necessary and others
saying that the Megarian degree should be revoked and should not be allowed
to stand in the way of peace’ (I:139; my emphasis). Yet, only the speech of
Pericles can be found in the History. In his speech, Pericles supported the policy
of ‘no concessions to Lacedaemonians’:
that you only gave in on this point through fear. But if you take a firm stand
you will make it clear to them that they have to treat you properly as equals.
(Thucydides I:140)
In ancient Hellas, it was common to think that some individuals and some
communities, by their moral qualities, were entitled to positions of leadership
and power. However, power is dangerous and corrupting, and in the wrong
hands it quickly leads to immoral behaviour, and then to civil strife, unjust war
and destruction. The development of Athens, as re-configured by Thucydides,
can be seen in these poetic terms.
There are also elements of tragedy in his text. For instance, the tragedy of
the Melos episode is that both the Athenians and the Melians were right. In the
short run, the Athenians were right: the Melians had no hope unless they
submitted, because ‘the strong do what they have power to do and the weak
suffer what they must’. Nonetheless, Melos did not submit, and they suffered.
In the long run, the Melians were right: by trying to replace moral legitimacy
with brute force, the Athenians eventually turned most of Hellas (and later also
Persia) against them. The result was the total defeat of Athens. This spectre of
suffering notwithstanding, Athens was set on a policy that was expansionist,
aggressive and harsh.
Speeches, dialogues and debates occupy a central place in the History, as in a
theatre drama. Any adequate reading of Thucydides should take into account
the drama-like structure and moralism of the History. In his Poetics, Aristotle
stated that every tragedy consists of a problem, which is presented in the
Modelling Thucydides’ Melos episode 189
beginning of the spectacle, and the rest of the drama is about the settlement of
the problem (Aristoteles 1982 [335–22 BC]: section 18). In Thucydides’ work,
the ‘Great War’ is the problem, set out in the beginning. The rest is about the
settlement of this problem. In the History, there are a number of turnings
(peripeteias) of events, perhaps the most important of them being the internal
development of Athens and its consequences. In Aristotle’s view, tragedy is an
imitation of worthy action. Tragedy does not imitate by telling the narrative in
the voice of a third person but through the settings of the people active in the
tragedy There are two ‘natural causes’ of action: the ways of thinking and the
characters of actors, which are both expressed in action. Furthermore, Aristotle
equates thinking with the ‘capability of expressing with words that which is
possible and is called by the situation’ (or occasion). This capability belongs to
the sphere of politics and rhetoric (Aristoteles 1982 [335–22 BC]: section 6).
From this perspective, Thucydides’ method becomes quite understandable: ‘My
habit has been to make the speakers say what was in my opinion demanded of
them by the various occasions, of course adhering as closely as possible to the
general sense of what they really said.’ Thucydides aimed at accuracy in his
descriptions, but he was forced to reconstruct the speeches in accordance with his
knowledge of the characters of the participants. He was also structuring—particularly
during the final decade of the war—his narrative as a tragic drama, thereby trying
to make sense of the whole. The temporal structure of a moralist strategy can be
consonant with writing CR genealogies of causal components and with the scepticism
about closed schemes of history. The History also provides a narrative that fulfils all
three stages of mimesis, it aims at iconic augmentation of future practices:
It will be enough for me, however, if these words of mine are judged useful
by those who want to understand clearly the events which happened in the
past and which (human nature being what it is) will, at some time or other
and in much the same ways, be repeated in the future. My work is not a
piece of writing designed to meet the taste of an immediate public, but was
done to last for ever.
(Thucydides I:22)
Conclusion
Notes
1 When I refer to this text, I stick to the standard way of first giving the number of the
‘book’ (or part) of the text (there are eight of them altogether), and then the section.
Because of my inability to read Greek, I have had to content myself with English
translations. I have used particularly the translations by Richard Crawley (1952) and
Rex Warner (1954), but in some points I have compared these translations to the first
English translation by Thomas Hobbes (1628) and to the Finnish translation by J.A.
Hollo (1964), which have enriched my reading of many of the key passages.
2 Within classical scholarship there have been extensive debates about different
interpretations of Thucydides. The naïve rationalist interpretation, according to which
Thucydides followed the canons of Newtonian science and by careful observations
determined laws of history that, in turn, would make possible predictions in the
contemporary world, prevailed in the USA during the 1950s and 1960s, but has been
since then challenged and rejected (see Connor 1985:3–19). Alker was the first
international relations (IR) theorist to become interested in critical readings of
Thucydides. His paper was first presented in a conference in 1980 and finally published
in 1988 in the American Political Science Review, it was re-published, with amendments, in
Alker 1996. Others became interested in re-readings of Thucydides soon after Alker.
See in particular Garst (1989:3–27); Donnelly (1991); Bagby (1994); Kokaz (2001);
and Bedford and Workman (2001). It is perhaps worth noting that Machiavelli has
also become an object of similar reinterpretations by Walker (1993:26–49).
3 Thucydides is not the only possible source of evidence, but clearly without Thucydides’
readily available account, our knowledge about the Peloponnesian wars would be
very scant. Classical scholarship has nonetheless been able to piece together, for
instance, a picture of the intellectual setting of the Peloponnessian Wars and the class
structure of Athens at the time of the wars. Mostly, I will confine myself to the account
of Thucydides only.
4 Kant does not give his source; the relevant passage of Hume’s ‘On the populousness
of ancient nations’ is cited in Connor (1985:20).
5 For a similarly fetishist neo-realist statement on Thucydides, see Gilpin (1984:287–
304).
6 It is quite alarming that rational-choice theorists have the confidence to declare that
their aim is to avoid external criticism! However, it is in fact possible, up to an extent,
to relax the aim of finding a unique ‘exit’ without moving out of the situational
determinist programme. Is this relevant? In IR most discursive practices are simplified;
most sophisticated applications of rational choice are typically only of highly specific
academic interest. Moreover, although it is possible to relax one assumption, or a few
assumptions, in time, and then to develop a new strand of research around those
qualifications; and although new research can be opened towards different directions;
this does not, in general, change the core assumptions. The core assumptions unite the
different strands, and they are also taught in the basic courses and textbooks. Critical
debates about a research programme should be concerned with the core assumptions,
not so much with the peculiarities of a particular strand of the large and well-funded
programme. Besides, most modification of the assumptions of rational-choice theory
are modest in most respects. They tend to respond only to internal criticisms, not to
external ones. Even when the possibility of legitimate external criticism is admitted, it
is not taken seriously.
7 If the model fails to predict, as it usually does, it is assumed that the problem lies in
the false ceteris paribus assumptions. If it is, quite surprisingly, able to predict, it is seen
as correct, even though there could have been many other, different models capable
of doing the same thing.
8 At least, this was the view of Aristotle, and in any case the Galileo-Copernicus-Bacon
192 Visions of world politics
view was very revolutionary in its own time. No such accounts are known to have
existed before. About the Aristotelian notion of causality, see any philosophy textbook.
9 The 1996 version does not include this important quote; it seems to have been a
victim of length-control in the context of the tendency towards ever-expanding richness
of quotes and exploration of possible research paths, which has become ever more
evident in the 1996 version.
10 This English translation fails to capture the argument that the gods rule, according to
the Athenians, whenever they can. Also, gods are hierarchical rulers (masters), and
thus would not condemn the Athenians. See Thukydides (1964:V:105), the second
sentence).
11 Referring to an earlier version of this iconic model, Alker (1996:44) talks about
Patomäki’s ‘most ambitious misspecification [which] treats the entire Melos episode as
iconically modelled history’ [my emphasis]. I hope this version is a more adequate
specification of the major components of the relevant causal complex.
12 The term ‘country’ does not appear in the Finnish translation; it only speaks of Hellas;
see Thukydides (1995 [1964]:I:2, the first sentence).
13 There are hermeneutic risks of misunderstanding in applying these concepts to the
ancient Greek world (even if the mode of teleological/strategic action is partially re-
interpreted to fit with the ancient Greek contexts). Nevertheless, it seems that no
great injustice is done to the meanings expressed in the Melian dialogue if they are
reconstructed along these lines. On the other hand, this sense may well be due to my
inability to imagine alternative frameworks of interpretation.
14 This is how Athens developed in the first place, according to Thucydides (I:2). War
and disturbances drove people to Attica, which, as a land of poor soil, was relatively
free of violence and robbery. Immigrants could become citizens in Athens. Much
later, during the Peloponnesian war, Thucydides himself had to go in exile from
Athens in 424 or 423, but he never lost his revenues from the silver and gold mines in
Trachia. Alcibiades fled to Sparta in 414 and immediately assumed an important
position there. Although death and enslavement may have been considered as threats
to those outside their poleis, as David Sylvan (1991a) maintains, many well-endowed
Hellenes could in fact travel freely and often assume a new citizenship without great
difficulties, even during the war (see, for example, Thucydides V:4).
15 For Thucydides’ account of this, see (I:6–19; I:89–117). It is interesting that this part
of the narrative could be easily interpreted in terms of the lateral-pressure theory.
This theory is presented in Choucri and North (1975), and Ashley (1980). For a
general account, largely based on Thucydides, of the development of the Athenian
empire, see Meiggs (1972).
16 Compare this to the imperialistic logic of game theory, according to which—and from
the point of view of an iterated ‘prisoner’s dilemma’ in particular—there is a universal
need to make commitments and threats credible and to establish a ‘hard reputation’
by realising the threats every once in awhile. This may, then, require imperialist
operations such as the Vietnam War. See, for example, Axelrod 1984:154.
8 A global security community
This view has been reinforced by the three world wars—including the Cold
War—that spanned the period from 1914 to 1989. International war, actual and
potential, occupied a central role in the world history of the twentieth century.
Yet, empirical evidence of large-scale violence and actual wars, particularly since
the Second World War, does not seem to support either (Pr. 1) or the conclusion.
Most political violence, and most wars, have concerned the ‘inside’ of states
rather than been wars between sovereign states. International wars have in fact
been relatively rare. Only about a tenth of armed conflicts have been inter-state
wars (see Alker 1989:21–4; Rummel 1995; Wallensteen and Sollenberg 2000).
Neither is a peaceful system associated exclusively with the confines of the
more stable states. The ‘security dilemma’, as conventionally understood, seems
to be a problem only in relations between some states. Many groupings of states
(regions) have rather developed into a community within which the ‘security
dilemma’ appears irrelevant. The democratic peace hypothesis, according to
193
194 Visions of world politics
which liberal-democratic states do not fight each other, covers some but not all
of these actual and possible security communities (see Adler and Barnett 1998a).
Karl Deutsch and his associates (1957) introduced the notion of security
community. They argued—at the height of the Cold War—that the existence of the
state is not a necessary or a sufficient condition for peace, nor is the non-
existence of the state a necessary or a sufficient condition for the prevalence of
the acute threat of political violence. These connections are contingent. The
imposition of a common government, with its capability of violent enforcement
of norms, may well decrease rather than increase the chances of peace. Besides
claiming that there is no axiomatic relationship between the absence of state
machinery of violence (or anything equivalent), and the condition of war, Deutsch
and his associates tried to disprove Arg. (a) as an empirical hypothesis. (For an
excellent summary of this innovative work, see Lijphart 1981.)
A group of eight scholars—six historians and two political scientists—studied
systematically the past experience of Germany, the Habsburg Empire, Italy,
Norway—Sweden, Switzerland, the UK and the USA, ‘to learn what history
might tell us about the problem’ (Deutsch et al. 1957:vii).1 Recently, this work
has been extended to cover the latter half of the twentieth century and, most
importantly, the world outside the North Atlantic area (Adler and Barnett 1998a).
In developing alternative and more processual hypotheses about the sources of
peace, Deutsch et al. (1957:5) distinguished between an amalgamated and pluralist
security community. Deutsch conceived both as political communities,
characterised by a process of political communication and the existence of some
shared rules and practices:
A security community is one in which there is a real assurance that the members
of that scommunity will not fight each other physically, but will settle their
disputes and conflicts in some peaceful way. If the entire globe were an integrated
security community, wars and large-scale political violence would have been
A global security community 195
and rule making. Public disagreements require conscious public airing. Politics
and violence both fall within the area of conscious, purposeful conflicts. Violence,
and threats of violence, tend to emerge with the rise of stubborn wills. In the
public sphere of politics, identities and interests are transformable and new processes
are initiated. Politics in this sense is non-violent; violence inevitably destroys the
preconditions of politics (Arendt 1970:54–6). Wæver (1996:127) says the same
thing when he notes that ‘for politics to emerge—not necessarily “democracy”,
but the precondition for it—we need to eliminate the logic of war’. He does not
speak generally of political violence but suggests, however, that ‘politics in this
sense demands the exclusion of security—the exclusion of security policy as well
as of insecurity’. A study of the conditions and nature of a global security
community thus has far-reaching implications for the possibility of world politics.
In the end, we had to rely most of the time on nothing more scientific than
the use of analogy, occasional insight, and judgement. It is true that ‘events
are not affected by analogies; they are determined by the combination of
circumstances’. But analysis of events is certainly affected by analogies. To
throw analogies away would be prodigal, in the absence of any better
source of clues.
(Deutsch et al. 1957:13)
So long as this assembling of conditions occurs very slowly, we may treat the
status of each condition and the status of all of them together at any one time
as a matter of stable, seemingly unchanging background…. But as the last of
198 Visions of world politics
the conditions in each sequence are added to those whose attainment was
assembled previously, the tempo of the process quickens. Background and
process now becomes one…. Moreover, it is a process that may become
accelerated as a by-product of other processes of political and social change….
In this assembly-line process of history, and particularly in the transition
between background and process, timing is important.
(Deutsch 1957:70–1)
The kind of sense of community that is relevant for integration, and therefore
for our study, turned out to be rather a matter of mutual sympathy and
loyalties; of ‘we-feeling’, trust, and mutual consideration; of partial identification
in terms of self-images and interests; of mutually successful predictions of
behaviour, and of co-operative action in accordance with it—in short, a matter
of perceptual dynamic process of mutual attention, communication, perception
of needs, and responsiveness in the process of decision-making. ‘Peaceful change’
could not be assured without this kind of relationship.
(Deutsch et al. 1957:36)
Will in this sense implies the desire not to learn and change; and power as
domination may imply the ability not to have to do so. Will is constituted by
one’s understanding of the world, of one’s identity, of the right conduct of
action and one’s interests and preferences. ‘Both will and power may form
aspects of the pathology of social learning’ (Deutsch 1963:247). By pathological
learning Deutsch means ‘a learning process, and a corresponding change in
200 Visions of world politics
inner structure, that will reduce rather than increase the future learning capacity
of the person or organization. The result is an overvaluation of the present
expectations against all possibilities of surprise, discovery, and change’ (Deutsch
1963:248). One way to impose one’s ‘will’ upon others in a situation where
there is a conflict of interpretations or interests is to enhance one’s relative
power to others, by dominating others in terms of giving orders, making binding
rules hierarchically and manipulating with threats of violence. This tends to
result in an insecurity community with many related others (cf. Ashley 1987:428).
A lot depends thus on how open actors are to learning and changes. How
tolerable are they towards others and pluralism? And how responsive are they
to the wills of others? Tolerance, responsiveness and learning capabilities are
connected to the nature of the community—and therefore social context.
The assumption is that there will always be disputes and conflicts between
social forces. This implies that there can never be a stable ‘order’, an eternally
fixed set of practices and institutions. In more metaphorical terms, Deutsch’s
assumption is that it is not possible to tame or freeze history for a long time.
History is open. New interests and claims will emerge and new ‘messages’
demanding changes will be sent and made public. To simplify: if, despite
sustained efforts, there is no responsiveness from the side of status quo forces,
a pathological learning process may occur also among the advocators of changes.
This may lead to the escalation of conflict and (threats of) violence. The
preparedness to use large-scale violence inside and/or outside indicates non-
integration and is a sign of an insecurity community—inside and outside.
Security communities, in contrast, are characterised by the expectation that
future changes are going to be peaceful. Integration generates non-preparedness
to use violence. This can be explicated as follows:
A1 If a social system has become integrated, no relevant actor has any reasons
to prepare for the use of political violence.
A2 As actors know (A1), they do not expect anybody to use political violence
either to preserve the status quo or to foster changes.
B1 Non-preparedness becomes a generally followed and rarely, if ever,
questioned rule of action.
B2 In the course of social time, (A2) becomes an automatic, routine-like and
self-evident presupposition of political thought, argumentation and action.
(A1) is based on the thesis that actors are causally responsible for the outcomes
and that, at any point in time, actors could have done otherwise. Actors have
discursively or practically known reasons for acting in a particular way; they act
upon these reasons. Reasons are embedded in practices, narratives about one’s
identity and position in the world, world-understandings and related interests.
A global security community 201
What matters, however, is not only whether actors really have any reasons for
violence but also whether they expect anybody else to think that way. Hence, a
security community presupposes both (A1) and (A2). If the practical experience
of actors confirms in the longer run (A1) and (A2), the security community
becomes sedimented in the longue durée of social time, i.e. becomes institutionalised.
(B1) and (B2) describe an institutionalised security community, such as the Nordic
countries from, at the latest, the early 1950s onwards.
Are there any general conditions for the generation of (A1 and A2) and (B1 and
B2), for the virtuous circle of the development of security community? The inner
structure of actors, communication and learning are socially conditioned. For
instance, the scope and diversity of the narratives (or scenarios) an actor is capable of
generating is the result of past learning of shared cultural meanings and forms the
basis for contemporary moral and strategic choices. However, available narratives
also depend on the mechanisms of choice that select and amalgamate them. A system
of domination can control or structure communication, and thereby shape not only
available narratives, but also trust and loyalty between actors. Moreover, actors and
actions are structured and stratified. Constitutive and regulative rules define and
position an actor, who has stratified and structured reasons for his or her actions that
occur in the context of unintended consequences of action. Reified and naturalised,
hierarchical or heteronomic, relations of domination may sustain particular practices
and related patterns of practical reasoning and rationalisation.
Following Roberto Unger (see 1987a:155–7; 1987b:35–6, 277–312), I think
it is possible to change not only the content but also the force of social contexts.
By force of contexts I mean their relative immunity to challenge in the midst of
everyday practices, the rigidity of positions of their positioned practices and the
consequent rigidity of social division and hierarchy. Social contexts can be
distinguished with respect to their openness to transformation. Unger refers to
this openness as the ‘negative capability’, but I think self-transformative capability of
contexts describes the idea better. It means the facility to challenge the context
in the midst of everyday practices and the disengagement of actors’ ‘practical
and passionate dealings from a pre-existing structure of roles and hierarchies’
(Unger 1987b:278–9). Since only agents in social relations can carry out context-
transformations and since this is social activity, the conditions for individual
self-transcendence and collective context-transformation are, in large part, social
conditions.4 On the basis of this reasoning, a general hypothesis emerges:
This is the hypothesis that the expectation of peaceful changes should be higher
in the more self-transformative contexts. It is compatible with the empirical
research of Adler and Barnett, Hayward Alker, Deutsch and others. For instance,
many of Deutsch’s background conditions for amalgamated security
communities—capabilities and processes of cross-cutting communication; high
202 Visions of world politics
Other agreements may be needed. It is also true that the idea of connecting the
development of social forces in time to the need for changes is characteristically
modern, which may limit its geo-historical applicability. Moreover, social systems
are open and hence there can be overriding mechanisms or processes at work
(for instance, think about the potential effects of an economic collapse or
depression in the midst of a politicisation process). The process towards increasing
self-transformative capacity of social contexts may also have unintended
consequences such as anomalies, pathological learning, socio-economic problems,
etc. Last but not least, the advancement towards greater disentrenchment is
never more than possible. It can be reversed or overridden by other factors.
Thus, hypothesis (β) does not imply that relative disentrenchment of contexts
will guarantee, permanently, the absence of political violence. Also, it is not
possible to generate a list of the institutional arrangements or imaginative
preconceptions that correspond to different degrees of disentrenchment. All
efforts to build more revisable contexts must work with the contemporary
contexts in or with the past or remote sequences of context making that we
study or remember (Unger 1987a:157).
These qualifications notwithstanding, hypothesis (β) seems to bear a great promise
for future research and practical actions. It is a CR re-interpretation of the nature of
mechanisms and processes underlying any security community. Hypothesis (β) can be
qualified, revised or refuted by means of empirical research and theoretical
argumentation.5 It can also be used in explanatory models and concrete Utopias.
A global security community 203
Adler (1997) argues that a security community does not have to be tied to a
geographical region. There can be also imagined security communities as
‘cognitive regions’. Thus, ‘security communities might emerge between non-
contiguous states’ (Adler and Barnett 1998b:33). For instance, Australia or Israel
can be part of the Western security community This point is important, but
204 Visions of world politics
does not go far enough. Adler (1997:252) himself maintains that multilateral
regimes and economic arrangements have created a non-territorial functional
space; that state authority in the realms of security, economic welfare, ecology
and human rights has been increasingly internationalised; and that a primitive
global civil society has emerged. Rules, relations of authority and systems of
domination do not have to follow the principle of territoriality. John Ruggie
(1998:192–7) describes the non-territorial economic ‘region’ of the world vividly.
He argues that states continue to engage with external economic relations with
each other. The terms of these engagements are largely set in systems of
multilateral governance.
To conclude this chapter, I shall briefly explain why the Kantian world order—
his solution to the international problematic—may not be a security community;
how Immanuel Kant was led to a dilemma he could not resolve; and how the
notion of global security community can help to overcome this problematic.
For Kant, all public rights are transcendental, i.e. universally valid. He claims
that the civil state is based a priori on the following principles (Kant 1983
[1793]:72):
(= men who own property and are their own masters, not women, wage-labourers
or children); in the process of law-making, all citizens have one vote.
1 There is the formal separation between private (including property) and public,
which is equated with a particular kind of society/state relationship. Private
capitalist economy is privileged; the public state is controlled and checked by
regular elections and the system of ‘checks and balances’.
2 The model assumes free trade, interdependence and, in that sense, a globalising
laissez faire economy.
3 The model includes implicitly Western, and particularly US, hegemony and
dominance of global governance. The USA, the EU and Japanese state
managers and ‘private’ actors also control global investment decisions and
flows. The UN Security Council—dominated by the USA—is responsible for
identifying the violators of the norms that define one as ‘a peace-loving nation’.
A global security community 207
The US army, supported by other Western states, enforces these norms, with
or without the UN Security Council (see Patomäki 1999a; 2001c).
4 At a thinly hidden level, there also is the US power projection, which consists
of extended deterrence, on the one hand, and the ‘idealist’ project of spreading
the US conception of genuine identity and right political conduct, on the
other (Klein 1988; see also 1990).
Even Kant, however, who is often taken to have advocated collective security, was
not arguing for a world government. Kant was suspicious of the idea of a world
government. In his view, a world government could be easily turned into a
tyranny:
a federative state of nations whose only purpose is to prevent war is the only
state of right compatible with their freedom…. The idea of international right
presupposes the existence of many separate, independent, adjoining nations;
and although such a situation is in itself a state of war (assuming that a
federative union among them does not prevent the outbreak of hostilities),
yet this situation is rationally preferable to their being overrun by a superior
power that melds them into a universal monarchy. For laws invariably lose
their impact with the expansion of their domain of governance, and after it
has uprooted the soul of good a soulless despotism finally degenerates into
anarchy. Nonetheless, the desire of every nation (or its ruler) is to establish
an enduring peace, hoping, if possible, to dominate the entire world.
(Kant 1983 [1795]:138 and 124–5)
of actions. The crucial issue is whether these challenges and revisions could
lead—how?—to increased learning and self-transformative capacity. This is a
question, it seems to me, that can be posed only after the international problematic.
A new background theory gives rise to a characteristic set of problems and also
provides some plausible answers to them.
Conclusion
Notes
1 Deutsch et al. (1957:16) explain the selection of their cases as follows:‘In selecting a
limited number of cases for intensive investigation, we concentrated on the area of
Western European and North Atlantic civilization. There are obviously other cases
that we should have liked to include. Considerations of time, resources, availability
A global security community 209
This final chapter is a call for making the Nordic model a basis for emancipatory,
regional and global political action. My case is only superficially connected
with recent attempts by Tony Blair and Bill Clinton—with the assistance of
Anthony Giddens (1998)—to resuscitate the Third Way.1 Rather, the argument
of this chapter is based on a causal model, which explains the decline of the
Nordic model. Moreover, I also make a case for re-defining many of the ideals
of Norden in the context of the legitimation problems—if not crisis—of the so-
called Washington consensus.
At the outset, the question is: Are the principles of the Swedish model2
sustainable in a changing world that tries to re-formulate and re-define the
public sphere and rules for economic policy in accordance with orthodox market-
monetarist postulates in macro-economics (fiscal and monetary policy) and micro-
economics (e.g. trade, labour market and industrial policy)?’ However, the issue
of the future of the Nordic model also has a significance that goes far beyond
the immediate concerns of economic policies of states. Norden has also been a
model of an enlightened and anti-militaristic society that follows the principles
of distributive justice and which was supposed to be morally superior to two
alternative models of modernisation: the USA and the Soviet Union. After the
collapse of the Soviet Union, it thus appears that the question of whether there
is a future for the Nordic model also means ‘Is there any socially oriented,
democratic alternative to the Washington consensus-led neo-liberalist
globalisation?’
My answer is affirmative. It is of course true that, due to fundamental
contextual changes, attempts to preserve the Nordic model within the confines
of a nation-state have been in difficulties for a quarter of a century. However,
there are many features in these models and related moral and political ideals
that also carry weight in the globalising world at the dawn of the century. In
light of this, I discuss democratisation, solidarity, de-commodification and
universalistic social policy as moral ideals; and the activist politics of neutrality
that gave rise to globalist attempts at building a more just and peaceful world.
At the threshold of a new century, these are potential elements in the formation
of new identities and in telling new stories and creating new myths for counter-
hegemonic purposes.
210
Beyond ‘Nordic nostalgia’ 211
There are at least two main ways of understanding the Nordic model in terms of its
content and place in world history. This contextual specification also determines the
identification of the main features of the Nordic model. First, Norden can be seen as
a political ideal and model of sovereignty-based active internationalism in the Cold
War context. With the sudden fading away of the Cold War, there were constitutive
reasons and real tendencies for the decline of this model (Wæver 1992).
Second, it can just as easily be seen as a model of organising production and
exchange relations and income distribution within the Bretton Woods system
of embedded liberalism. In this interpretation, the Nordic model and the social
forces driving it were vulnerable to the processes of gradual disembodiment of
‘embedded liberalism’ that started to make a qualitative difference at the turn of
the 1960s and 1970s (see Ruggie 1982, 1995). In this reading, the active
internationalism of Norden stemmed more from the shared societal values that
were translated into foreign-policy orientation than from the Cold War context.
I shall start with the Cold War contextualisation, following Ole Wæver’s (1992)
interpretation of Norden’s fate after the end of the Cold War. Wæver claims that
in the early 1990s there emerged a ‘Nordic nostalgia’, a longing for the certainties
of, and Nordic privileges in, the Cold War world. Originally, as a political ideal
in the Cold War context, the Nordic model was defined as representing the
Third Way: neither US capitalist liberalism, nor Soviet-style state socialism, but
social democracy. In this sense, the Nordic identity was dependent on the
competition between capitalism and communism. Moreover, the neutrality of
Sweden—and also that of Finland—stemmed from the Cold War struggle over
models of organising society, and from the related military competition (even if
sometimes the Cold War seemed to have assumed for its participants a form of
an eternal and politically neutral Waltzian system of bipolar power-balancing).
The presumed superiority of the rational, enlightened and (mostly) anti-
militaristic Nordic society was in part based on the fact that in Norden the
military tensions were much lower than in Central Europe. This was despite
Norway and Denmark being members of NATO and Finland having a Treaty
of Friendship, Co-operation and Mutual Assistance (signed in 1948) with the
Soviet Union. Further, Norden was a passport-free zone of common labour
markets and shared social security systems. For Wæver, however (1992:84–5),
this regime was partially independent of the Nordics, and only partially
stemmed from ‘rather grandiose schemes and declarations about issues
somewhat intangibly related to the Nordic countries themselves’.
It is true that there were also Cold War-related military-strategic reasons for
creating a pluralist security community and for deepening co-operation among
212 Visions of world politics
the Nordic countries.3 Nonetheless, even in the Cold War context, it was the
social democratic Sweden that led Norden, by showing an example of social
reforms and by exercising ethico-political leadership in building the idea of
actively internationalist Norden. This ideal was widely accepted for decades,
until the end of the 1980s. Even in this security-oriented interpretation, Norden
was not only about security, but also incorporated a particular socio-economic
system and a particular theory of modernisation. The main features of this
Norden, as seen through the lens of the Cold War competition, are summarised
in the first row of Table 9.1. The second row sets forth Waever’s account of
how the end of the Cold War undermined the identity and idea of Norden.
Wæver claims that, contrary to what many people might have expected, the
Nordic countries reacted to the end of the Cold War with scepticism, frustration
and attempts to limit the impact of change (Wæver 1992:77). Why? According
to Wæver, it is a matter of nostalgia both for a secure identity and for the
Golden Age of Social Democratic vision of world historical progress and the
building of a just society and peaceful world. For Wæver, the way that the
vision of Norden as an alternative has functioned was visible, for instance, in all
the EC debates in all the Nordic countries, where the ‘Nordic card has regularly
been played by opponents of EC membership’ (Wæver 1992:94). Nordic values
have been contrasted with the European market; Nordic co-operation has been
put forward as an alternative to European integration.
Beyond ‘Nordic nostalgia’ 213
There is reason to believe, however, that this Cold War and security-centred model
of Norden and its prospects is at least partially inaccurate and inadequate. Consider
the explanations for changes from the Cold War Norden to the post-Cold War
Norden. The logic in each explanation is similar and always inadequate. The
argument about the impossibility of the Third Way after the end of the Cold War
is conceptual only—and incorrect as such. It seems to be based on the idea that there
are only relational differences, and that the world is about language only (cf. Wæver
1992:79, 86):
The old system had two competing points—the planned economy and
capitalism—so that any distance naturally led to the privilege of being third.
The new system has one dominant centre—the EC—and all distance from
this becomes simply peripheralism.
(Wæver 1992:86)
However, even in its own terms, this argument is questionable. Why should the
third be automatically ‘privileged’ (whatever that means)? Many critical cultural
theorists would argue on the contrary that there is little room for the third in the
binary logic of Western—and many other cultures’—metaphysics. (About the violent
implications of strict either/or thinking, see, for example, Beardsworth 1996:54–97;
Galtung 1990). During the Manichean struggle between the USA and the Soviet
Union, there certainly was very little room for any third possibilities in many parts
of the world.
Also, the essentialist argument about the EC as the dominant centre has
only little grounding, at least at logical and conceptual levels. It presupposes the
pseudo-Hegelian master—slave argument, which says that if there is a master,
all attempts by the slaves to keep a distance are simply ‘peripheralism’. Hence,
the only worthwhile strategy for the slaves is to try to gain recognition from
their master and participate in his practices. This reading finds support in claims,
for example, that the previously anti-militarist Nordic countries should now
start to ‘carry power-political responsibility’ (Wæver 1992:82). Obviously, there
could be more emancipatory readings of the master-slave dialectics.
However, the logic of the argument itself is problematic. Suppose we talked
not of the Third Way but instead about the Nordic model. Would we not still be
making reference to roughly the same real entity X (i.e. a particular social complex,
consisting of systematically organised relational, institutionalised practices, and to
the associated values of democracy, solidarity and egalitarianism)?
The question is: Why is it that this X had to change? Why is it that X could
not compete with the US model of modernisation without any problems even
after the collapse of the Soviet Union? There is no compelling logical, conceptual
reason why it could not have done so. Only if we can demonstrate, by way of
systematic empirical research, that this fallacious way of thinking was in fact
214 Visions of world politics
prevalent in Norden after 1989, would it be possible to claim that this was a
cause of the decline of the Nordic model.
Wæver does not provide any systematic empirical evidence. However, at the
level of constitution of discursive practices, there is also the possibility that
Waever’s article has, after 1992, reinforced the acceptance of this fallacious
inference and thereby contributed to the relative decline of Nordic values,
particularly among foreign-policy elites in these countries.
Consider, next, the explanation of change in the socio-economic system.
Besides simple descriptions of changes in economic policies in Sweden in
particular, the main explanation seems to be that the social democratic welfare
state is in an identity crisis because it is essentially connected to the notions of
modernisation and rationalisation, which, in these post-modern days, are in
crisis (Wæver 1992:85). However, even if we accepted this argument, and I
think there is something to it (cf. Leonard 1997), it would be far from obvious
why the US welfare system or neo-liberalism would be less in crisis. Even the
EU is typically represented as an exemplar of modernisation, although its
complexity and divergent nature has also given rise to more post-modernist
interpretations (cf. Ruggie 1993). Ruggie and others notwithstanding, the
modernity of social democracy cannot be a cause for the crisis of the social
democratic form of welfare state as contrasted to the ‘key ideas’ of standard,
modern liberalism, namely ‘political freedom rights, the free market and
integration’, as Wæver claims. Notably, Wæver only mentions the main
characteristics of the social democratic form of the welfare state in a footnote
(Wæver 1992:85, n. 13).
Finally, there is a further argument: Norden was not only defined as the
Third Way between capitalism and communism, but also in contrast to Europe.
That is, particularly in security political terms, it was supposed—by whom?,
with what effects?—to be better than Europe. After the end of the Cold War, the
key positive dynamics, in terms of movement towards peacefulness, were taking
place in Europe, and this supposedly turned the whole setting upside down. It
was Europe that proved to be somehow better than Norden. However, the problem
in this argument is the ambiguity in the terms ‘better’ and ‘Europe’.
To be a causal argument (that could, possibly, constitute a partial explanation
of the change of the Norden), Wæver’s point has to be seen as a claim about the
prevalent political rhetoric that was also efficacious in practice and action. Yet,
Waever’s point seems to be at least as much, if not more, a normative argument
that, in contrast to Norden, Europe after the end of the Cold War is now morally
and/or politically better. Simultaneously, the term Europe is used in different
senses in this argument. Sometimes it is the ‘common European home’ of Mikhail
Gorbachev (1989 as the key dynamics), and sometimes it is the Europe of the
EU (1992 as the key dynamics). However, exactly in what sense could it be
argued that Europe, in either of these two senses, suddenly became more peaceful
than Norden? There is no justification for this presumption, or for the significance
of the comparison itself. Yet, we do recall the argument that the reluctant Nordics
should now partake in the new master’s power political practices.
Beyond ‘Nordic nostalgia’ 215
Above I shed a shadow of suspicion over the Cold War and European
integrationoriented interpretation of the Nordic model. However, false beliefs
can be causally efficacious. The Cold War and security-centred model of Norden
together might explain the decline of Norden to the extent that the arguments
discussed above were de facto playing a role among the relevant political actors
by constituting their actions and practices. It is the task of further research to
determine the extent to which this was significant in the late 1980s and 1990s.
Even if it were true to a significant degree, this fact would simply become the
next object or tendency to be explained in depth.
There are reasons to believe that the causal processes bringing about difficulties
for Norden—and the Swedish model in particular—were largely unrelated to the
Cold War. In Sweden there had been adjustments to the Nordic model for a
decade before Gorbachev. The economic problems of Sweden started as early
as the 1970s, with some of them having their origins in the 1960s. In different
phases, they continued over the Second Cold War of the early 1980s and
became more severe again after 1989. The responses to these crises further
changed the context of political action and favoured particular solutions at a
later stage. They had already brought about a re-definition of the Third Way
by the early 1980s. The Third Way began to mean, increasingly, a compromise
between pure social democracy and neo-liberalism, rather than the Third Way
between capitalism and communism (Ryner 1994a:389 and 423, n. 2).
The problems and related re-definitions stemmed mostly from the internal—
external dynamics of the Swedish political economy, entangled in the globalisation
of social relations. Since the early 1970s, this globalisation has, in turn, taken place
in the context of the partial disruption and transformation of the Bretton Woods
system. However, to make the case for a political-economy explanation of the
decline of the Swedish model, we first need an adequate identification of its major
features.
Norden is most frequently associated with the universalist welfare state. Indeed,
the already classic power mobilisation thesis of Gøsta Esping-Andersen (1990)
makes Sweden a representative of the institutional re-distributive model of social
policy, which operates externally to market logic. This model presupposes
poverty to be a permanent condition of industrial societies if not counteracted
by decommodified social institutions. For Esping-Andersen (1990:44), de-
commodification is emancipation from the dependency of capitalist labour
markets—and thereby also from employers’ control—by means of universalist,
high-level social benefits. Also, education, Healthcare and insurances were
organised collectively and outside markets, that is, in a de-commodified way.
216 Visions of world politics
Finally, this social democratic model was coupled with a full employment
policy.
However, there was more to the Swedish model than this. The Nordic model
also implied a certain way of organising production and exchange relations,
and presupposed a particular world order. Table 9.2 summarises the major
features of this model, again also by contrasting two eras (here I follow closely
Ryner 1994b; 1997; but see also Weiss 1998:86–100; and in particular the
more detailed historical account by Pontusson 1992:37–123).
The Rehn-Meidner programme, named after its key proponents Gøsta Rehn
and Rudolf Meidner, emerged with the Saltsjöbaden capital-labour Accord of
1938 and was developed in the 1940s and 1950s. This model advocated
solidaristic wage policy but also encouraged firms to make technological
innovations, by squeezing low-productivity firms and industries. It also helped
adjustments to the technological dynamism of the capitalist world economy by
means of an active labour market policy: unemployed were re-trained and moved
to new employment by the state.
The Social Democratic Party (SAP) rose to power in 1932 by winning
parliamentary elections. It remained in office—either alone or by forming coalition
governments—uninterruptedly for forty-four years, until 1976. SAP provided
ethico-political leadership in developing and consolidating the institutional
framework. The Saltsjöbaden Accord formed the basis for centralised, corporatist
wage negotiations between the Swedish Union Federation (LO) and the Swedish
Employers’ Association (SAF). However, the Accord also guaranteed absolute
autonomy for the capitalist firms and their managers in deciding upon finance
and investments, intra-firm relations and the allocation of returns and profits.
Since the Accord, this capitalist laissez faire aspect of Swedish social democracy
remained unchallenged until the late 1960s.
The Swedish model also presupposed a particular world order. The era of
Pax Americana between 1945 and 1973 was the ‘Golden Age’ of the Swedish
model—and the original Bretton Woods system of embedded liberalism (Ruggie
1982). The essence of the embedded liberalism compromise of the original
Bretton Woods system was to devise a framework that would safeguard and
even aid the quest for domestic stability and legitimisation, ‘without, at the
same time, triggering the mutually destructive external consequences that had
plagued the inter-war period’ (Ruggie 1982:393). Moreover, the Bretton Woods
system also represented a partial victory of productivism over financial capital:
the Bretton Woods systems imposed significant constraints on the freedom of
movement of financial capital including fixed exchange rates and capital controls
(Gill 1997a:7).
In the embedded liberalism compromise, it was up to nation-states to design
their own variations of domestic stability and legitimation in terms of liberal
democracy and the welfare state, but Pax Americana set definite limits to possible
variations. Although its foundations were laid before the Second World War, the
Swedish model was, in fact, a possible variation. It was also more pluralistic than
it might have appeared at the outset. Particularly the Cold War-centred
interpretation of the Third Way would see Norden as competing with the USA
and the Soviet Union as the symbol of modernity and enlightenment, and, in its
own self-understanding, as a model of a ‘more developed society’. Yet, the Swedish
model allowed internally for competing understandings of its meaning. It was
seen either as a modernisation of capitalism—rendering class struggle and ideology
irrelevant (capitalists, industrialists)—or it was seen as a middle way of extreme
tendencies (mainstream social democracy). It was also envisaged as a stage on the
way to democratic socialism (radical social democracy, the New Left).
Many industrialists and most mainstream social democrats would argue that
Sweden was able to establish a relatively egalitarian society by means of an
exceptionally generous and comprehensive welfare state. In contrast, the more
radical social democracy and the New Left would emphasise that social
democracy had neither changed the underlying capitalist economic structure
nor gained control over investments (Rothstein 1996:4–5).
218 Visions of world politics
By the 1960s, the Swedish model had become the basis of the identity of Norden.
However, despite its long-lasting success in fulfilling its own promises, it started
to come under pressure from a number of directions. The turning point for the
Swedish model came in 1976. The Social Democrats had to step down from the
Government for the first time in forty-four years, giving way to the Centre
Party and Conservatives. That, in turn, took place six years after the US
declaration that the dollar was no longer convertible to gold, and two to three
years after the oil crisis, in the context of a deep Swedish economic depression.
Figure 9.1 illustrates how, in open systems, both internal and external
conditions tend to change, and how the basis of an institutional framework
may erode. This also happened to the Swedish model, despite its sustainability
and achievements between 1938 and 1976. Figure 9.1 summarises the underlying
socio-economic dynamics in terms of changes in occupational structure and
related class identity, as well as in the identity, interests and options of private
capital. These processes co-explain many of the difficulties that emerged by the
1970s, and also some of the first transformations of the Swedish model.
The first signs of trouble emerged in the 1960s: expansionary macro-
economic policies triggered inflation, and in-built tendencies for technological
dynamism were compromised for political reasons. Increasing alienation
between the trade union leaders and the rank and file led to wildcat strikes,
quickly followed by a leftist revolt against the decision-making monopoly of
capitalist owners and managers over investments and workplace practices.
Backed up by the global New Left movement, the labour movement in
Sweden introduced the ideas of active industrial policy, industrial democracy
and collective share ownership, and thereby it also challenged the
fundamentals of the Saltsjöbaden Accord (see Pontusson 1992).
Continuous wage-push inflation became a recurrent problem in the Swedish
economy Productivity growth declined dramatically in the 1970s, particularly
after 1973.4 Indeed, the global economic crisis of Fordism of the 1970s hit
Sweden more severely than any other OECD country. Stagflation, declining
productivity growth after decades of high productivity growth, led to Sweden
accumulating substantial budget and payment deficits and a foreign debt during
the period 1976–82 (Ryner 1994a:387–92). Simultaneously, a new world order
began to emerge. The new world was dominated by globalisation and trends
towards both more orthodox economic liberalism and the dominance of financial
capital over production.
In these circumstances, organised business and the bourgeois parties of Sweden
began a massive mobilisation against industrial democracy and, more concretely,
against the wage-earner funds. Consequently, the labour movement retreated from
the more radical features of the plans to co-control and democratise investments.
Despite these political developments—a turn to right favourable to capital—Swedish
firms began an investment boom abroad in the 1980s, exploiting the new exit
options. The new Social Democratic government, elected in 1982, also tried to use
Figure 9.1 Structuration of the conditions of the decline of the Swedish model 1976–89, in the context of stagflation and indebtment
220 Visions of world politics
In terms of the moral and political ideals, the Nordic (or Swedish) model has
been in decline for a quarter of a century. Even still, these ideals are shared by
the majority of the population in all Nordic countries, although they have been
on the defensive, and are often blurred with themes of ‘necessary adjustments’
and ‘European integration’.
In Sweden, the ideal of a Third Way was re-defined and compromised as
early as the 1980s. The reforms of the welfare state that have been taking place
since then have had very little to do with traditional social democratic ideals.
They are much more in line with the neo-liberalist political programme on state
re-structuring and economic liberalisation in accordance with the Washington
consensus (for a recent articulation, see World Bank 1997; and for a slightly
contrasting view, Stiglitz 1998). Consequently, Sweden has been losing its ethico-
political leadership in the Nordic countries.
The unilateral decision by the Swedish government to apply for EC
membership in October 1990 was, in one perspective, perhaps the last nail to
the coffin of Norden. Finland and Norway followed suit. Finland has become an
exemplary EU member that has in many respects renounced its Nordic values,
as well as its association with Swedish ethico-political leadership. The Norwegian
population, however, voted that Norway should stay outside the Union, largely
on the grounds that membership would compromise its Nordic values.
So, was this the end of the Nordic model? Not necessarily. Although the model
itself has been compromised by neo-liberalism, there are signs and indicators of the
persistence of the power of social democratic institutions and values. My argument is
not only about the size and welfare benefits of the state or the average public opinion
in these countries, but also concerns deeper processes. Institutional transformations
are conditioned by public will-formation, which in turn is the outcome of
complicated processes of formation of ways of life, dominant forms of representation
and, indeed, institutions, which structure the decision-making situation faced by
actors and influence trust (Rothstein 1998:134). There is reason to believe that, given
a more adequate understanding of the changing context of Nordic politics, support
for revived Nordic values should clearly outnumber the current—hesitant, confused
and ambiguous—approval of neo-liberalism.
Now, the quantitatively measured role of the state has in fact been rising in
Norden since 1980, despite the partially implemented efforts at welfare state
retrenchment. Table 9.3 gives the figures for total expenditure of central
Beyond ‘Nordic nostalgia’ 223
Table 9.3 Central government budget, total expenditure, as percentage of the GDP
224 Visions of world politics
As the drafters of neo-liberal political programmes also know, unlike the situation
in the nineteenth century when capital also had global reach—even if without
the technological devices of the network society (cf. Castells 1996)—global political
innovations today must confront mass democracy and the broad-based desire
for representation. Democracy, at least in the sense of periodic voting in free
elections, is more institutionalised on a world-scale than ever before (cf. Gill
1997a:1–2). In Norden, national democracy is sedimented in the longue durée of
history: all Nordic countries have been liberal democracies since the early 1920s,
in Sweden and Finland without any interruptions.
The democratic, solidaristic and egalitarian values and interests seem to have
been deeply institutionalised. Measures to lower the level of social benefits, at
high levels of unemployment and with increasing pressure to deregulate labour
market and relations, have tended to be unpopular and hard to implement. In
Sweden, they cost the Conservative-led coalition government of 1991–4 its
support, and the Social Democrats have since been losing support as they
implement welfare cutbacks. After the September 1998 elections, SAP is only
able to form a government with the support of the Greens and the Left Party
(former Communist Party) (Widfeldt 1998).
In Finland, practically all parties have been integrated, to varying degrees,
into the neo-liberal programmes of state restructuring and economic liberalisation,
so there is no real electoral alternative. Apparent consensus among parties
notwithstanding, there are critical discussions within parties about the current
imperatives and options, and, most notably, a clear majority of citizens seem to
have been against the present (more laissez faire and commodified) kind of capitalist
market economy. The popular resistance against deepening integration seems
to have been stronger than in any other EU country (see, for example,
Europinion 1995). Finns have been against a European single currency and
EMU, yet Finland entered the third phase of EMU in 1999. In general, it can
be argued that, given the common sentiment, the acceptance of government
policy seems to indicate a false understanding of the EU processes.
Danish and Norwegian popular resistance to the European integration process
indicates a similar adherence to Nordic values. Denmark first rejected the
Maastricht Treaty and then had it accepted in the second referendum but with
major qualifications. Danish support for European integration has been primarily
confined to its function in facilitating trade. The opposition to the Maastricht
Treaty was based not just on Danish nationalism but also on the defence of the
Scandinavian model of democracy and welfare state (see Lawler 1997:574–8).
In Norway, the majority of voters have twice rejected EC/EU membership,
first in 1972 and then again in 1994. In the latter Norwegian anti-EU campaign,
the debate and referendum was represented as struggles between ‘commonsense
Beyond ‘Nordic nostalgia’ 225
and capitalism’. This ‘commonsense’ was typically, either directly or indirectly, the
discourse of Nordic social democratic values, exceptionalism and progressive
internationalism, although authentic voices of the countryside and nationalism
were also strong (Lawler 1997:578–81).
In the Nordic democracies, which used to be highly participatory, a cleavage
between the political elite and large segments of the population has emerged.
The mainstream of the elite has been convinced of the necessity of neo-liberal
adjustments and European integration, yet the population tends to be much
more supportive of social democratic values. The conviction of the political
elite has been at least partially based on false beliefs about the situation and the
existing choices. For instance, it is a mistake to try to understand the popular
resistance to the EU only in terms of societal security, identity and nationalism,
as Wæver et al. (1993:24–40) do.5 Given a linear temporality, this mis-
identification tends to reinforce the elite belief that their societies are simply
lagging behind in the evolution of social organisation. This, of course, justifies
attempts to educate and lead or manipulate the bulk of citizenry.
It has indeed been claimed that political elites manipulated the acceptability of EU
membership. Not only did the elites of the parties in office make the decision first,
expecting their political parties, and ultimately, the wider public opinion to join them
later (preferably just before the referendum) (Lawler 1997:583), but they also
manipulated the conditions for public will-formation. For instance, the currencies of
Norway, Sweden and Finland were linked to the ECU in the course of the financial
crisis of 1990–1. This was a clear step towards membership, yet it was neither
supported by the expert opinion of economists nor discussed democratically (Moses
1997). Similarly, the order of the 1994 referendums clearly maximised the
bandwagoning effect in favour of the membership. Had the first referendum been
held in Norway, Sweden, too, would very likely have rejected membership. This, in
turn, would have strengthened the NO (to the EU membership) side also in Finland,
the country most favourable towards membership.
Joining the EU cannot be reduced to enforcement of neo-liberal reforms, but
given the nature of the EU of the 1990s, these are concomitant and mutually
reinforcing aims (as recognised also by Wæver 1994b:7–8). Although the EU is
facing severe legitimation problems everywhere, in Norden, it seems to be particularly
difficult for anti-state, commodifying, free-market neo-liberalism to find sufficient
support for free-market and privatisation reforms. Neo-liberal reforms are therefore
usually conducted under the rubrics of economistic necessity and ‘pragmatic’ or
‘realist’ acquiescence to the new globalised and post-Cold War world. However,
sometimes a reference is also made to more elevating stories that justify neo-
liberalisation and the European integration in terms of a quasi-Hegelian Zeitgeist.
Times are now different; social democratic Norden is mere nostalgia for the past.
Wæver’s article about Nordic nostalgia is a case in point.
Elite discourses notwithstanding, Norden abides in terms of both
institutionalised practices of the welfare state and, in particular, popular values,
partially enabled and sustained by the very institutions created by the social
democratic movement. Finally, it should be noted that Nordic co-operation
226 Visions of world politics
It is very hard to make any sound judgements on the relative strengths of different
identities (are we talking about opinion polls, widely circulated public discourses,
presuppositions of practices or what?). It nonetheless seems to me that the Baltic
identity has remained much weaker and more vague than the Nordic one by all
accounts. Leaving aside Norway and, more ambiguously, Finland, there have been
few if any economic success stories around the Baltic Sea in the late 1990s.
Despite the relative perseverance of Norden, there is a widespread confusion
and few non-neo-liberal visions about the direction to be taken.
people are misinformed and misled by a false account of the reasons for
occurrence of unwanted things or the absence of wanted or desired things, it is
accurate to say that their perceived interests are not in their real interests.
Although the only way to test a hypothesis about real interests is a sustained
dialogue (under sufficiently ideal conditions), it is both possible to examine the
reasons for illusions, reifications and mystifications, and develop visions informed
by better accounts of the real-world processes.
Actors can be misinformed and misled at different levels. Meta-theoretically,
abstract ontological, epistemological and axiological assumptions—many of which
go unnoticed most of the time—can make concrete interpretations susceptible to
stoicism (indifference to the world), scepticism (denial of the world) or unhappy
consciousness (compensation for the frustrations of the real world found in a
fantasy world), or, in general, to different discursive there-is-no-alternative (TINA)
formations (see Bhaskar 1993, 1994; and, in the IR context, Figure 1.1 of
Chapter 1.
In elite discourses, the problems of the Nordic model since the 1970s have
made sense in terms of theories presupposing irrealist philosophies giving rise
to TINA-formations. These have included positivist economics/rational-choice
theories and sceptical post-modernism, both typically imported from the USA
(although the latter originates in France).6 These have been coupled with more
substantial theories that assumed that neo-liberalist reforms are necessary
adjustments to the irrevocable processes of Europeanisation and globalisation.
Theoretical concepts can also be vastly misleading simply by being rooted in
historical realities other than the one under scrutiny, and by being applied in an
unreflective and uncritical way (see Bourdieu and Waqcuant 1999).
Whatever the variation, the result is always the same: tacit acceptance of the
existing relations of domination and other constraints and absences, which are
jointly causing the unwanted and undesired situation that prohibits further
flourishing of human possibilities. These irrealist elite discourses are then fed to
popular commonsense, particularly through endless, repetitive discussions and
images of the increasingly privatised and commercial media intimately tied to
the globalising—mostly US—networks and corporations (see Herman and
McChesney 1997). Yet, although many understandings have changed because
of these powerful forces, some of the institutions and most of the values of the
‘commonsense’ have tended to remain characteristically social democratic and
Nordic.
What, then, is the real interest of Norden? My argument is that it is in the real
interest of the Nordics to struggle for changing the conditions, which have
made it so difficult to sustain and further develop social/democratic ideals.
Consider the model I have suggested for explaining the relative decline of Norden.
There are two sets of conditions here. First of all, there are those glocal conditions
and tendencies—essentially local and national processes having global conditions—
which have made labour-based power mobilisation, corporatist joint central
regulation and the welfare state itself more difficult to sustain. They have also
turned out to work against attempts to re-introduce ideas about democratic
228 Visions of world politics
control over investments and production process. Second, there are the related,
mostly external global processes that have undermined the embedded liberalism
of the original Bretton Woods system and have made the liberalism of the
Washington consensus appear so indisputable.
The Nordic model has been based on an extraordinary reliance on the sovereign
state and its capabilities to produce welfare and democratic, internationalist
progress. In an intellectual climate dominated by attempts to overcome the state
sovereignty, Lawler (1997:566) quite rightly calls this Nordic preoccupation
with the state as an ‘intellectually unfashionable attachment to a positive
understanding of the sovereign state which a generic term such as nationalism
cannot fully capture’.
Moreover, the reliance on the state in this way has not produced the typical
exclusive self—other patterns to which the notion of state sovereignty is connected
in political realist/idealist discourse, as depicted by post-structuralist theories
(see, for example, Ashley 1989). Since the early 1950s, the Norden has been the
exemplar for a pluralist security community, where it would be totally absurd
even to imagine any security threat coming from other Nordic countries (i.e.
threat of violence is simply a non-issue), and where all the social problems have
been and are resolved by means of peaceful changes (see, for example, Joenniemi
1996:11–22). These peaceful changes have been in accordance with the social
democratic ideology. At the end of the 1960s, the Nordic countries attempted
to set up NORDEK, a Nordic economic union. Even though the NORDEK
project failed in the early 1970s, Nordic integration remains significantly deeper
in social areas, labour markets and freedom of citizens than that of the EU.
Given the conditions that have been undermining the Nordic model, there is
a need for a new spatio-temporal self-understanding. The neat model of inside
progress and universal welfare, both within the Nordic countries and in the
Norden as a whole, combined with outside foreign policies, might have worked
for a while, under the very specific geo-historical circumstances of 1938–68
(partially disintegrated post-war capitalist world economy, the Bretton Woods
system, etc.). However, ethically, whether or not Nordic foreign policy has been
concerned with security and wealth in a traditional way or with the world’s
social problems by means of active state-led progressivism, it has, at least at
some level, implied an external other (cf. Neumann 1994). Existentially, it has
never come to terms with its own conditions, whether glocal, regional or global.
Starting with the critical realist (CR) idea that emancipation is ‘characteristically
the transition from an unwanted, unnecessary and oppressive situation to a wanted
and/or needed and empowering or more flourishing situation’ (Bhaskar 1993:396;
1994:253), I conclude this chapter by arguing that any meaningful Nordic
emancipation must be connected to regional and global social democratic reforms.
The explanation of the decline of the Nordic model shows that the causes were
Beyond ‘Nordic nostalgia’ 229
For many Nordic social democrats, nation-states are no longer sufficient as the
framework for political action. Perhaps the EU means more than simply
necessary adjustments and neo-liberalisation? Perhaps it may also open up new
possibilities for regulation and redistribution? Thus, European regionalism has
become a possible solution to the problem of controlling and regulating capitalist
markets and creating more democratic and solidaristic arrangements. On this
basis, Hettne (1993) has argued that there is a new positive understanding of
the ‘logic of stateness vis-a-vis the free play of market forces’, this time articulated
as the ‘pursuit of regionness’.
This strategy addresses at least one of the crucial conditions for the decline
of Norden, namely the trans-nationalisation of production and markets within
Europe. For the Nordic countries, the single market of the EU is the most
important one, whether defined in terms of imports and exports or the activities
230 Visions of world politics
These reforms would not amount to transforming the EU into a democratic welfare
state, but the direction is clear. These are steps towards democratising
decisionmaking and empowering the EU with state powers to control and regulate
capitalist market forces, and to take some steps towards European re-distribution.
In fact, to make this vision even more Keynesian, it could also include ideas
about European demand management, based, for example, on devaluation of
the euro, and employer’s contributions and taxes on households. To make it
more egalitarian in a Greenish manner, it has embraced ideas about cutting
down the working hours a person is allowed to work and distributing the
remaining hours more evenly (cf. Lipietz 1996).
The programme could go further in tackling problems in localities (in the
workplace in particular) through the EU by complementing these reforms with
democratic reforms in the labour market and within work organisations. This
could be done, for instance, in accordance with the model of negotiated
environment, which seeks efficiency not by means of ‘flexibility’ and
individualising competition (i.e. by commodification of human beings and social
relations), or by discipline (i.e. by neo-Taylorist means), but by means of
involvement and participation of both trade unions and blue- and white-collar
workers (for some details, see Lipietz 1996; also 1994; and Ryner 1997).
To address the quest to re-think political community and democracy, the
vision should also include considerations about European citizenship and identity
(cf. Minkkinen and Patomäki 1997:111–47). The simultaneous weathering of
Beyond ‘Nordic nostalgia’ 231
national social and political citizenship and the emergence of the possibility of
post-national citizenship could be taken as a step towards reconstructing political
spaces and communities in Europe. Currently, European citizenship is based
merely on principles of economic liberalism (for criticism, see Habermas 1992).
Any sensible programmatic vision about Europe should include a strategy for
resuscitation of political citizenship in post-national terms.
Finally, to avoid the violence of building a new centre in the world and
excluding alternative possibilities, the new European identity should be able to
incorporate a moment of indecision and to consciously play against the tendency
of creating a new inside/outside dichotomy (cf. Derrida 1992). However, these
Derridean remarks already point towards the limitations of the Europeanist
strategy. Europe is in fact only a small part of the globalising world. A central
puzzle is the EU’s positioning vis-à-vis other parts of the world.
The Europeanist strategy of overcoming the Nordic nostalgia has in fact two
shortcomings. Obviously, it may cause Norden as a distinct identity to disappear
unless it will make Europe appear Norden writ large (cf. Therborn 1997:32–3).
However, more importantly, the conditions for the decline of Norden are not
only EU-European (cf. Figure 9.1). The relevant cultural and discursive relations
of representation and mediation are also global. The basic regulatory framework
of the world economy is devised in Washington and multilateral systems of
governance. Last but not least, the options of productive capital and the spaces
of financial markets are global. Hence, it does not suffice to address these issues
merely by pursuing European regionness.
Indeed, the European integration process has been and is conditioned by
globalisation (see Castells 1998:310–34). Gill (1997b:209) may simplify to the
extreme when he argues that EMU can be seen as a ‘structural adjustment
programme for Europe’, but he has an important point to make. The (neo-
)liberalist rules and principles of global governance are constitutive of the core
principles of the EU. The Treaty of Rome was based on the free-trade principles
of the Bretton Woods system; the Maastricht Treaty and its amendments clearly
reflect the concerns of the post-Bretton Woods Washington consensus. This is
not a coincidence. Global relations of domination and hegemony are at play.7 A
vision of global reforms is thus required as well.
Norden could have a potentially important role to play in advocating and realising
global reforms. In contrast to the values of Norden, the globalising world is
characterised by:
• rapidly growing inequalities both within countries and globally (UNDP 1999);
• increasing de-limitation of the space of democratic self-determination (Dryzek
1996: particularly Chapters 2, 4 and 6; Teivainen 1994; 1997);
• reinforced dominance of the US and non-democratic systems of global
governance (Crawford and Marks 1998; Held 1995:74–120; Patomäki 1999a;
2001b: Chapter 3; and forthcoming);
These trends, coupled with slackening conditions for economic growth, caused
notably also by financial globalisation (Felix 1995:15–32) and consequent
outbursts of major financial crises (see Michie and Smith 1999), imply a deep
and growing legitimisation problem for the Washington consensus-led systems
of governance. Adherence to the values of democratisation, solidarity, de-
commodification and universalistic social policy imply a commitment to major
changes.
Nordic countries have a long tradition of activism in advocating global changes,
often together as in the UN (however, for their differences, see Wiklund 1996).
During the Cold War, the issues dominating the agenda were related to disarmament
and development. Since the mid-1980s, ecological issues (the Brundtland
Commission) and global governance (Commission on Global Governance, co-chaired
by Ingvar Carlsson) have assumed a central place on stage.
The Commission on Global Governance has proposed that there should be
a new pillar of the UN system, namely an economic security council, and that
the UN system should be funded by means of a global tax (see, for example,
Commission on Global Governance 1995; Carlsson 1995). A few days’ revenue
from a global tax on financial transactions would suffice to cover all present
UN debts and annual expenses. The tax would also enable many new global
socio-economic and political projects. Although the UN may already be
domesticated and impoverished by the USA and a new universal organisation
may be needed (see Patomäki forthcoming), steps towards a global tax seem to
be a prerequisite for more adequate principles of governance.
To be justifiable, social/democratic values and purposes must always be
dialogically tested: Are they sufficiently universalisable? They must also gain
concrete ethico-political support to be legitimate (but this in itself is an argument
for democratisation). The following sketchy programmatic vision of global reforms
builds upon the basis of the past and ongoing projects. Of course, there is a
multiplicity of possible strategies of materialising some of the values of Norden in
the emerging global political space. An ideal strategy, based on sufficiently
universalisable values, would tackle a core issue of economic globalisation, include
means to collect major revenues for public political purposes (which must be
determined democratically) and enable possibilities for peaceful and democratic
changes of the rules and principles of systems of global governance.
The re-emergence of global financial markets is perhaps the clearest instance
of economic globalisation. In the Keynesian tradition, it can also be argued as
being the key issue. Unfettered financial markets create collectively irrational
outcomes, increase uncertainty, create conditions for financial crises and compel
Beyond ‘Nordic nostalgia’ 233
states to follow similar, orthodox economic policies, with strong tendencies for
a deflationary bias and decreasing equality (see Kirshner 1999:315–16, 326;
Patomäki 200 1b, Chapter 3). ‘When the capital development of a country
becomes a by-product of the activities of a casino, the job is likely to be ill-done’
(Keynes 1961 [1936]:159). Like most IMF members, Finland, Norway and
Sweden have had first-hand experience of these effects. They experienced a
series of major banking and currency crises in the early 1990s.
To curb the power of the financial markets and seize some control over
them, at least four measures are needed. First, the horizon of investment decisions
have to be made more long-term; second, indebtedness, inter-dependencies and
actions of the investors have to be regulated; third, some control over financial
off-shores and tax havens must be seized; and, finally, the power of financial
flows has to be curbed by raising transaction costs.
Investment horizons can be lengthened at global, regional or national level,
simply by applying controls over short-term capital inflows (e.g. by minimum
non-remunerated reserve or holding requirements, applied in the 1990s by Chile
and Malaysia). Although, also, national and regional regulatory measures do
help, at least as temporary measures, re-regulation can best be implemented by
global arrangements. In particular, financial off-shores and tax-havens can only
be controlled by means of wide-scale collective action.
The best way to curb global financial flows and raise transaction costs is a
small global tax on all transactions. The idea, suggested by James Tobin in the
1970s (Tobin 1978), is simple: by implementing a low-rate tax on currency
transactions, many speculative movements would become unprofitable and the
financial system more stable. Simultaneously, the tax would yield huge revenues
for the international or global community (see ul Haq et al. 1996).
Realisation of the Tobin tax requires collective political action. As mentioned, the
Tobin tax is also an element in the Europeanist political strategy, as a unilateral tax
imposed by the European Central Bank. The premise is that ‘London and New York
are always going to prevent it’ from being implemented globally (Martin and
Schumann 1997:104). However, Hans-Peter Martin and Harald Schumann have
not explained how the EU could implement the Tobin tax unilaterally. A recent
Working Paper of the European Parliament summarises the standard view:
It thus seems that the minimum requirement is for at least all major financial
centres to be within the sphere of taxation, and that there is a system of global
governance of the tax (see Tobin 1996; Garber 1996).
234 Visions of world politics
However, it is, indeed, possible to modify the Tobin tax so as to enable any
grouping of relevant countries, preferably covering a substantial part of the
global financial markets, to start it at any time. (The proposal is made in Patomäki
1999b, and is developed further in Patomäki 2001b: Chapters 5–7.) Norden
could play a role in initiating this, by setting up the system and also finding
partners in Asia, Russia, non-EU parts of Europe and Latin America. Only for
Finland—the only Nordic country that has entered the third phase of the EMU—
would it be a prerequisite to have the EMU-Europe included from the outset.
The Tobin tax regime can be realised in two phases. In the first, a grouping
of countries would establish an open agreement—any state can join at any time—
and a supra-national body orchestrating the tax and collecting the revenues of
a small underlying transactions tax (10 basis points, at most); much bigger
exchange surcharge triggered by exceptional volatility (1–3 per cent or even
more); and a relatively high tax, up to 20 per cent in the case of non-cooperative
tax havens, on domestic currency lending to non-residents (only to non-residents
who are not yet within the tax regime). This arrangement would solve the tax
evasion problem and is economically sound. In the second phase, which would
be carried out either when all major financial centres and most other countries
have joined the first-phase system, or at latest by, say, year 2010, a universal
Tobin tax at a higher—yet absolutely low—0.5–1.0 per cent rate would be applied.
This arrangement is devised in such a manner that it would build up pressure
for the outsiders to join. In other words, since this model empowers actors to
work for the Tobin tax also without the support of the USA, the UK, the IMF,
etc., it is emancipatory in its implications.
Financial markets’ re-regulation and the establishment of the Tobin tax
constitute tackling a core issue of globalisation. The Tobin tax would also mean
the possibility of collecting major revenues for public political purposes. These
revenues could be used as leverage for other reforms. This, in turn, opens up a
new global ethico-political problematic. A global tax regime with sanctions and
surveillance systems, and potential for huge revenues—hundreds of billions of
US dollars—raises the whole problematic of political theory in the global context:
What are the principles of legitimation of collective organisations? How should
they be assessed in terms of material benefits and their distribution, rightful
authority, justice and democracy? There is a quest to re-think democratic
accountability, representation, participation and citizenship.
The strategy centred around the Tobin tax is both for the autonomy of
states and for new global institutional arrangements. The new institutional
arrangements would not hinder processes of globalisation but would rather
politicise, democratise and re-direct them (see Patomäki 2000a; 2001b). They
would also yield new resources for public political action. Some of these could
be used for global demand inducements to counteract deflationary tendencies.
Like in the Keynesian theory (see Keynes 1961 [1936]:372–84), demand
inducements should also contribute to global redistribution.
However, it is likely that financial re-regulation and the Tobin tax would not
suffice to reverse the tendencies towards increasing de-limitation of the space of
Beyond ‘Nordic nostalgia’ 235
Conclusion
In this chapter, I have argued that the Cold War and security-oriented
interpretation of Norden is, if not entirely incorrect, at least partially inadequate
and misleading. The global political-economy dynamic provides a better
explanation of the relative decline of Norden.
There has been a complex interplay of changes in the relations of production;
the dynamics of class identification and welfare state development; and
globalisation as the emergence of new exit options, the erosion of the embedded
liberalism of the original Bretton Woods system and the re-emergence of global
financial markets. These, together with the related cultural and socio-economic
changes, have transformed the structural conditions for labour- and capital-
based power mobilisation in Sweden and, mutatis mutandis, in other Nordic
countries. Together with the end of the Cold War, this interplay has also given
rise to irrealist and economistic stories about the requirements of ‘new times’;
of the gradual changes in the meaning of the notion of the Third Way; and the
neo-liberal framing of new social problems.
Tackling the real conditions of the decline of Norden demands a new social
democratic orientation in time and space. The nation-state can no longer provide
a sufficient framework for progressivist political action, and the socially flavoured
Wilsonian ‘idealism’ appears also to be a somewhat anachronistic basis for
‘progressivist internationalism’.
This is not to argue that politics occurs now only outside sovereign states. Rather,
local and national struggles are essentially connected to regional and global struggles,
and therefore they cannot be taken as separate spheres any more (‘first progressivism
at home, and then exportation of these universalist ideals to the rest of the world’).
Along these lines, I discussed two programmatic visions that should take
Nordics beyond the ‘Nordic nostalgia’. The Europeanist strategy is based on a
new positive understanding of the ‘logic of stateness vis-à-vis the free play of
market forces’, this time articulated as the ‘pursuit of regionness’. Yet, this strategy
236 Visions of world politics
Notes
The point of After International Relations is not that relations between states (or
nations) would have disappeared. Many states remain important actors. A few
may be bigger and more capable than ever. Yet two major changes justify the
evocation of the idea about ‘after international relations’. First, one hundred
years after the exhaustion of geographical space by the European empires, and
the height of the pre-1914 capitalist economic inter-dependence, the world has
become even smaller and more vulnerable. The epoch of territorial empires
and geo-politics in the original sense is over. There are now about 200 formally
independent states, and a majority of them claim to be democratic. Globalisation
talk is often geo-historically myopic and ideologically biased. It is nonetheless a
sign of the times. It has become increasingly hard to sustain the illusion of the
inside/outside—or, worse, society/state of war—dichotomy.
Second, the epistemological and ontological ground has shifted. The problematic
articulated by Hume and Kant has become increasingly anachronistic. There is a
realist alternative to the positivist interpretation of the natural sciences. Although
the conceptions, opportunities and problems associated with the rise and expansion
of capitalism may be part of everyday practices more than ever, a majority of social
scientists, outside orthodox economics and its extensions, have renounced the
capitalist conception of man as the basis for social and political theory. Also, the
problem of building a modern state cannot appear the same any longer. Whereas,
for instance, in Africa this project turned out to be disastrous, the European states
have become involved in the long-term process of building an integrated security
community and partial union. Many states’ structures, also outside Europe, have
become regionalised or globalised. Last but not least, as has been argued in this
book, there is now an alternative to the problem of order. Besides criticising the
theoretical underpinnings of this problem, I have proposed that a security
community, whether pluralist or amalgamated, is essentially dependent on the
self-transformative capacity of relevant contexts.
Every problem, whether practical-political or more theoretical, lays on a set
of presuppositions. The aim of After International Relations has been to establish a
way out of the problematic that has puzzled so many great thinkers and scholars
of the past 200 years. By showing how and why the theories based on the
international problematic have failed (Part I), by articulating an alternative,
237
238 Epilogue
critical realist research programme (Part II) and by illustrating how this research
programme can be put to work (Part III), my hope is that this book will enable
better research and other practices. Although this may sound overly ambitious,
it, in fact, builds upon the contribution of numerous philosophers, social theorists
and critical IR and GPE researchers. Simultaneously, it is at best only an
opening.1
Marx (1968 [1852]:96) suggested that everything in history repeats itself.
The first time is a tragedy, the second a farce. Perhaps there will be an epilogue
to international relations too? The problem is that ‘in the past, the worst disasters
could not kill mankind’ (Jaspers 1961:318). Would a replay as a farce be possible?
History is open, and we can only guess what story will fit. Meanwhile, we can
try to learn from past mistakes, past trials and errors, and try to build, on that
basis, better models and stories. We know that only learning and change can
make a global security community possible.
Notes
1 Where this book ends, other books start. Democratising Globalisation. The Leverage of the
Tobin Tax (2001b) is an analysis of the consequences of the re-emergence of global
finance. It explains the instability of financial markets and the role of finance in global
relations of hegemony and domination. It makes a normative case for a particular
concrete Utopia—the Tobin tax—linked to a regulated currency regime, governed by a
democratic organisation. Democratising Globalisation will be followed by Critical Realist
Global Political Economy, a book that attempts to remedy the absence of power in
economics and the absence of economics in GPE. Time and Space in Republican Theory.
Rethinking Participation and Representation in the Global Age has been in the making for
years. It aims to re-think the nature of public political spaces and political participation
and representation under the conditions of the late-modern globalising world. Most
importantly, it purports to do this concretely, not just abstractly.
Bibliography
——(1988a) ‘The dialectical logic of Thucydides’ Melian dialogue’, American Political Science
Review 82(3):805–820.
——(1988b) ‘Politics as political argumentation: Towards regrounding a global
discipline’, paper presented at the XIVth IPSA World Congress, 31 August,
Washington DC.
——(1989) ‘Uppsala fireworks: Data-based thoughts on the origins and possible
obsolescence of the European state system’, in [Link] (ed.) States in Armed
Conflict 1988, Report No. 30 (July), Uppsala University: DPCR, pp. 19–34.
——(1991) ‘Making peaceful sense of the news: Institutionalizing international conflict
management events reporting using frame-based interpretative routines’, paper
presented for the panel of DDIR project at ISA, Vancouver, BC, 19–23 March.
——(1992) ‘The humanistic movement in international studies: Reflections on Machiavelli
and Las Casas’, International Studies Association, 1992 presidential address,
International Studies Quarterly 36(4):347–71.
——(1996) Rediscoveries and Reformulations., Cambridge: Cambridge University Press.
Alker, Jnr, H., Bennett, J. and Mefford, D. (1980) ‘Generalized precedent logics for
resolving insecurity dilemmas’, International Interactions 7(2):165–206.
Alker, Jnr, H. and Biersteker, T. (1984) ‘The dialectics of world order: Notes for a
future archeologist of international savoir faire’ , International Studies Quarterly 28(2):
121–42.
Alker, Jnr, H., Biersteker, T. and Inoguchi, T. (1989) ‘From imperial power balancing to
people’s wars: Searching for order in the twentieth century’, in [Link] Derian and
[Link] (eds) International/Intertextual Relations. Postmodern Readings of the World Politics,
Massachusetts: Lexington Books, pp. 135–62.
Alker, Jnr, H. and Christensen, C. (1972) ‘From causal modelling to artificial intelligence:
The evolution of a UN peace-making simulation’, in [Link] and [Link]
(eds) Experimentation and Simulation in Political Science, Toronto: University of Toronto
Press, pp. 177–224.
Alker, Jnr, H., Duffy, G., Hurwitz, J. and Mallery, J.C. (1991a) ‘Text modeling for
international politics: A tourist’s guide to RELATUS’, in [Link] (ed.) Artificial
Intelligence and International Politics, Boulder, CO: Westview Press, pp. 97–126.
Alker, Jnr, H. and Haas, P.M. (1993) ‘The rise of global ecopolitics’, in [Link] (ed.)
Global Accord: Environmental Challenges and International Responses, Cambridge, MA: MIT
Press , pp. 133–71.
Alker, Jnr, H. and Hurwitz, R. (1980) Resolving Prisoner’s Dilemmas, Cambridge, MA:
APSA.
——(2001) ‘Reasons, causes, games, lessons and evolution in inequitable sequential
prisoner’s dilemmas’, paper presented at the 42nd Annual Convention of the
International Studies Association in Chicago, 20–4 February.
Alker, Jnr, H, Lebedva, M., O’Donnell, S. and Polanyi, L. (1991b) ‘Retelling cold war
stories: Uncovering cultural meanings with linguistic discourse analysis’, prepared
for IVth US—Soviet Workshop on Models and Concepts of Interdependence Between
Nations, Moscow, 26–8 June.
Alker, Jnr, H. and Russett, B. (1965) World Politics in the General Assembly, New Haven,
CT: Yale University Press.
Alker, Jnr, H. and Sylvan, D. (1994) ‘Kuinka diskurssianalyysi voi edesauttaa
politiikkatieteen kehittymistä’ [Political discourse analysis], transl. into Finnish by
[Link]äki, Kosmopolis 24(3):5–25.
Amin, A. (1994) ‘Post-Fordism: Models, fantasies and phantoms of transition’, in [Link]
(ed.) Post-Fordism. A Reader, Oxford: Blackwell, pp. 1–39.
Amin, T. (1985) Nationalism and Internationalism in Liberalism, Marxism and Islam, Islamabad:
Bibliography 241
Gadamer, H.-G. (1975 [1965]) Truth and Method, trans. [Link] and [Link]
(eds), USA: The Seabury Press.
——(1977) Philosophical Hermeneutics, [Link] ed. [Link], Berkeley, LA: University
of California Press.
Gaddis, J.L. (1992) ‘International relations theory and the end of the Cold War’,
International Security 17(3):5–58.
Galbraith, J. (1967) The New Industrial State, Boston: Houghton-Mifflin.
Galtung, J. (1977) Methodology and Ideology. Theory and Methods of Social Research Vol.1,
Copenhagen: Christian Ejlers.
——(1988) Methodology and Development. Essays in Methodology, [Link], Copenhagen:
Christian Ejlers
——(1990) ‘Cultural violence’, Journal of Peace Research 27(3):291–305.
——(1996) Peace by Peaceful Means, London: Sage, with PRIO.
Garber, P. (1996) ‘Issues of enforcement and evasion in a tax of foreign exchange
transactions’, in [Link] Haq, [Link] and [Link] (eds) The Tobin Tax. Coping with
Financial Volatility, Oxford: Oxford University Press, pp. 129–12.
Garrett, G. (1998) Partisan Politics in the Global Economy, Cambridge: Cambridge
University Press.
Garst, D. (1989) ‘Thucydides and neorealism’, International Studies Quarterly 33(1):3–
27.
George, J. (1989) ‘International relations and the search for thinking space: Another
view of the third debate’, International Studies Quarterly 33(3):269–79.
Giddens, A. (1976) New Rules of Sociological Method. A Positive Critique of Interpretative
Sociologies, London: Hutchinson.
——(1977) Studies in Social and Political Theory, London: Hutchinson.
——(1979) Central Problems in Social Theory, Los Angeles: University of California Press.
——(1981) A Contemporary Critique of Historical Materialism, Vol.1, Berkeley: University of
California Press.
——(1983) ‘Comments on the theory of structuration’, Journal for the Theory of Social
Behaviour 13:75–80.
——(1985a) ‘Marx’s correct view on everything’, Theory and Society 14:167–74.
——(1985b) The Nation-State and Violence. Volume Two of the Contemporary Critique of Historical
Materialism, Oxford: Polity Press.
——(1987a) Social Theory and Modern Sociology, Oxford: Polity Press.
——(1987b) The Nation-State and Violence. Volume Two of a Contemporary Critique of Historical
Materialism, Cambridge: Polity Press.
——(1998) The Third Way: The Renewal of Social Democracy, Cambridge: Polity.
Gill, S. (ed.) (1993) Gramsci, Historical Materialism and International Relations, Cambridge:
Cambridge University Press.
——(1997a) ‘Analysing new forms of authority: New constitutionalism, panopticism
and market civilization’, paper presented at the conference Non-State Actors and
Authority in the Global System, Warwick University, 31 October–1 November
1997.
——(1997b) ‘An emu or an ostrich? EMU and neoliberal globalisationlimits and
alternatives’, in [Link] and [Link]äki (eds) The Politics of Economic and
Monetary Union, Dordrecht: Kluwer, pp. 207–31.
Gill, S. and Law, D. (1988) Global Political Economy. Perspectives, Problems and Policies, New
York: Harvester Wheatsheaf.
——(1989) ‘Global hegemony and the structural power of capital’, International Studies
Quarterly 33(4):475–99.
Gills, B. and Palan, R. (1994) ‘Introduction: The neostructuralist agenda in
246 Bibliography
Brighton.
Isaac, J.C. (1987) Power and Marxist Theory. A Realist View, Ithaca: Cornell University
Press.
——(1990) ‘Realism and reality: Some realistic reconsiderations’, Journal for the Theory
of Social Behaviour 20(1):1–31.
Jabri, V. and Chan, S. (1996) ‘The ontologist always rings twice: Two more stories
about structure and agency in reply to Hollis and Smith’, Review of International
Studies 22(1): 107–10.
Jaspers, K. (1961) The Future of Mankind, trans. [Link], Chicago: The University of
Chicago Press.
Jerneck, M. (1990) ‘Olof Palme—en international propagandist’, in [Link] and
[Link] (eds) Socialdemokratin of svensk utrikespolitik. Från Branting till Palme,
Stockholm: Utrikespolitiske institut, pp. 121–42.
Joenniemi, P. (1996) ‘Norden: A community of asecuriry?’, Copenhagen: Working
Papers 6, Copenhagen Peace Research Institute.
Kant, I. (1956 [1788]) Critique of Practical Reason, trans. [Link], New York: Macmillan.
——(1983) Perpetual Peace and other Essays, Indianapolis: Hackett Publishing Company
[‘Idea for a universal history’, 1784‘May be true in theory, but is of no practical
use’, 1793‘Perpetual peace’, 1795].
Kaplan, M. (1969) ‘The new great debate. Traditionalism vs. science in international
relations’, in [Link] and [Link] (eds) Contending Approaches to International
Politics, Princeton, NJ: Princeton University Press, pp. 39–61 [first published in the
October 1966 issue of World Politics].
Katzenstein, P. (1985) Small States in World Markets, Ithaca, New York: Cornell University
Press.
Keat, R. (1981) The Politics of Social Theory. Habermas, Freud and the Critique of Positivism,
Oxford: Basil Blackwell.
Keat, R. and Urry, J. (1975) Social Theory as Science, London: Routledge & Kegan Paul.
Kekkonen, U. (1970) ‘The common neutrality of the Nordic countries’, a speech
presented in Helsinki on 23 January 1952 (the pyjama pocket speech), in
[Link] (ed. [Link]) Neutrality. The Finnish Position, London: Heinemann,
pp. 53–6.
Keohane, R.O. (1986) ‘Theory of world politics: Structural realism and beyond’, in
[Link] (ed.) Neorealism and Its Critics, New York: Columbia University Press,
pp. 158–203.
Keynes, J. (1961 [1936]) The General Theory of Employment, Interest and Money, London:
Macmillan.
Kirshner, J. (1999) ‘Keynes, capital mobility and the crisis of embedded liberalism’,
Review of International Political Economy 6(3):313–37.
Klein, B.S. (198 8) ‘Hegemony and strategic culture: American power
protection and alliance deference polities’, Review of International Studies,
14:133–48.
——(1990) ‘How the West was one: Representational politics of NATO’, International
Studies Quarterly 34:311–25.
Kokaz, N. (2001) ‘Moderating power: A Thucydidean perspective’, Review of
International Studies 27(1):27–49.
Koskenniemi, M. (1989) From Apology to Utopia. The Structure of International Legal Argument,
Helsinki: Finnish Lawyer’s Publishing.
Krasner, S. (1982) ‘Structural causes and regime consequences: Regimes as intervening
variables’, International Organization 36(2):185–205.
Kratochwil, F. (1993) ‘The embarrassment of changes: Neo-realism as the science of
Bibliography 249
Michie, J. and Smith, J. (eds) (1999) Global Instability. The Political Economy of World
Economic Governance, London: Routledge.
Minkkinen, P. and Patomäki, H. (1997) Yhdentymisen ja rahaunionin politiikkaa.
Suomija Emu globaalissa poliittisessa taloudessa [Politics of Integration and
Monetary Union Finland and E M U in the Global Political Economy],
Helsinki: Like.
Morgenthau, H. (1946) Scientific Man versus Power Politics, Chicago: University of Chicago
Press.
Moore, Jnr, B. (1969) Social Origins of Dictatorship and Democracy. Lord and Peasant in the
Making of the Modern World, Penguin: Harmondsworth.
Moses, J. (19 97) ‘Trojan horses: Putman, E C U linkage and the E U
ambitions of Nordic elites’, Review of International Political Economy
4(2):382–415.
Naess, A. (1972) The Pluralist and Possibilist Aspect of the Scientific Enterprise, London :
Allen & Unwin.
Neufeld, M. (1993a) ‘Reflexivity and international relations’, Millennium: Journal of
International Studies 22(1):53–76.
——(1993b) ‘Interpretation and the “Science” of international relations’, Review of
International Studies 19(1):39–61.
Neumann, I. (1994) ‘A region-building approach to Northern Europe’, Review of
International Studies 20(1):53–74.
Nicolson, H. (1939) Diplomacy, London: Oxford University Press.
Norrie, A. (1991) Law, Ideology and Punishment, Dordrecht: Kluwer.
——(1993) Crime, Reason and History, London: Butterworths.
——(2000) Punishment, Responsibility and Justice. A Relational Critique, Oxford: Oxford
University Press.
Olafson, F.A. (1979) The Dialectic of Action. A Philosophical Interpretation of History and the
Humanities, Chicago: The University of Chicago Press.
Ollman, B. (1971) Alienation. Marx’s Conception of Man in Capitalist Society, Cambridge:
Cambridge University Press.
Onuf, N.G. (1989) World of Our Making. Rules and Rule in Social Theory and International
Relations, South Carolina: University of South Carolina Press.
——(1995) ‘Levels’, European Journal of International Relations 1(1):35–58.
——(1998) The Republican Legacy in International Thought, Cambridge: Cambridge
University Press.
Ordershook, P.C. (1986) Game Theory and Political Theory. An Introduction, Cambridge:
Cambridge University Press.
Palmer, R.E. (1969) Hermeneutics. Interpretation Theory in Schleiermacher, Dilthey, Heidegger
and Gadamer, Evanston: Northwestern University Press.
Palonen, K. (1987) ‘The art of the possible on the periphery. [Link] and Urho
Kekkonen in the realpolitik tradition’, in [Link], and [Link] (eds) Transformation
of Ideas on a Periphery. Political Studies in Finnish History, Helsinki: Finnish Political
Science Association, pp. 98–115.
Patomäki, H. (1989) ‘A metatheoretical and methodological critique of neorealism:
Towards a fourth debate in international relations research’, Occasional Papers 8,
1989, Turku: Political Science, University of Turku.
——(1992a) ‘What is it that changed with the end of the cold war? An analysis of the
problem of identifying and explaining change’, in [Link] and [Link] (eds)
The End of the Cold War. Evaluating Theories of International Relations, Dordrecht: D.
Reidel, pp. 179–225.
——(1992b) ‘From normative Utopias to political dialectics: Beyond a deconstruction of
Bibliography 251
108–48.
Schürmann, R. (1984) ‘Deconstruction is not enough: On Gianni Vattimo’s
call for “weak thinking”’, Graduate Faculty Philosophy Journal 10(1)
(spring):165–77.
Searle, J. (1983) ‘The world turned upside down’, The New York Review of Books, October
27:74–9.
——(1996) The Construction of Social Reality, London: Penguin.
Sharp, G. (1984) The Politics of Nonviolent Action. Part One: Power and Struggle, Boston:
Porter Sage Publishers.
Shatzki, T. (1988) ‘The nature of social reality’, Philosophy and Phenomenological Research
XLIX:239–60.
Shotter, J. (1983) “‘Duality of structure” and “intentionality” in an ecological
psychology’, Journal for the Theory of Social Behaviour 13:19–43.
Singer, J. (1961) ‘The level-of-analysis problem in international relations’, in [Link]
and [Link], (eds) The International System, PrincetonPrinceton University Press,
pp. 77–92.
——(1969) ‘The incomplete theorist: Insight without evidence’, in [Link] and J.N.
Rosenau (eds) Contending Approaches to International Politics, Princeton, NJ: Princeton
University Press , 62–86.
Skinner, Q (1988) Meaning and Context. Quentin Skinner and his Critics, ed. [Link],
Cambridge: Polity Press.
Smith, M.J. (1986) Realist Thought from Weber to Kissinger, London: Lousiana State
University Press.
Smith, S. (1994) ‘Foreign policy theory and the new Europe’, in [Link] and
[Link] (eds) European Foreign Policy. The EC and Changing Perspectives in Europe, London:
Sage, pp. 238–73.
de [Link], G.E.M. (1972) The Origins of the Peloponnessian War, New York: Cornell
University Press.
Stiglitz, J. (1998) ‘More instruments and broader goals: Moving toward the post-
Washington consensus’, Wider Annual Lectures 2, UNU WIDER (World Institute
for Development Economics Research Helsinki), available online at: http://
[Link]/[Link].
Suganami, H. (1983) ‘The structure of institutionalism: An anatomy of
British mainstream international relations’, International Interactions
7(5):2363–81.
Sylvan, D. (1991b) ‘Insides, outsides, and walls in Classical Greece’, paper presented
at the XVth World Congress of the IPSA, 21–5 July, Buenos Aires.
——(1991a) ‘The qualitative—quantitative distinction in political science’, Poetics Today
12(2):267–86.
Sylvan, D. and Glassner, B. (1985) A Rationalist Methodology for the Social Sciences, Oxford:
Basil Blackwell.
Taylor, M. (1976) Anarchy and Cooperation, London: John Wiley & Sons.
Teivainen, T. (1994) ‘Politics of economism and the containment of changes in the
global political economy’, in [Link]äki (ed.) Peaceful Changes in World Politics,
Tampere: Tapri Research Report No. 71, pp. 81–121
——(1997) ‘The independence of the European Central Bank: Implications for
democratic governance’, in P Minkkinen and [Link]äki (eds) The Politics of Economic
and Monetary Union, Dordrecht: Kluwer, pp. 55–75.
Therborn, G. (1997) ‘Europe in the 21st century: The world’s Scandinavia?’, Irish
Studies in International Affairs 8:21–34.
Thucydides (1843 [1628/c. 400 BC]) The History of the Grecian War, [Link], in
254 Bibliography
University Press.
Wallensteen, P. and Sollenberg, M. (2000) ‘Armed conflict 1989–98’, Journal of Peace
Research 37(5):635–49.
Waltz, K. (1959) Man, the State and War, New York: Columbia University Press.
——(1979) Theory of International Politics, New York: Addison-Wesley.
——(1986) ‘Reflections on “theory of international politics”: A response to my critics’,
in R. Keohane (ed.) Neorealism and Its Critics, New York: Columbia University Press,
pp. 322–46.
Walzer, M. (1986) ‘The politics of Michel Foucault’, in [Link] (ed.) Foucault. A Critical
Reader, Oxford: Basil Blackwell, pp. 51–68.
Weber, M. (1969 [1949]) ‘Ideal types and theory construction’, in [Link] (ed.)
Readings in the Philosophy of Social Sciences, New York: The Macmillan Company, pp.
496–507.
——(1978 [1956]) Economy and Society. An Outline of Interpretative Sociology, ed. [Link] and
[Link], Berkeley: University of California Press [original published in German
in 1921].
——(1985 [1919]) ‘Politics as vocation’, in [Link] (ed.) Weber: Selections in
Translation, trans. [Link], Cambridge: Cambridge University Press, pp. 212–
25.
Weiss, L. (1998) The Myth of the Powerless State, New York: Cornell University Press.
Wendt, A. (1987) ‘The agent-structure problem in international relations theory’,
International Organization 41(3):335–70.
——(1991) ‘Bridging the theory/meta-theory gap in international relations’, Review of
International Studies 17(4):383–92.
——(1992a) ‘Anarchy is what states make of it’, International Organization 46(2):391–
425.
——(1992b) ‘Levels of analysis. Agents and structures: part III’, Review of International
Studies 18(2):181–5.
——(1994) ‘Collective identity formation and the international state’, American Political
Science Review 88(2):384–96.
——(1999) Social Theory of International Politics, Cambridge: Cambridge University Press.
Widfeldt, A. (1998) ‘Swedish election report. The Swedish parliamentary election of
1998’, ECPR News 10(1):18–19.
Wight, C. (1999) ‘They shoot dead horses don’t they? Locating agency in the agent—
structure problematique’, European Journal of International Relations 5(1):109–42.
——(2001) ‘Linking agents and structures to levels of analysis’, paper presented at the
42nd Annual Convention of the ISA, Chicago, 20–4 February.
Wight, M. (1966) ‘Why is there no international theory’, in [Link] and [Link]
(eds) Diplomatic Investigations. Essays in the Theory of International Politics, London: George,
Allen & Unwin, pp. 17–34.
Wiklund, L. (1996) ‘Norden 1 FN’, Internasjonal Politikk 54(1):75–83.
Williamson, O. (1975) Markets and Hierarchies, New York: Free Press.
Winch, P. (1971 [1958]) The Idea of Social Science and its Relatio n to Philosophy, London:
Routledge & Kegan Paul.
Wittgenstein, L. (1979) On Certainty, ed. [Link] and [Link] Wright,
Oxford: Basil Blackwell.
World Bank (1997) World Development Report 1997. The State in a Changing World, Oxford:
Oxford Univeristy Press.
von Wright, G.H. (1971) Explanation and Understanding, London: Routledge & Kegan
Paul.
Young, O. (1969) ‘Aron and the whale: A Jonah in theory’, in [Link] and
256 Bibliography
absence 7, 8, 18, 41, 129 project 61, 65, 66; and ‘emancipatory
abstractions 127–8, 130 humanism’ 46–7; four phases of 44–7;
action: communicative 105, 110, 111; on history writing 58–9; and
complex causes for 81, 91–2, 119; and importance of openness 44; on Jesus
context 119–20; dependent on reality story 56–7; and learning process 43–4;
144–5; dramaturgical 110, 111; limitations of 64–6; on mathematical
freedom of 119–20; and internal and social science 135; and modelling
external relations 117; meaningful 119; approaches 56–61; as President of the
need for theory of 116; normatively ISA 64, 66; on social ontology 99; on
regulated 110; possibilities 131; and temporal motivation 140; theory of
rules 102–3, 108; rationalisation of conflicts 44 on Thucydides 167, 173,
151; relationship with power 112–14; 191; United Nations studies 44–6,
relationship with structures 109, 47–8, 54, 68; work on collective
116–17; social 108–12, 116, 136, 173; dilemmas (Prisoner’s Dilemma) 51–6,
strategic 111, 152; teleological 110, 65; and world history theorising 61–3
111; transformative 120 amalgamated community 194, 195, 196,
actors: as components of a causal 201–2
complex 87–8, 119 and context 119, ambiguity 103–4, 106–7
120; co-operation among 111; geo- ambivalence 103–4
historical constitution of 91, 137–8, Amin, Tahir 61
139, 140; ‘internal’ and ‘external’ 80; analogy 127, 128, 130, 197
knowledge of 126, 131; linked by analysis: historical vs. structural 91
understanding (Habermas) 104–5; and anarchy 33, 74, 191
misinformation 227; and practical Archer, Margaret 7
wisdom 159; in security community Archimadus, King 187
200–1; social 102; states as 71–2 Arendt, Hannah 143, 156
Adler, Emanuel 198, 203–4, 209 Aristotle 99, 102, 159, 188, 191
Adorno, Theodor 6 arms races 127
Africa 237 artificial intelligence 48, 61
agency 151, 152 Ashley, Richard 13, 33, 61, 69, 71
agency-structure problem 72, 73, 74 astronomy 22
aitia 179 Athens: aggressive imperialism 179, 182,
Alcibiades 180, 192 183–8, 190; conduct of Peloponnesian
Alker, Hayward R. 11–13, 41, 43–69 War 168, 179; debates and voting
collection and analysis of data 47–51; 184–5; development of 192; eventual
and critical realism 44; criticism of defeat 188; and the gods 192; loss of
scientific and statistical practice 45–6, legitimate leadership 182; and Melos
47–51; on data-coding procedure s episode 170–1, 174–9, 180, 181,
134; ‘Dialectics of World Order’ 182–8; as misleading concept 180
257
258 Index
realpolitik of 176, 178, 185, 189; Bull, Medley 39, 40, 65–6
resources 182 Bunge, Mario 82
atomism 22, 29; entities 3, 135; Butterfield, Herbert 36
individuals 29, 31, 32; social 25; states
78; world 78 cannibals 29
Attica 184 capital accounting 24
Augustine, St 29, 34, 36 capitalism 21, 133, 237; alienation and
Axelrod, Robert 55 atomism of 203; central to IR 38;
control strategies (EU) 229, 230;
Bacon, Francis 22, 30 development of 24, 25–6, 29, 83–4,
balance of power 21, 28, 31–2, 42, 167; 134; and globalisation 38, 79, 84, 118,
Alker on 62–3 224; link with positivism 25–6; and
Bank for International Settlements 205 problem of order 30–1; as social
banking crises 221, 223, 233 system 118; and strategic action 111;
banks, international 134, 230, 233 Swedish model 217
Barnett, Michael 198, 209 Carlsnaes, Walter 71, 72, 87
Becker, Gary 4 Carlsson, Ingvar 232
behaviouralism 43, 45, 73, 88, 95 cartography 26, 27
Bennett, James 54–5 Catholics, Roman 27
Bentham, Jeremy 2, 25 causal complexes 78–9, 119–20; and
Berkeley, George 2, 22 iconic models 123, 126, 129–33; and
Bernstein, Richard 123 time and space 137
Bhaskar, Roy 5–7, 9, 14, 35, 162; on causal efficacy 151
action 116; Alker and 64, 65; on causal hypotheses 132
atoms 135; on communicative action causal powers 88
105; concepts of structure 100, causal stories 87–9
114–15; on empirical invariances 76; causality: Alker’s ontological shift from
explanatory emancipation concept 16, mechanical model 48; critical realist 5,
144, 152–5, 156, 158, 160; on 8, 15, 76; Deutsch and 197; dialectical
Horkheimer 18; on interference with 104; and empirical invariances 75, 76;
object by subject 151, 152; on IR and hierarchies 83; Humean 48, 75,
levels 94; and normative discourse 76, 78; Kantian moral reason cut off
158; on philosophy and realism 8; from 4; and levels-of-analysis problem
on positivism 26, 49; on reason 88; on 72–3, 74–5; positivist 3; and post hoc ergo
science 100; on social structure 101–2, propter hoc-fallacy 76, 135; and
108; on society 150; TMSA 99, 102, quantitative data 134–5
108; on truth 144, 145, 146–7, 148, causation: Hume’s scepticism about 23
153–4, 158, 160; on vagueness and cause: ancient Greek attitude to 172; as an
ambiguity 103–4 INUS-condition 76–7, 91, 92, 101,
Biersteker, Thomas 61, 62, 63 102, 108, 119, 131, 132
Bill of Rights (1689) 27 centre-periphery structures 60, 128, 138–9
Blair, Tony 210, 236 Chammah, A. 55
Bolshevik Revolution 113, 143 Chan, Stephen 71
book-keeping 24, 134 change 43–4
Boulding, Kenneth 43 Chase-Dunn, Christopher 64
Bourdieu, Pierre 7 Chile 45, 233
bourgeois man 10, 17 Choucri, Nazli 50, 69
Braudel, Ferdinand 58 Christianity 21, 29–30, 34, 36
Bretton Woods system 3, 16, 85, 205, citizenship 230–1
211, 215, 217, 228, 230, 231 Cleon 184–5, 186
British Institutionalism 78 Clinton, Bill 210, 236
Brundtland Commission 232 Club of Rome 58, 60
Bruno, Giordano 22 coding practices 50–1, 57
Index 259
Cold War 113, 193, 194, 232; Alker and de-commodification 215–16
44; end of 3, 16, 76, 212, 213, 214; deconstruction 41
Norden and 211–13, 214, 215, 217, 235; deductive-nomological model 75, 91
and political realism 39, 43; second 215 Delian league 186
collective identity 198–9 democracy: global 224, 229; liberal 224
collective security 62–3 democratic peace hypothesis 130, 193–4
Collier, Andrew 7, 154, 158 democratic peace theory 3, 93–4
colonialism 27, 28, 29, 84 Denmark 211, 221, 223, 224
Commission on Global Governance 232 Dependencia school 60
commodification 231see also dependency relations 60; as levels 83–4
de-commodification dependent variable 73, 74
communication 136, 173 depth 81–2; of explanation 91
Communist Party 113 Derrida, Jacques 7, 40–1, 154–5, 157, 231
comprehensive views 150 Descartes, René 2, 22
computational hermeneutics 57 desecuritisation 208
computers 57, 66, 68, 94 Dessler, David 75, 99
Comte, Auguste 35, 76, 99 Deutsch, Karl 15, 55, 60, 127; on learning
configurating 140 43, 152–3; on power 185–6; and
constant conjunctions 76 security community concept 41,
constructivism 4, 39, 108 194–5, 196–200, 201–2, 208, 209
context:of action 119–20; and deviant cases 51
quantitative data 134; see also social dialectical argumentation 125
context dialectical causality 104
contradictions 104, 106–7 dialectical exchange 71
contrastive demi-regularity 129 dialectics: definition 173; development of
Copernicus, Nicolas 22 consciousness 54; formalised (Rescher)
Cornford, F. 172–3 106–7, 174–6; and Melian dialogue
correspondence: as metaphor 145; and 173–9; of open, stratified systems
theory of truth 145–6 117–18
critical realism (CR) 1, 4–5; agency/ ‘Dialectics of World Order’ project 61, 65, 66
structure debate 71; components of 77; dilemmas, collective 12, 51–6
core notions of context and complex Diodotus 185
119; the ‘critical’ aspect diplomacy 28
9–10; development of 5–7; and discourse, normative 158, 160
epistemological relativism 8–9; and discourse analysis 39
judgemental rationalism 9; domination, systems of 199, 201, 204,
methodology 13–15; and ontological 205, 232
realism 8; see also specific aspects double hermeneutics 14, 103, 109, 111,
criticism 158 160; of iconic modelling 123–42
Cunning of Reason 34 dualism 64, 89
currency transaction tax 17, 230, 233–4 Durkheim, Emile 99, 108
cybernetics 48, 60, 127
East Timor 139
Darwin, Charles 34 EC see European Community
Darwinism 34 ecology 232, 235
data: coding procedures 50–1, 130; econometrics 135
collection and analysis 47–51, 134; economic cooperation 4
and epistemology 49–50; nature of elite discourses 225, 227
11–12; and ontology 48, 51; primary emancipation 9, 10, 16, 228–9; and
14; qualitative 135–7; quantitative security community 203; see also
133–5; theory-dependence of 90; explanatory emancipation
verbal 48 emancipatory explanation 144
Declaration of the Rights of Man 2 emancipatory humanism 46–7
260 Index
governance 229, 231–5; inequalities Hollis, Martin 13, 71–5, 89, 91, 99
232; political innovations 224; ‘horizon’ 142
regulation 233; relations of domination Horkheimer, Max 6, 18
204, 205, 232; and security community Human Development Index 223
concept 203–5; social relations 215; humanism 29
strategy 17; surveillance 139; taxation Hume, David 2, 21, 25, 29; ampliative
232, 233–5, 236 approach of 125–6; as an anachronism
Glorious Revolution (1688) 27 237; on balance of power 28, 31–2, 42;
Gold Standard 84 on causality 48, 75, 76, 78; Christian
Gorbachev, Mikhail 3, 113, 214 realists and 36; conception of human
government debt 223 nature 36; critique of ontology 22–4;
graphical argumentation techniques 174, on external relations 104; and ‘great
176, 177 debates’ of international relations 36–8;
Greenwood, John 103 on growing power of Britain 28; on
Grunberg, Isabelle 68 justice 25, 30; liberals and 34;
Gulf Cooperation Council 195 Morgenthau on 35; Nietzsche and 36;
political career 28; posivitism 11, 22–4;
Habermas, Jürgen 5, 18, 41, 54; on post-positivism and 4; and problem of
rationality 162; theory of social action order 30; on reality 23; on reason 32;
109–10, 121; theory of ‘system’ and scepticism about causation 23; on
‘Lebenswelt’ 112; theory of selfishness of man 25; on Thucydides
understanding 89–90, 104–5 167, 190
Halley, Edmond 22 Hurwitz, Roger 54, 55, 65
Harré, Rom 5, 14, 64, 88; on truth 147, 148 hypocrisy 156
Harvey, David 138
haute finance 84 ‘icon’ 129
Hegel, Georg 34, 35, 37 iconic augmentation 141
Hegelian theory: development of iconic modelling 14, 66, 78–9, 80, 100,
consciousness 54; master-slave 109, 123, 173, 189, 190; ampliative
argument 213 and reductive approaches 124–6;
Heidegger, Martin 133, 142 analogies, metaphors and metonyms
Hellas 168, 180–2, 186–7, 188, 190; 128–9, 130–1, 132; and causal
notion of honour 183 complexes 123, 126, 129–33; double
Hellenes 180, 192 hermeneutics of 123–42; empirical
hermeneutic circle 136 evidence 133; homoeomorphic 108,
hermeneutics 35, 68, 89, 99, 102, 123, 127; hypotheses of 131–3; idealisations
150; computational 57; second level and abstractions 127–8, 130; of Melos
problem 136; understanding 72; see episode 179–80, 189, 190, 192;
also double hermeneutics paramorphic 126–7qualitative
Herz, John 84 evidence 135–7quantitative data
Hettne, Björn 229 133–5; space and time in 137–41;
hierarchical organisations 83, 111, 118 teleiomorphic 108–9, 110, 123, 127–9;
Hirschman, Albert 29, 30 types of 126–9
historical analysis 91 idealisations 127–8, 130
historical research 46, 56 idealism 2, 33, 34–5, 36, 39, 40–1
history: and causality 10; and foreign illusion 21; speculative 158
policy 55, 58; lessons of 159; openness IMF see International Monetary Fund
of 200, 208, 238; problem of ending imperialism 237; see also colonialism
140; theorising 12, 61–3, 66, 70; individualism 99
writing 58–9 induction, problem of 23, 31
Hobbes, Thomas 2, 21, 22, 25, 29, 105; Industrial Revolution 83–4
on book-keeping 24–5; and problem Inoguchi, Takashi 61, 62, 63
of order 30 inside/outside dichotomy 94–5, 229
262 Index
institutionalisation 153, 201, 224, 225 36, 37, 143, 209; moral reason 4, 11;
integration 203, 204 theory of peace 3, 21, 39;
interaction 76, 79, 116–17 transcendental arguments 9; world
interdependence 36, 39, 203 order 143, 205–8
internal relations 79; and communicative Kaplan, Morton A. 39
action 100–6; and iconic modelling Kautsky, Karl 63
131, 136; relationship with action 117; Keat, Russell 5
vague and ambiguous nature of 106–7 Kelsen (positivist) 35
‘inter-national’: first use of term 2 Kennan, George 43
International Monetary Fund 134, 205, 233 Keohane, Robert 167–8, 170
International Political Economy (IPE) Keynes, John Maynard 220, 230, 232–3,
80–1; see also global political economy 234, 236
international relations: four ‘great Kissinger, Henry 43
debates’ of 36–40 knowledge: of actors 126; quantitative,
international relations, discipline of (IR) qualitative and relational 117
10–13; foundations 11; in 1960s 43; in Kratochwil, Friedrich 75
1980s 70; related to emergence of Kuhn, Thomas 150
states 82–3; Wendt vs. Hollis and
Smith debate 71–5; see also levels labour, division of 84
inter-state wars 193–4 labour markets 83, 220, 223, 224, 227, 228
invariance 3, 75, 76, 134 Lacedaemon see Sparta
investment 233 Lakoff, George 128
Ireland 199, 209 language 90, 105, 116, 150
lateral pressure theory 50, 192
Jabri, Vivienne 71 Latsis, Spiro 169
James II, King 27 law, rule of 28
Japan 206 Lawson, Tony 7, 129, 135
Jesus story 56–7 League of Nations 39
Johnson, Mark 128 learning process: and change 43–4; and
journalistic studies 43, 134 Gorbachev 3pathological 152–3,
judgemental rationalism 9, 77, 144, 148, 199–200
155, 157 Lebenswelt 112
Lefebvre, Henri 138
Kant, Immanuel 2–3, 9, 10, 11, 21; belief Lehnert, G. 56–7
in Kingdom of Angels 34; as classical Leibniz, Gottfried 69
political theorist 36–7; concept of Lenin, V.I. 42, 63, 113
‘nation’ 17; concern over Europe’s Lesbos 184
destructive powers 84; importance of levels: against scientific
40; on international problematic 32–4; (re)conceptualisations of 82–7; as
‘mechanisms’ for peace 33; on complex assemblies and integrated
perpetual peace prospects 38; and wholes 82, 85; convenience of 86–7;
phenomena/noumena dichotomy as degrees 82; as layers 82–3; as ranks
33–4, 35, 39–40; and positivism 3; 83; realist conceptions of 81–2; social
suspicious of world government contexts 85–6; and study of social
concept 207; on Thucydides 167, 190; systems 118
on war and peace 17 levels-of-analysis problematic 72, 73–81
Kantian theory: as anachronism 237; liberalism 3, 70; economic 231;
critical questions 6, 9, 10; dualism 89; embedded 211, 217, 220, 228, 229;
and ‘great debates’ of international Kantian 2–3, 4, 34, 36, 37, 143, 209;
relations 36–40; Hegel’s view of 34; see also neoliberalism
idealism 2, 33, 36, 39; influence on Lin Biao 63
20th century thinking 39, 43; and IR Linge, David E. 142
problematic 15; liberalism 2–3, 4, 34, linguistic fallacy 4
Index 263
space: in iconic models 137–9; and time belief in brute power politics
94, 118; see also geo-historical processes 169–70; on causes of Peloponnesian
Sparta 180, 181, 182, 183, 184, 186–8 war 186–8; on development of Athens
spatiality, horizontal 79, 80 192; different interpretations of 191;
Spinoza, Baruch de 30 on Hellas 180–2; and history as
state: and assumptions of space 138; tragedy 172–3; on lasting value of his
behaviour 73–5; and hierarchies 83; work 189; moralism in 188; on
modern (development of) 21, 26–8, Pericles 184; on Persian Wars 182;
84, 237; Nordic preoccupation with personal background 186, 192;
228; as open system 118; as person 87, ‘polimetrics’ of 173; tragedy in 172–3,
129; private property analogy 26; and 188–90; use of dialogue 168, 169–70,
problem of order 31; and quantitative 189; see also Melos episode
data 134; relationship with institutions time: in iconic models 137–8, 139–41;
79, 82–3; relationship with peace and and space 94, 118; see also geohistorical
violence 15; relationship with war processes
193–4; as unitary actor 71–2 TINA-formations 227
states system theory 86–7 TMSA see transformational model of
statistics 117, 133–5; Alker on 45–6, 49, social activity
68 Tobin, James 223–5, 236
stoicism 35–6, 37, 227 totalitarianism 68
stories 57–8; ‘two stories’ dichotomy Toulmin, S. 29
87–90; see also narratives traditionalism: vs. positivism 36, 39; texts
structuralism 91, 99 134
structuration, theory of 99, 100, 112, 113, tragedy: Aristotle on 188–9; history as
114 58; Marx on 238; in Thucydides
structure: definitions of 114–15, 116–18; 172–3, 188–90
nature of 100; social 101–3, 114–18 transformational model of social activity
Sweden 224, 235, 236; and Cold War (TMSA) 99, 102, 108
211, 212; economy 214, 215, 223, transformative action 120
233; and EU membership 222, 225; trans-nationalisation 229–30
inequalities 223; 1980s 222; see also trials 137; by jury 103, 107–8, 110–11
Swedish model truth: as metaphor 149–50; nature of 14,
Swedish Employers’ Association 217, 220 143–50, 153–4, 155–6, 157–8, 161–2;
Swedish model 16, 215–22; decline of ontological and alethic aspects 147,
215; see also Norden 148; politics and 143; Popper’s theory
Swedish Union Federation 217, 220 of 161
symbols, technical 126
systems: comprehensive 149; definitions undecidability 154–5
of 118; open 76, 117–18, 130, 135, understanding: of being 133; explanation
146, 155–6, 157, 202, 218; as 72–3, 132–3; Habermas’s theory of
positionpractice 104, 111–12; social 89–90, 104–5; as productive process
83–5, 118theoretical 150 87
unemployment 223, 224
Tarski, Alfred 161, 162 Unger, Roberto 159, 201
tax havens 84, 233 unhappy consciousness 34, 35, 36, 37, 227
taxation: EU 230; global 17, 232, 233–5, United Nations: Alker’s studies of
236 General Assembly 44–6, 47–8, 54, 68;
Thatcher, Margaret 111, 220 collection and analysis of data 134;
thematising 133, 142 new proposals for 232, 235; Nordic
Third Way 16, 210, 211, 213, 214, 215, countries in 232; Security Council 206
217, 220, 222, 236 Urry, John 5
Thucydides (The Peloponnesian Wars) 15, US Space Command Vision 2020 139
34, 41; aitia and prophasis concepts 179; USA 210, 214; action against Soviet
Index 267
Union (1918) 143; covert activities 93; 207; people’s 63, 143; role of states
dollar no longer convertible to gold 193–4; to end all wars 63, 143;
218; hegemony of 205, 206–7, 232; twentieth century 193; see also violence
interdependence school 39; networks Washington consensus 222, 228
and corporations 227; political realism Weber, Max 35, 37, 109, 128
11; positivism 39; see also Bretton welfare state 212, 214, 215, 220, 222, 225,
Woods system; Cold War 227; spending 220, 223
Utopias 2; concrete 158–60, 195, 202, Wendt, Alexander 13, 86–7, 90–1, 93, 94,
203, 208 99; debate with Hollis and Smith 71–5
Utrecht, Peace of 2, 28 Westphalia, Peace of 2, 27, 40
Wight, Martin 36
vagueness 103–4, 106–7 will, hardening of 199–200
value judgements 153–4 will-to-power 36, 186
Vico, Giovanni Battista 30 William of Orange 27
Vienna, Concert of 42, 84 Wilson, Thomas Woodrow 62–3, 143
violence 155–8; critical theory and Winch, Peter 99, 104, 154
avoidance of 14; ‘cultural’ 161as Wittgenstein, Ludwig 99, 104, 106, 107,
outcome of pathological learning 149
process 152; politics and 196and Wittgensteinians 39
security community 193–6, 200–1, Woodrow Wilson Chair (Aberystwyth)
202, 203, 205; state and 15, 194; see also war 39
voluntarism 24, 36, 51, 99 World Bank 134, 205
world government 207
Wæver, Ole 92–3, 196, 208, 225; and ‘world polities’ 80–1
Norden 16, 211–14, 226 World System Analysis 60, 127
Waisman, Friedrich 104, 106, 107 World Trade Organization 205, 236
Walker, R.B.J. 40, 70, 79
Wallerstein, Immanuel 46, 56 xympasa gnome 170
Waltz, Kenneth 39, 74, 82, 83, 94
Waltzian neo-realism 82, 127; reductio ad Young, Oran
absurdum of 74
war: and Kantian world order 21, 206, Zimmern, Alfred 39