[PDF] A macro agent and its actions

Abstract

In science, macro level descriptions of the causal interactions within complex, dynamical systems are typically deemed convenient, but ultimately reducible to a complete causal account of the underlying micro constituents. Yet, such a reductionist perspective is hard to square with several issues related to autonomy and agency: (1) agents require (causal) borders that separate them from the environment, (2) at least in a biological context, agents are associated with macroscopic systems, and (3) agents are supposed to act upon their environment. Integrated information theory (IIT) (Oizumi et al., 2014) offers a quantitative account of causation based on a set of causal principles, including notions such as causal specificity, composition, and irreducibility, that challenges the reductionist perspective in multiple ways. First, the IIT formalism provides a complete account of a system's causal structure, including irreducible higher-order mechanisms constituted of multiple system elements. Second, a system's amount of integrated information ( Φ ) measures the causal constraints a system exerts onto itself and can peak at a macro level of description (Hoel et al., 2016; Marshall et al., 2018). Finally, the causal principles of IIT can also be employed to identify and quantify the actual causes of events ("what caused what"), such as an agent's actions (Albantakis et al., 2019). Here, we demonstrate this framework by example of a simulated agent, equipped with a small neural network, that forms a maximum of Φ at a macro scale.

Full PDF

1 A macro agent and its actions

Larissa Albantakis *, Francesco Massari , Maggie Beheler-Amass , and Giulio Tononi Department of Psychiatry, Wisconsin Institute for Sleep and Consciousness, University of Wisconsin-Madison, Madison, WI 53715, USA Swarthmore College, Swarthmore, PA 19081, USA * Correspondence: [email protected] & These authors contributed equally to this work.

Abstract

In science, macro level descriptions of the causal interactions within complex, dynamical systems are typically deemed convenient, but ultimately reducible to a complete causal account of the underlying micro constituents. Yet, such a reductionist perspective is hard to square with several issues related to autonomy and agency: (1) agents require (causal) borders that separate them from the environment, (2) at least in a biological context, agents are associated with macroscopic systems, and (3) agents are supposed to act upon their environment. Integrated information theory (IIT) (Oizumi et al., 2014) offers a quantitative account of causation based on a set of causal principles, including notions such as causal specificity, composition, and irreducibility, that challenges the reductionist perspective in multiple ways. First, the IIT formalism provides a complete account of a system’s causal structure, including irreducible higher-order mechanisms constituted of multiple system elements. Second, a system’s amount of integrated information ( Φ ) measures the causal constraints a system exerts onto itself and can peak at a macro level of description (Hoel et al., 2016; Marshall et al., 2018). Finally, the causal principles of IIT can also be employed to identify and quantify the actual causes of events (“what caused what”), such as an agent’s actions (Albantakis et al., 2019). Here, we demonstrate this framework by example of a simulated agent, equipped with a small neural network, that forms a maximum of Φ at a macro scale. Introduction

What is an agent? To date, there is no single, agreed-upon definition of an agent that captures all relevant intuitions behind the concept. As a minimal property, an agent must be an open system that dynamically and informationally interacts with an environment. This simple requirement, however, immediately poses a methodological problem: When subsystems within a larger system are characterized by biological or informational properties, their boundaries are typically taken for granted and assumed as given (Krakauer et al., 2014; Oizumi et al., 2014; Albantakis, 2018; Kolchinsky and Wolpert, 2018). Moreover, at least in a biological context, the notion of agency is typically associated with macroscopic spatio-temporal scales. For example, the causally relevant information that organisms pick up from their environment is generally not dependent on microscopic details; the cognitive abilities of an animal are related to the interactions of their neurons, rather than the underlying molecules, atoms or quarks; and the goal-directed actions performed by humans and 2 other animals, and even life itself, have been characterized as instances of top-down causation (Ellis, 2009, 2016; Walker and Davies, 2013). This brings us to the last point: that agents are supposed to act upon their environment. Yet, as many have argued, the fact that physical events are either determined by previous (micro-physical) events, or emerge from (quantum) randomness, seems to be at odds with the notion of an autonomous agent with intrinsic causal power. Originally developed as a theory of consciousness (Tononi, 2015; Tononi et al., 2016), IIT offers a quantitative framework to characterize the causal structure of discrete dynamical systems (Oizumi et al., 2014). The main quantity, Φ , measures to what extent the causal constraints that a system exerts onto itself are irreducible to those of its parts. A system with Φ > 0 , forms a unitary whole, as all of its subsets constrain and are being constrained by other subsets within the system above a background of external influences (Maturana and Varela, 1980; Tononi, 2013; Marshall et al., 2017; Aguilera and Di Paolo, 2018; Albantakis, 2018; Farnsworth, 2018). In this way, IIT provides the tools to identify whether a set of elements forms an entity with causal borders that separate it from its environment – a maximum of Φ (Oizumi et al., 2014; Marshall et al., 2017). The IIT formalism can also be applied across micro and macro spatiotemporal scales in order to identify those organizational levels at which the system exhibits strong causal constraints onto itself. As shown in previous work (Hoel et al., 2016; Marshall et al., 2018), it is indeed possible for a system to have higher Φ at a macro level than at the micro level. According to IIT principles, the particular spatio-temporal scale that specifies a maximum of integrated information ( Φ $%& ) defines the spatio-temporal scale at which the system specifies itself in causal terms. Finally, the causal principles of IIT, including notions such as causal specificity, composition, and irreducibility, can be employed to identify and quantify the actual causes and effects of events (“what caused what”) within a transition between subsequent states of discrete dynamical system (Albantakis et al., 2019). Such a principled account of actual causation makes it possible to identify the causes of an agent’s actions and to trace them back in time (“causes of causes”) (Juel et al., 2019). Our goal here is to demonstrate how various aspects of the IIT formalism can be combined to provide an account of (macro) agents and their actions in silico , through the example of a simulated agent (“animat”) equipped with a small neural network that is able to perform a simple perceptual categorization task (Figure 1) (Beer, 2003; Albantakis et al., 2014; Hintze et al., 2017). We will first describe the animat as a macro system of interacting “neurons” (black boxes) (Marshall et al., 2018), before zooming in on its micro constituents (Figure 2). As we will show, the animat exhibits higher values of integrated information ( Φ ) at the macro level than at the micro level. Next, we will trace the causes of the animat’s actions back in time. While each action is necessarily preceded by a chain of micro events, these can only account for parts of the action. In our example, a cause for the action as a higher-order event constituted of multiple micro occurrences can only be found at the macro spatio-temporal scale. More broadly, our example analysis serves to demonstrate that IIT’s causal framework provides a consistent, quantitative account of causation that challenges the wide-spread reductionist perspective—that only individual micro constituents ultimately have causal power. 3 Figure 1. Artificial agent (“animat”) capable of performing a perceptual categorization task. (A) The animat is placed in a 16 by 36 unit environment. The animat itself is 3 units wide and can move to the right and left 1 unit at a time. Its two sensors are positioned on each side with 1 unit between them and switch ‘on’ (1) whenever a block is directly above them irrespective of its distance. Per trial one block is falling to the right or left at one unit per time step. (B) The task is to catch blocks of size 3 and 6 and to avoid blocks of size 4 and 5 (Task 4 in (Albantakis et al., 2014) ). (C) The animat is equipped with a binary, deterministic Markov Brain (Hintze et al., 2017) constituted—at the macro level—of two sensors, 3 hidden elements, and two motors. The update function is fully described by the animat’s deterministic state transition probability matrix. The sensor states are determined by the environment. The motor elements do not have feedback connections to the rest of the system.

The simulated animat – macro and micro

The animat we will analyze in the following is capable of performing an active perceptual categorization task (Beer, 2003; Marstaller et al., 2013) with high accuracy (97.7% correct). In the simulated environment, blocks of different sizes are falling to the right or left, one at a time, and the animat has to catch or avoid them depending on their size (Figure 1). To that end, the animat is equipped with two sensors that turn ‘on’ (1) if a block is positioned directly above them at any distance, and two motors that enable the animat to move to the right or left ( 𝑀 ( 𝑀 ) = (0,1) : move right, 𝑀 ( 𝑀 ) = (1,0) : move left, 𝑀 ( 𝑀 ) = {(0,0), (1,1) }: stand still). The animat’s behavior is determined by a “Markov Brain” (Hintze et al., 2017), a small neural network which, in our specific case, is constituted of binary elements with deterministic input-output functions. In (Albantakis 2014), we have used a genetic algorithm to evolve a population of this type of Markov Brains for high fitness in the block-catching task. Both the connectivity structure and update function of the Markov Brains were encoded in a genome (string of integers) and adapted through fitness selection and mutation. Macro level network:

At the macro level (Figure 1C), the Markov Brain of our example animat is equivalent to that of the best performing animat in (Albantakis 2014) (Task 4), with three hidden nodes (

𝐴, 𝐵, 𝐶 ) that are connected in an all-to-all manner, while connections from the sensors and to the motors are feedforward only. The update function of the animat’s Markov Brain can be represented by its state transition probability matrix (TPM), which specifies the output state of the hidden nodes and motors given the prior state of the sensors and the hidden nodes (the state of the sensors is fully determined by the environment). The macro elements, nodes

𝐴, 𝐵, 𝐶 and motors 𝑀 ( , 𝑀 ) , update at the same rate as the environment. 4 By construction, each macro element of our example animat corresponds to a “black box” (Marshall et al., 2018), constituted of a set of micro elements that interact and update at a finer spatial and temporal scale. In Figure 2 we zoom in on the micro constituents of the macro-level animat displayed in Figure 1C. Within the IIT framework, the macro level strictly supervenes upon its micro constituents. The macro level corresponds to a mapping that groups disjoint subsets of micro elements into non-overlapping macro elements (Hoel et al., 2013, 2016; Marshall et al., 2018). Likewise, the state of a macro element is always determined by a surjective mapping of the micro states of its underlying micro constituents. Micro level network:

The animat’s micro level is constituted of 72 simple logic gates (COPY, AND, OR, XOR, NOT, and NOR gates, as well as one Majority (MAJ) gate that turns ‘on’ if more than half of its inputs are ‘on’). As shown in Figure 2, the sensors consist of the same two elements at the macro and micro level. However, the macro elements

𝐴, 𝐵, 𝐶, 𝑀 ( , 𝑀 ) each correspond to a “black box” (Marshall et al., 2018) constituted of several (10 to 17) micro elements. The micro elements within these black boxes are connected in a largely feed-forward manner (except for one loop in each motor black box, which “clocks” its motor response (see below)) and collectively emulate the logic function of their respective macro node over 4 micro time steps (updates). Each black box has one output node, but may receive inputs from the other black boxes or the sensors via multiple input nodes. The output nodes of the motor black boxes determine the animat’s movements but do not feedback into the network. In this way, the connectivity between the black boxes mirrors the connectivity of the macro-level animat shown in Figure 1C. State-mapping:

As each black-box has four layers (including inputs and the output node), four micro updates correspond to one macro update. The environment updates at the same rate as the macro elements. This means that, when viewed at the micro level, the sensors receive and output the same environmental input for four micro time steps.

The mapping from micro-level states to macro-level states is accomplished as proposed by (Marshall et al., 2018). The macro state of a black box corresponds to the state of its output node at the time of the macro update (here, every four micro updates) (Figure 2, bottom). The states of all non-output elements are ignored at the macro level, as they are hidden within the black boxes. As a consequence, each macro state is realizable by multiple micro states. Figure 2 shows one possible micro state corresponding to the macro state 𝑆 ( 𝑆 ) 𝐴𝐵𝐶𝑀 ( 𝑀 ) = (0,0,1,0,1,1,0) . Similarly, the states between the macro updates are ignored for determining the macro update function. Due to the specific implementation of the motor black boxes, their outputs remain in state (0,0) between macro updates, if the Markov Brain is initialized correctly at the beginning of each trial (e.g., in state “all off”). This means that the animat may only perform actions on those micro time steps that correspond to the macro update. This guarantees that our example animat behaves in exactly the same way as an animat that, at the micro level, is implemented as in Figure 1C without further sub-constituents. In the following we will apply IIT’s causal analysis to our example animat, at both the macro and the micro level. To that end, we will first evaluate the causal constraints the system exerts onto itself—its cause-effect structure —at both levels. Second, we will assess to what extent the constraints specified by the cause-effect structure are irreducible under a partition of the system, as measured by Φ . 5 6 Figure 2. Looking inside the macro level black-boxing at the animat’s micro level constituents.

Each macro element

𝐴, 𝐵, 𝐶, 𝑀 ( , 𝑀 ) is constituted of several (10 to 17) micro elements (simple logic gates: COPY, AND, OR, XOR, NOT, NOR, and MAJ as indicated). Within each black box, micro elements are connected in a largely feed-forward manner (except for one loop in each motor black box). Every four micro updates correspond to one macro update. The macro state, here 𝑆 ( 𝑆 ) 𝐴𝐵𝐶𝑀 ( 𝑀 ) = (0,0,1,0,1,1,0) , corresponds to the state of the black-box output nodes at the time of each macro update and is thus multiply realizable. Shown here is one possible realization corresponding to the last state in the time series shown below. The states of the other micro elements are ignored at the macro level (as they are hidden within the black boxes). Likewise, the state of the other micro timesteps are not taken into account in the mapping (Marshall et al., 2018) . Given the particular implementation of the motor black boxes, the animat may only move on those micro time steps that correspond to the macro updates. The compositional cause-effect structure of a system in a state

The IIT formalism evaluates five causal principles: intrinsicality, composition, information, integration, and exclusion (Oizumi et al., 2014; Tononi, 2015). We will briefly outline these principles underlying IIT’s causal analysis by example of the macro animat shown in Figure 1C and its corresponding transition probability matrix (TPM). For details and formal definitions of the relevant quantities we refer to the original publications (Oizumi et al., 2014; Tononi, 2015). All IIT quantities can be computed from a given TPM using PyPhi, IIT’s python toolbox (Mayner et al., 2018). Here, we used the standard configuration corresponding to “IIT 3.0” (Oizumi et al., 2014). In general, the IIT analysis starts from a discrete dynamical system 𝑆 , constituted of 𝑛 interacting elements 𝑆 with 𝑖 = 1, … , 𝑛 . Each element must have at least two internal states, which can be observed and manipulated, and is equipped with a Markovian input-output function 𝑓 that determines the element’s output state 𝑠 depending only on the previous system state 𝑠 :;( : 𝑠 = 𝑓 (𝑆 :;( = 𝑠 :;( ) . This means that all elements are conditionally independent given the past state 𝑠 :;( of the system. 𝑆 is fully described by its state transition probabilities: 𝑝̂(𝑆 : = 𝑠 : |𝑆 :;( = 𝑠 :;( ) = ∏ 𝑝̂A𝑆 = 𝑠 | 𝑆 :;( = 𝑠 :;( B, ∀ 𝑠 : , 𝑠 :;( . E5F( ( 1 )

Note that Eqn. 1 includes system states that may not be observed during the dynamical evolution of the system, but require system interventions (Pearl, 2000; Ay and Polani, 2008). The notation 𝑝̂ emphasizes that all probabilities herein correspond to interventions, not mere observations: 𝑝̂(𝑆 : = 𝑠 : |𝑆 :;( = 𝑠 :;( ) = 𝑝A𝑆 : = 𝑠 : G𝑑𝑜(𝑆 :;( = 𝑠 :;( )B (Pearl, 2000). Intrinsicality:

Our goal is to evaluate the causal constraints specified by the set of elements 𝑆 onto itself, above the background of external influences. If 𝑆 is a subset of elements within a larger system, all elements outside of 𝑆 are held fixed in their current state throughout the causal analysis and thus act as background conditions (causal conditioning). From its intrinsic perspective, the system is always in one particular state at any given moment. Accordingly, IIT’s causal analysis is state-dependent—it characterizes the system in its current state—and we take all previous system states 𝑠 :;( to be a priori equally probable (maximum entropy). State-averaged system properties, such as its stationary distribution, or an observed time-series, are extrinsic, available to an external observer but not the system itself. 7 For illustration, we choose the macro candidate set 𝐴𝐵𝐶 in state

𝐴𝐵𝐶 = (1,0,1) (Figure 3A). In that case, the sensors and motors 𝑆 ( 𝑆 ) 𝑀 ( 𝑀 ) = (0,0,1,0) act as fixed background conditions. The transition probabilities of 𝐴𝐵𝐶 can be obtained from the full TPM (Figure 1C) by conditioning on 𝑆 ( 𝑆 ) 𝑀 ( 𝑀 ) = (0,0,1,0) and are shown in Figure 3B. While the macro level TPM supervenes upon the micro level TPM, we ignore the underlying micro updates here and only take the macro TPM into account when we assess the animat’s macro cause-effect structure. From the intrinsic perspective of the system at the macro level, the micro elements and their updates are hidden inside the black boxes. Only the constraints between the macro elements should be taken into account. Composition:

In contrast with reductionist accounts that only consider how individual system elements update and interact, and holistic approaches that describe the dynamical evolution of the system as a whole based on its global state transitions, IIT takes a compositional perspective on causation (Albantakis and Tononi, 2019). Not only single elements (here

𝐴, 𝐵 , and 𝐶 ), but also combinations of elements may specify their own constraints about other system subsets as long as they are irreducible (see below). Within our candidate set 𝐴𝐵𝐶 = (1,0,1) we thus evaluate the integrated information 𝜑(𝑥 : ) of all subsets 𝑋 = 𝑥 : of 𝐴𝐵𝐶 = (1,0,1) (Figure 3C). A subset 𝑋 with 𝜑(𝑥 : ) > 0 is termed a mechanism within the system in its current state 𝑆 = 𝑠 : . Mechanisms constituted of single elements are termed “first-order mechanisms”, while those constituted of multiple elements are termed “higher-order mechanisms” and are occasionally labeled by their specific order, e.g., “second-order mechanism” for 𝐴𝐶 = (1,1) . Information:

The IIT formalism employs a counterfactual, interventionist notion of causation (Lewis, 1973; Pearl, 2000) to evaluate the causal constraints that a set of elements in its current state specifies about its causes and effects within the system. However, rather than testing for a counterfactual relation based on a single alternative, IIT considers all possible system states in its causal analysis, which can thus be expressed in probabilistic, informational terms (Albantakis et al., 2019). For clarity we add a time subscript to indicate a system subset at a specific point in time. The constraints that a system subset 𝑋 : ⊆ 𝑆 in its current state 𝑥 : ⊆ 𝑠 : specifies about the prior or next state of another subset 𝑍 :±( ⊆ 𝑆 are captured by its cause or effect repertoire. Specifically, the effect repertoire of 𝑥 : over the subset 𝑍 :P( is defined as: 𝜋(𝑍 :P( |𝑥 : ) = ∏ 𝑝̂A𝑍 G𝑥 : B . ( 2 ) The symbol 𝜋 indicates that the repertoire is a product distribution over the individual elements 𝑍 ∈ 𝑍 :P( rather than simply the conditional distribution over 𝑍 :P( . In this way, all 𝑍 are conditioned on 𝑥 : but receive independent “random” inputs from variables in 𝑆 : \𝑋 : which are marginalized (causal marginalization). The cause repertoire of 𝑥 : over the subset 𝑍 :;( is defined as: 𝜋(𝑍 :;( |𝑥 : ) = (T ∏ 𝑝̂A𝑍 :;( G𝑥 B with 𝐾 = ∑ ∏ 𝑝̂A𝑍 :;( = 𝑧G𝑥 B . ( 3 ) Here the product is over the elements in 𝑥 : , which discounts biases from common inputs from 𝑆 :;( \𝑍 :;( that are marginalized. A subset 𝑥 : specifies information about 𝑍 :;( and 𝑍 :P( to the extent that conditioning on 𝑥 : constrains the state of 𝑍 :;( and 𝑍 :P( compared to its unconstrained probability 𝜋(𝑍 :±( ) (see (Oizumi et al., 2014; Tononi, 2015; Albantakis and Tononi, 2019) for details). By constraining a subset 𝑍 :;( , 𝑥 : specifies information about its possible cause 8 within the system. Likewise, by constraining a subset 𝑍 :P( , 𝑥 : specifies information about its possible effect within the system . Integration:

All subsets 𝑥 : ⊆ 𝑠 : may specify their own information about other subsets 𝑍 :±( (see composition). However, a subset only contributes to the intrinsic information of the system if this information is irreducible ( 𝜑(𝑥 : ) > 0 ). This is tested by partitioning the cause/effect repertoire 𝜋(𝑍 :±( G𝑥 : ) into two parts 𝜋A𝑍 (,:±( G𝑥 (,: B × 𝜋A𝑍 ),:±( G𝑥 ),: B and measuring the difference between the intact and partitioned distributions (see (Oizumi et al., 2014) for details). Of all such partitions 𝜓 , the one that makes the least difference to the cause/effect repertoire (termed “MIP” for minimum information partition) determines the integrated information 𝜑(𝑥 : , 𝑍 :±( ) specified by 𝑥 : over the subset 𝑍 :±( . Moreover, to be a mechanisms within the system the subset 𝑥 : must specify information about its causes and effects, requiring that min :±( _𝜑(𝑥 : , 𝑍 :±( )‘ > 0 . Within system 𝐴𝐵𝐶 in state (1,0,1) , the information that subset 𝐴𝐵 : = (1,0) specifies about its causes is reducible, as its cause repertoire 𝜋A𝐴𝐶 :;( | 𝐴𝐵 : = (1,0)B can be partitioned into 𝜋(𝐶 :;( | 𝐴 : = 1) × 𝜋(𝐴 :;( | 𝐵 : = 0) . Likewise, the information that 𝐴𝐵𝐶 : = (1,0,1) specifies about its effects is reducible, as its effect repertoire 𝜋A𝐴𝐵𝐶 :P( | 𝐴𝐵𝐶 : = (1,0,1)B can be partitioned into 𝜋(𝐶 :;( | 𝐴 : = 1) × 𝜋A𝐴𝐶 :;( | 𝐵𝐶 : = (0,1)B . Exclusion:

Finally, 𝑥 : may specify integrated information 𝜑(𝑥 : , 𝑍 :±( ) about various subsets 𝑍 :±( within a system. The causal role it plays within the system is determined by the subsets 𝑍 :;(∗ and 𝑍 :P(∗ over which 𝑥 : specifies the maximal amount of integrated information. 𝑍 :;(∗ and 𝑍 :P(∗ are respectively termed the cause and effect purview of 𝑥 : . In summary, the amount of integrated information 𝜑(𝑥 : ) specified by the subset 𝑥 : can be expressed as: 𝜑(𝑥 : ) = 𝑚𝑖𝑛 :±( c𝑚𝑎𝑥 Y c𝑚𝑖𝑛 e c𝐷 c gAY h±i |j h BeAg(Y h±i |j h )B kkkk. ( 4 ) The set of all irreducible mechanisms within the system, their cause and effect purviews, and their integrated information 𝜑(𝑥 : ) compose the intrinsic cause-effect structure of a system in a state 𝒞(𝑠 : ) . Comparing the macro and micro cause-effect structures

As shown in Figure 3C, the macro cause-effect structure of

𝐴𝐵𝐶 : = (1,0,1) is composed of five mechanisms with 𝜑(𝑥 : ) > 0 , all first order mechanisms and two higher order mechanisms. The information specified by these mechanisms corresponds to the compositional intrinsic information that the system 𝐴𝐵𝐶 in state (1,0,1) specifies about itself from the intrinsic perspective. For example, Element 𝐴 : = 1 specifies that 𝐵 :P( = 1 . Likewise, 𝐵 : = 0 specifies that 𝐶 :P( = 1 , but only with 𝑝 = 0.75 . Together, 𝐴𝐵 : = 10 specify 𝐵𝐶 :P( = 11 with certainty ( 𝑝 =1.0 ). 𝐴𝐵 : = 10 thus specifies irreducible information about the next state of 𝐵𝐶 :P( that cannot be accounted for by 𝐴 : = 1 and 𝐵 : = 0 taken independently. Nevertheless, some of the Within the cause and effect repertoire, we can identify the specific state that is maximally constrained by 𝑥 : , which then corresponds to the specific cause or effect of 𝑥 : within the system from the intrinsic perspective of the system (Haun and Tononi, 2019), (Barbosa et al, forthcoming). In addition, it is possible to evaluate relations between the cause and effect purviews of the various mechanisms, which specify the causes and effects specified by multiple mechanisms (see (Haun and Tononi, 2019)). 𝐴𝐶 : = 11 specifies the next state of the system 𝐴𝐵𝐶 :P( with certainty. As outside investigators, we can thus infer the state of every subset of

𝐴𝐵𝐶 :P( . Note, however, that such an inference requires a mechanism to be performed. The system itself only has information about subsets of

𝐴𝐵𝐶 :P( if other mechanisms exist, such as 𝐴 : = 1 or 𝐵𝐶 : =01 , that specify that particular information (Albantakis and Tononi, 2019). Figure 3. Compositional cause and effect information of subsystem ABC. (A) The goal is to evaluate the causal information that ABC in its current state specifies about itself, treating the other elements as fixed background conditions. (B) The TPM for subsystem ABC can be obtained from the system’s TPM (Figure 1C) by conditioning the full TPM on the current state of 𝑆 ( 𝑆 ) 𝑀 ( 𝑀 ) = (0,0,1,0) . Since the elements are binary, we can write the TPM in state-by-node format where each column specifies the probability of A,B, or C to be ‘on’ (1) given the respective input row. (C) Every subset of 𝐴𝐵𝐶 : =(1,0,1) may form a separate mechanism in ABC and thus specify information about its possible causes and effects within the system in a compositional manner. Here, the information that subset 𝐴𝐵 : = (1,0) specifies about its causes is reducible to a partition of 𝐴𝐵 : into 𝐴 : × 𝐵 : . Likewise, the information that 𝐴𝐵𝐶 : = (1,0,1) specifies about its effects is reducible to a partition of 𝐴𝐵𝐶 : into 𝐴 : × 𝐵𝐶 : . The cause-effect structure (CES) of ABC in state (1,0,1) is thus constituted of five irreducible mechanisms and the information they specify.

10 The corresponding set of micro elements consists of the 43 elements included in the black boxes

𝐴, 𝐵 , and 𝐶 . All other micro elements are taken to be background conditions. The micro system state is the one shown in Figure 2. The micro cause-effect structure is computed based on the micro TPM of the system. All of the 43 micro elements specify first order mechanisms in their current state. In principle, the 𝜑 values of all subsets of the 43 micro elements would have to be evaluated for higher order mechanisms. However, a set of elements 𝑥 : can only form a higher order mechanism if each of the elements shares inputs and outputs with other elements in the set. Otherwise, 𝜑(𝑥 : ) is necessarily 0 as either the cause or effect repertoire can be partitioned without loss (Oizumi et al., 2014; Mayner et al., 2018). As the connectivity at the micro level is rather sparse, modular, and feedforward, only a few layers of nodes may give rise to higher order mechanisms. We identified 12 higher order mechanisms, one in the input layer of black box 𝐶 , the other 11 are specified by four elements in black box 𝐵 (second layer, 2-5). On average, the 𝜑 value of the micro mechanisms is lower than that of the macro mechanisms: 〈𝜑〉 r5stu = 0.16 < 〈𝜑〉 rxstu = 0.27 , which means that the macro mechanisms constrain their respective inputs and outputs more than the micro mechanisms. Micro and macro system-level integrated information 𝚽 The cause-effect structure

𝒞(𝑠 : ) contains all the intrinsic information the system specifies about itself at the respective level of description. However, the notion of intrinsic information requires that there is a system in the first place, meaning one “whole” as opposed to multiple separate sets (Oizumi et al., 2014; Albantakis, 2018; Albantakis and Tononi, 2019). The next step in IIT’s causal analysis is thus to evaluate whether and to what extent 𝒞(𝑠 : ) is integrated, i.e., irreducible under a partition Ψ of the system. This is quantified by Φ(𝑠 : ) , the integrated information of the system as a whole 𝑆 in a particular state 𝑠 : : 𝛷(𝑠 : ) = 𝑚𝑖𝑛 | c𝐷 _𝒞(𝑠 : ); 𝒞A𝛹(𝑠 : )B‘k. ( 5 ) Again, we search for the system partition Ψ that makes the least difference to the cause-effect structure 𝒞(𝑠 : ) , the MIP (minimum information partition). As defined in (Oizumi et al., 2014; Tononi, 2015), system partitions are unidirectional, rendering the connections from one part of the system 𝑋 ⊂ 𝑆 to the rest ineffective. If the system does not form a unified whole and can be partitioned into two or more parts without loss,

Φ = 0 . Also systems in which two or more parts of the system are connected in a feedforward manner cannot be integrated (

Φ = 0 ). In a system with

Φ > 0 , all parts of the system constrain and are being constrained by the rest of the system above a background of external influences. Φ can thus be viewed as a measure of how much a system exists for itself, in causal terms. For Φ to be high, every possible partition must affect the integrated information 𝜑 specified by many mechanisms within the system. At the micro level, we identified the MIP as indicated in Figure 4A between the fourth input element of black box 𝐵 and its one output, the first AND gate in the second layer . As only two first order mechanisms are affected, this cut leads to a Note that the IIT python package Pyphi cannot, at the moment, compute the Φ value of a 43 element system exhaustively. To identify the micro MIP and assess Φ(𝑠 :r ) , we took advantage of the modularity of the micro system

11 comparatively low value of

Φ(𝑠 :r ) = 0.032 (the ‘m’ superscript indicates the micro level, below ‘M’ stands for macro level). While the cause-effect structure at the macro level is based entirely on the macro TPM, the system level integrated information Φ(𝑠 :(cid:129) ) is still evaluated by partitioning between micro elements (Marshall et al., 2018). This means that the same set of partitions Ψ is tested and can be compared at the macro and micro level. In this way it becomes impossible to trivially increase the system’s integration at certain macro levels by “hiding” weak connections inside the macro elements. The macro TPM of the partitioned system is obtained by black-boxing the partitioned micro system using the same element and state mapping as for the unpartitioned system. Compared to the micro level, the same partition has more substantial effects on the macro cause-effect structure, affecting the effect information specified by 𝐴 : = 1 and also the cause information specified by 𝐴𝐵 : = 10 and 𝐵𝐶 : = 01 . For this reason, the integrated information specified by the system at the macro level amounts to the higher value of Φ(𝑠 :(cid:129) ) = 0.213 (Figure 4B).

Figure 4. Comparing the integrated information of the micro and macro level.

The same minimal partition (indicated by the scissors cutting the connection in bold) affects the cause-effect information at the micro level less than at the macro level. While in (A) only the two micro elements directly connected by the partitioned arrow are affected, in (B) the partition also has secondary effects on the constraints of macro node A and C on macro node B (as indicated by the dashed, bold arrows). This explains the higher 𝛷 value at the macro level. and identified a separate MIP for each black box using Pyphi. The system MIP then corresponds to the minimum across black boxes. Partitions between black boxes all have larger effects on the micro cause-effect structure, as the output nodes of each black box are connected to many micro elements within the system.

12 According to IIT, maxima of integrated information Φ define causal entities having causal borders with their environment (Oizumi et al., 2014; Marshall et al., 2017, 2018). To identify whether a particular set of elements specifies a maximum of Φ , in principle, requires evaluating many other candidate systems. In our example, all systems larger than the set of elements that constitute A, B, and C (within the dashed rectangle in Figure 4) necessarily have Φ = 0 because the sensors and all elements in 𝑀 ( and 𝑀 ) are only connected to ABC in a unidirectional manner). As explained above, systems in which one part is connected to the rest in a feedforward manner cannot form an integrated system according to IIT. Consequently, only sets of elements that are strongly connected (for which a directed path exists from each element to every other element) can have a value of Φ > 0 . Using this short-cut, we can establish that the system ABC, as well as the set of its constituting 43 micro elements analyzed in Figure 4A and B form a maximum of integrated information Φ in their current state. This means that removing or adding any element from the set would lead to a lower Φ value . In this way, ABC and its set of constituents define a causal border that separate the internal constraints within the animat from its environment, both at the micro and macro level (Marshall et al., 2017, 2018) . As Φ measures the irreducible intrinsic constraints of a set of elements onto itself over a background of external influences, the macro system ABC can be said to be more autonomous than the set of its micro constituents. Tracing back the causal chain leading up to the animat’s actions

So far, we have focused on the causal information that the system’s elements specify about each other, alone and in combination, at a micro and macro level of description (Figure 3), and its integration as measured by Φ . As we have demonstrated, the macro level description, while supervening on the micro constituents, specifies more integrated information. In particular, the output nodes of the black boxes A, B, and C, play a crucial role in integrating the network over longer time scales. The causal principles of IIT (such as composition, information, integration, and exclusion), can also be employed to identify and quantify the actual causes and effects of an occurrence (“what caused what”), such as an agent’s actions (Albantakis et al., 2019; Juel et al., 2019). An “occurrence” here simply denotes a set of elements in a particular state: 𝑥 : . As described above, from the intrinsic perspective, the causal role that an occurrence 𝑥 : plays within the system is determined by the causal information 𝑥 : specifies about its cause and effect purviews, 𝑍 :;(∗ and 𝑍 :P(∗ , which are the system subsets over which the amount of integrated information 𝜑(𝑥 : ) is maximized (Equation 4). The actual state of the cause or effect purviews Our focus here lies on autonomy and the notion of self-defined causal borders that separate the internal mechanisms of the agent from its environment. According to IIT, a physical substrate of consciousness must specify a global maximum of Φ across all overlapping sets of elements and spatio-temporal scales. In other words, any particular micro element can only contribute its causal power to one physical substrate of consciousness by IIT’s exclusion postulate. Given the size of the animat’s micro implementation, an exhaustive analysis across all possible sets of elements and spatio-temporal mappings was not feasible. Thus, it is possible that smaller subsets within the 43 micro elements may specify even higher values of Φ within the system. Likewise, other spatio-temporal mappings may reveal additional “meso” levels of description with higher values of Φ than 𝐴𝐵𝐶 : . This also means that the animat’s sensors and motors technically form part of the environment, while the causally autonomous entity is defined as the integrated core of the animat. 𝑍 :;(∗ and 𝑍 :P(∗ , however, is unknown from the intrinsic perspective of the system in its current state. To identify the actual cause 𝑧 :;(∗ of an occurrence 𝑥 : , instead, we take the perspective of an extrinsic observer of the system with access to the system’s time series {𝑠 :;(cid:131) , … , 𝑠 :;( , 𝑠 : } (Figure 5A). In parallel to 𝜑(𝑥 : ) , the actual cause 𝑧 :;(∗ is then identified as the sub-state 𝑧 :;( ⊆ 𝑠 :;( over which 𝑥 : specifies the most irreducible causal information: 𝛼(𝑥 : ) = 𝑚𝑎𝑥 X c𝑚𝑖𝑛 e _𝑙𝑜𝑔 ) _ g(X h(cid:136)i |j h )e(g(X h(cid:136)i |j h )) ‘‘k. ( 6 ) 𝜋(𝑧 :;( |𝑥 : ) here denotes the probability of the specific state 𝑧 :;( in the cause repertoire 𝜋(𝑍 :;( |𝑥 : ) . The goal is to identify what caused x (cid:138) ⊆ 𝑠 : given a particular state transition 𝑠 :;( ≻𝑠 : . We refer to the original publication (Albantakis et al., 2019) for further details on the measure 𝛼(𝑥 : ) and the set of permissible partitions {𝜓} . In deterministic systems, the identified actual cause 𝑧 :;(∗ typically corresponds to an occurrence at 𝑡 − 1 that is minimally sufficient for 𝑥 : to occur, at least in the case where 𝑥 : is a first-order occurrence (a single element in its particular state) . The actions of our example agent are defined by the state of both of its motor units, the output nodes of 𝑀 ( and 𝑀 ) . As indicated by the micro time series displayed in Figure 5A, the agent’s actions are necessarily preceded by a chain of micro events. In the particular state evaluated above 𝑀 ( 𝑀 ) = 10 , which means that the animat is moving to the left. In the following we will use “ 𝐸 (cid:143)) ” and “ 𝐸 (cid:144)) ” to denote the micro output elements of the black boxes 𝑀 ( and 𝑀 ) . Both, 𝐸 (cid:143)) and 𝐸 (cid:144)) are AND logic-gates and receive direct inputs from two micro elements each, here labeled 𝐸 (cid:145)(cid:146) , 𝐸 (cid:143)( and 𝐸 (cid:143)(cid:144) , 𝐸 (cid:143)(cid:146) (from left to right). At time t − 1 , these micro elements were in state 𝐸 (cid:145)(cid:146) 𝐸 (cid:143)( 𝐸 (cid:143)(cid:144) 𝐸 (cid:143)(cid:146) = 1101 . Applied to the transition {(𝐸 (cid:145)(cid:146) 𝐸 (cid:143)( 𝐸 (cid:143)(cid:144) 𝐸 (cid:143)(cid:146) ) :;( = 1101} ≻{(𝐸 (cid:143)) 𝐸 (cid:144)) ) : = 10} , the actual causation analysis here provides the intuitive result that the actual cause of the occurrence 𝐸 (cid:143)),(cid:138) = 1 was (𝐸 (cid:145)(cid:146) 𝐸 (cid:143)( ) :;( = 11 with 𝛼 = 2.0 bits (both inputs had to be ‘on’ in order to switch the AND-gate 𝑀 ( ‘on’) and the actual cause of 𝐸 (cid:144)),: = 0 was 𝐸 (cid:143)(cid:144),:;( =0 with 𝛼 = 0.415 bits (which prevented 𝐸 (cid:144)),(cid:138) to be ‘on’). In principle, we also evaluate if any higher order occurrences specify their own irreducible causes (applying the composition principle). However, in this particular case the occurrence (𝐸 (cid:143)) 𝐸 (cid:144)) ) : = 10 is reducible, as the elements do not share common inputs at the micro level. (𝐸 (cid:145)(cid:146) 𝐸 (cid:143)( ) :;( = 11 and 𝐸 (cid:143)(cid:144),: = 0 are the direct (or proximal) micro causes of the individual outputs 𝐸 (cid:143)),(cid:138) = 1 and 𝐸 (cid:144)),: = 0 . Yet, the animat’s action (“move left”) here corresponds to the higher-order occurrence (𝐸 (cid:143)) 𝐸 (cid:144)) ) : = 10 , and the proximal micro causes do not provide a causal explanation for why 𝐸 (cid:143)),(cid:138) = 1 and 𝐸 (cid:144)),: =0 occurred together. With respect to an agent’s action, the direct micro-level cause is rarely considered the cause with the greatest explanatory power (Woodward, 1989). For example, while a motor neuron in the spinal cord may directly initiate a movement, we are typically more interested in identifying In non-deterministic systems Eqn. (6) would generally identify the minimal occurrence at 𝑡 − 1 that raises the probability of 𝑥 : the most. Introducing a “specification factor” π(z (cid:138);( |x (cid:138) ) in front of log ) _ (cid:153)((cid:154) (cid:155)(cid:136)i |& (cid:155) )(cid:156)((cid:153)((cid:154) (cid:155)(cid:136)i |& (cid:155) )) ‘ in Eqn. 6 effectively implements a tradeoff between an increase in probability of 𝑥 : and the cost of setting additional elements into a particular state, which allows identifying the part of 𝑧 :;(∗ that was particularly relevant for the occurrence of 𝑥 : .

14 the cortical events or external stimuli that triggered the action. To that end, we can employ the actual causation analysis to trace the causal chain of micro occurrences back in time, identifying the “causes of the causes” of the animat’s action (Juel et al., 2019). Specifically, we now start with (𝐸 (cid:145)(cid:146) 𝐸 (cid:143)( 𝐸 (cid:143)(cid:144) ) :;( = 110 , the union of the actual causes of (𝐸 (cid:143)) 𝐸 (cid:144)) ) : = 10 and identify the actual causes of all occurrences x (cid:138);( ⊆ (𝐸 (cid:145)(cid:146) 𝐸 (cid:143)( 𝐸 (cid:143)(cid:144) ) :;( = 110 at time 𝑡 − 2 , and so on. As a measure of the causal relevance of a particular micro element, we sum its relative contribution to the 𝛼 values of all actual causes it participates in within a given time step (see (Juel et al., 2019) for details). Figure 5B shows the results of tracing the causes of {(𝐸 (cid:143)) 𝐸 (cid:144)) ) : = 10} back to the beginning of the trial ( 𝑡 = 0 ). The histogram on the right shows the summed causal strength across elements. After an initial transient through the micro elements that make up the motor black boxes and the micro elements constituting the internal black boxes A, B, and C ( 𝑡 = 31 to 𝑡 = 24 , moving upwards from the bottom), a first peak of the overall causal strength can be observed at 𝑡 = 23 , when the backtracking reaches the output elements of the black boxes A, B, and C for the second time. This shows that these micro elements play a special causal role, not only with respect to the constraints the system poses onto itself, but also regarding the causes of its actions. Going back further in time, we find additional peaks in correspondence with the spatiotemporal scale that matches the black-box macro level. The reason for these peaks is that the black box outputs act as causal bottlenecks within the system, with many incoming and outgoing connections. Each output element thus contributes to the causes of many occurrences at the next micro time step (setting the states of the many black box input elements). In sum, tracing back the causal chain of events at the micro level of description provides an independent way of identifying nodes within the system that act as “causal bottlenecks”. In this way, the actual causation analysis can inform the search for relevant spatiotemporal scales and black boxings that may form maxima of integrated information. Finally, we can apply the actual causation analysis directly to the macro-level transition {(𝑆 ( 𝑆 ) 𝐴𝐵𝐶) (cid:157);( = 00001} ≻ {(𝑀 ( 𝑀 ) ) (cid:157) = 10} . In doing so, we find the macro occurrence (𝑆 ( 𝐴𝐶) (cid:157);( = 001 (or equivalently (𝑆 ( 𝐵𝐶) (cid:157);( = 001 ) to be the cause of 𝑀 (,(cid:157) = 1 with 𝛼 = 1.30 bits, and 𝐴 (cid:157);( = 0 to be the cause of 𝑀 ),(cid:157) = 0 with 𝛼 = 0.25 bits . In addition, the higher-order occurrence (𝑀 ( 𝑀 ) ) (cid:157) = 10 specifies its own irreducible cause (𝑆 ( 𝐴𝐶) (cid:157);( = 001 at the macro level with 𝛼 = 0.35 bits. This can be interpreted as causal information that the joint occurrence of (𝑀 ( 𝑀 ) ) (cid:157) = 10 specifies about the particular state of 𝑆 ( 𝐴𝐶 that actually happened at 𝑇 − 1 which is not specified by its parts taken independently. In other words, there is a causal explanation for the action “move left”, corresponding to (𝑀 ( 𝑀 ) ) (cid:157) = 10 at the macro level, beyond the independent occurrences of 𝑀 (,(cid:157) = 1 and 𝑀 ),(cid:157) = 0 . As we have seen above, such an explanation does not exist at the micro level. The 𝛼 values at the macro level are somewhat smaller than those of the proximal causes at the micro level, as we average over all possible initial states of the motor black-boxes when we evaluate the strength of the causal links in the macro transition, which introduces a certain level of indeterminacy (see (Marshall et al., 2018)). Figure 5. Tracing back the causes of action 𝑴 𝑴 = 𝟏𝟎 . (A) Micro level time series specifying the state of all 72 micro constituents of our example animat over 32 time steps. (B) Starting from the current micro state of the agent (here 𝑡 = 31 ), the actual causes of the occurrences 𝐸 (cid:143)) = 1 , 𝐸 (cid:144)) =0 , and 𝐸 (cid:143)) 𝐸 (cid:144)) = 10 are identified in the preceding micro time-step 𝑡 = 30 (proximal causes). The micro elements 𝐸 (cid:143)) and 𝐸 (cid:144)) here correspond to the output elements of the motor black boxes 𝑀 ( and 𝑀 ) . Iteratively, the backtracking analysis then identifies the causes of the set of elements involved in the proximal causes of the previous time step (causes of causes). At each time step we determine the strength 𝛼 of the causal link between an occurrence and its actual cause (Eqn. 6) and assign each micro element its summed contribution to 𝛼 (right panel). The histogram on the left shows the summed contribution across elements. After an initial transient, maxima of causal strength can be observed at every macro time step for the micro elements that correspond to the black box outputs which highlights the special causal role that these output elements play within the system. Discussion