EssaysPart 1: Foundations
2

Affordances, Enaction, and the Grammar of Interaction

Gibson and Varela made operational.

20 min read

I. The Door Handle

Before I ever read J.J. Gibson, I understood affordances through door handles.

A flat plate on a door affords pushing. A vertical bar affords pulling. A round knob affords turning. You do not need a sign that says "push" or "pull." Your body reads the hardware and knows what to do. The affordance is not a property of the door alone, and it is not a property of your body alone. It is the relationship between the twothe specific fit between the door's form and the body's capacities that specifies an action without requiring instruction.

Don Norman popularized this concept in design, but Gibson's original formulation is deeper and more radical. For Gibson, affordances are not subjective impressions. They are real features of the environment, specified in the structure of ambient light, that tell the organism what actions are available. A flat surface affords walking. A graspable object affords grasping. A cliff edge affords falling. These affordances exist whether or not anyone perceives themthey are properties of the organism-environment system, not of the organism's mind.

This ecological understanding is the foundation of everything in this book. Embodied cognition, as I argued in Essay 1, is not a theory about how the brain processes information. It is a claim about where cognition livesin the dynamic coupling between body and environment, not inside the skull. Affordances are the grammar of that coupling. They are how the body reads the world.

II. The Banjo as Affordance

The banjo taught me this before Gibson did. An instrument is a system of affordances: the neck affords gripping, the strings afford plucking or strumming, the head affords resonance. But these are only the generic affordancesthe ones any body discovers in the first moments of holding the instrument. The deeper affordances emerge over time, through practice, as the coupling between body and instrument tightens.

After a year of playing, my left hand discovered that the neck's curvature affords a specific kind of sliding movement between chord positionsnot the guitaristic slide along a single string, but a rolling motion that uses the thumb as a pivot, allowing the fingers to reorganize during transit. This affordance is not specified in the instrument's design drawings. It exists only in the relationship between a particular hand geometry, a particular neck profile, and the accumulated hours of practice that taught the hand to feel the curvature as an invitation rather than an obstacle.

After five years, the affordances had restructured my hand. The calluses, the muscle memory, the slightly widened span between index and ring fingerthese were the traces of coupling, the body's adaptation to the instrument's demands. The instrument had not changed. My body had changed to meet it. Gibson would say the affordances were always there, waiting to be discovered. Merleau-Ponty would say the body extended its schema to incorporate the instrument. Both are right, and the difference between them matters less than the phenomenon they both describe: that the body and the instrument became a system, and the system's capacities exceeded what either possessed alone.

This is the thaumotrope of ecological coupling: body on one side of the disc, environment on the other, and the affordancethe action-possibility that belongs to neither aloneproduced by the spinning between them. The banjo does not afford clawhammer picking to an untrained hand. It affords clawhammer picking to a hand that has spent years in coupling with it. The affordance is real, but it is historically constitutedproduced by the history of the body-environment relationship, not given in advance.

III. Enaction: Cognition as Participation

Francisco Varela, Evan Thompson, and Eleanor Rosch, in The Embodied Mind, radicalize Gibson's ecological approach into what they call enaction. The enactive claim: cognition is not the processing of representations inside the head. Cognition is the participatory activity of a body engaged with an environment. The world is not "out there," passively waiting to be represented. The world is enactedbrought into being through the organism's engagement with it.

This sounds abstract until you watch a baby learning to reach. The baby does not first build an internal model of the cup and then execute a reaching program. The baby reachesclumsily, iteratively, adjusting grip and trajectory with each attemptand through the reaching, the cup becomes a graspable thing. The affordance does not pre-exist the baby's engagement with the cup. It emerges through the engagement. This is what Varela means by enaction: the world and the organism co-specify each other through the organism's activity.

Jesse Schell arrives at the same insight from game design. His argument for a three-discipline methodologypsychology, anthropology, and design, all needed to understand experiencemaps onto the enactivist claim that no single observational position captures what happens in a lived interaction. The psychologist studies what is in the player's head. The anthropologist studies the player's cultural context. The designer studies what the system affords. None of these alone gives you the experience. The experience is the enactive product of all threethe coupling between the player's cognition, the player's culture, and the system's affordances, enacted in real time through the player's body.

IV. The ITP Education

I arrived at ITP from a music background and a classical education. I knew Merleau-Ponty. I knew Ovid. I knew what it felt like to inhabit a body that was trained for specific kinds of performancethe fencer's body, the musician's body, the classicist's body that spent years hunched over texts in languages whose speakers were dead.

What I did not know was how to build things that enacted cognition in other people's bodies.

ITP taught me this through practice, not theory. The program's pedagogy is enactivist whether or not it uses the word. You do not study interaction design by reading about it. You build prototypes, put people in front of them, watch what happens, and iterate. The knowledge you gain is not propositional"users prefer X to Y"but embodied: a felt sense for what works, what falls flat, what surprises, what delights. You develop an affordance vocabulary not through study but through couplingrepeated engagement with materials, tools, and the bodies that test your designs.

Andrew Schneider's Performing Reality course made this explicit. The methodology: take a neurological or psychological phenomenon and use it as the blueprint for an experience you put other people through. This is enactive designyou do not represent the phenomenon, you re-enact it, creating conditions under which the participant's body produces the phenomenon from the inside. The Elton John piece I describe in Essay 11 came from this course. So did the AITA exercise. Both are affordance designs: they structure the environment so that the participant's body discovers something through engagement that no amount of explanation could deliver.

Both courses taught me to think of affordances not as features I embed in a design but as relationships I cultivate between the design and the body that engages with it. The affordance is not in the controller or the gesture recognizer or the virtual object. It is in the couplingthe specific fit between what the system offers and what the body can do.

V. Burickson's Research and the Affordances of Empathy

Abraham Burickson's empathetic research framework, developed through his work with Odyssey Works, translates the enactivist position into a practice of design research.

Burickson distinguishes three modes of research: experiential (the designer immerses themselves in the context they are designing for, learning through their own body), relational (the designer builds relationships with the people they are designing for, learning through sustained engagement rather than one-time interviews), and inclusive (the design process itself becomes a collaboration between designer and participant, each one shaping the other's understanding).

All three are enactivist methods. Experiential research is the designer's body learning affordances through direct engagementthe same way the baby learns the cup's graspability by reaching for it. Relational research acknowledges that affordances are socialthat what the designed environment affords depends on who is in it, and that understanding those affordances requires the kind of sustained coupling that Varela describes. Inclusive research goes furthest: it treats the user not as a source of data but as an enactive partner in the design process, co-specifying the world that the design will bring into being.

This is the opposite of the survey, the focus group, the A/B test. Those methods treat the user as a source of data to be extracted. Burickson's methods treat the user as a body to be coupled witha participant in the enactive process through which the design's affordances are discovered and refined.

VI. The Pit Orchestra as Affordance Ecology

The pit orchestra is an affordance ecologya system in which every element affords specific actions to specific bodies, and the coupling between all of them produces something none could produce alone.

The conductor's baton affords tempo and phrasing to the musicians who have learned to read it. The score affords melodic and harmonic structure to hands that have trained on the instrument. The acoustic space of the pit affords a specific kind of listeningcompressed, intimate, the sound of the player next to you louder than the sound of the orchestra as a whole. Uri Hasson's neural coupling research describes what happens in this environment: brains synchronize. The musicians' neural activity begins to mirror the conductor's, and the audience's neural activity begins to mirror the musicians'. The coupling is not metaphorical. It is measurable, physiological, and it produces the specific experience of live musicthe feeling that the room is breathing together.

This is enaction at collective scale. No single musician is producing the experience. The experience is enacted by the systemthe coupling between nine bodies, nine instruments, a conductor, a score, an acoustic space, and an audience whose presence changes everything. The affordances are distributed across the entire ecology. The conductor affords timing. The bassist affords harmonic ground. The drummer affords rhythmic anchor. My banjo affords rhythmic drive and melodic color. And the audiencesilent, invisible below the stage lipaffords the most important thing of all: the reason to play. Without that coupling, we are nine people making noise in a hole in the ground. With it, we are a single mind with nine bodies.

Meineck's Theater of Dionysus was the same kind of affordance ecology, scaled to a polis. The amphitheater's bowl shape afforded collective sight. The masks afforded archetypal recognition. The chorus afforded communal voice. The open-air sightlines afforded spectators the ability to see each other watchingan affordance that the proscenium stage deliberately removes. Each architectural element afforded something specific to the bodies that occupied the space, and the coupling between all those affordances produced theatrocracy: democratic subjectivity enacted through designed spectatorship.

VII. The Grammar of XR Interaction

If affordances are the vocabulary of embodied cognitionthe individual action-possibilities that the environment presents to the bodythen the grammar of interaction is the way those affordances combine into sequences, patterns, and structures.

A door handle is a single affordance. A building is a grammara structured sequence of affordances (handles, corridors, stairs, rooms, thresholds) that guides the body through space and time. A musical instrument is a grammara structured set of affordances (strings, frets, keys, pedals) whose combinations produce a musical language. Jane Alison extends this into narrative: the shapes of storiesspiral, meander, wave, radial, fractalare grammars that structure the reader's temporal experience, each one affording a different relationship between the reading body and the narrative's unfolding.

XR interaction design is grammar design. The individual affordancesreach, grasp, point, throw, gaze, stepare the phonemes. The question is how they combine. What sequences of embodied actions produce meaningful interactions? What "sentences" can the participant "speak" with their body in the virtual environment? What "syntax" governs the combinationswhat follows what, what can occur simultaneously, what is prohibited?

This is where Gibson's ecological approach and Wittgenstein's language gameswhich I explore in Essay 6converge. A language game, for Wittgenstein, is a form of life: a complete practice in which language, action, and social context are fused into a single activity. Learning to play a language game is learning to participate in a form of lifeand that learning is enactive. You do not learn the grammar from outside. You learn it by coupling with the practice, the way you learn the banjo by coupling with the instrument, the way I learned the pit by coupling with the orchestra.

The designer's task is not to specify rules for the participant to follow. It is to create the conditions under which the participant's body can discover the grammar through enactive engagementreaching, grasping, moving, and through the movement, learning what the environment affords, what combinations are possible, what "sentences" can be spoken with the body in this particular designed world.

VIII. What Bodies Already Know

The deepest insight of the enactivist position, and the one most relevant to XR design, is that bodies already know how to do this. Every human body arrives at every new environment with a lifetime of accumulated sensorimotor knowledgethe grammar of walking, reaching, grasping, looking, listening, balancing that it has developed through years of enactive engagement with the physical world. This accumulated knowledge is not stored as propositions in the brain. It is distributed across the bodyin the musculature, the vestibular system, the haptic sensitivity of the fingertips, the peripheral vision's sensitivity to motion.

When a participant enters a virtual environment for the first time, their body immediately begins reading affordanceswhat can I touch, what can I stand on, what can I reach, what happens when I move? These readings are automatic, pre-conscious, and largely accurate, because the body has spent decades calibrating its expectations against the physical world's regularities. Gravity pulls down. Solid surfaces support weight. Graspable objects fit the hand. These expectations are the body's grammarthe deep syntax of embodied engagement that XR can leverage, violate, or extend.

The VR environment that violates these expectationsthat makes solid-looking surfaces passable, or graspable-looking objects intangible, or safe-looking edges fatalis working against the body's knowledge. This can be powerful as an artistic strategy, as Essay 18's discussion of nonsense and broken systems explores. But as a default design stance, it produces disorientation, nausea, and the breakdown of couplingthe body withdraws from engagement because its grammar no longer works.

The enactivist designer trusts the body. They create environments that the body can read with its existing grammar and then, gradually, extend that grammarintroducing new affordances, new combinations, new possibilities that build on what the body already knows rather than contradicting it. The body is not a blank slate to be programmed. It is a deeply trained system that arrives with its own expertise, and the designer's job is to couple with that expertise, the way a good conductor couples with the musicians rather than overriding them. The affordances of the designed environment meet the affordances of the trained body, and from that couplingthat spinning of the thaumotrope between what the world offers and what the body bringsthe experience emerges.