I. The Ship in the Bottle
Jesse Schell, in The Art of Game Design, makes a claim so obvious it's easy to miss: the game is not the experience. The game is an artifact—cardboard, code, a bag of bits. The experience is what happens in the player's mind when they interact with that artifact, and it is invisible, unshareable, and can never be directly observed. The designer's job is to build something that will generate a particular experience in someone else's consciousness, without ever being able to see or touch the thing they're actually making.
Schell calls this "building a ship in a bottle." You're working at a permanent remove from the thing you care about. You create an artifact, a player interacts with it, and you cross your fingers that the experience arising from that interaction is something they'll find meaningful. You never truly see the output of your work, since it takes place inside someone else.
I know this problem in my body. Years before I encountered Schell, I was a banjo player in pit orchestras, and every night I experienced the gap he describes from the other side. I was producing the sonic architecture of a theater production—the rhythmic drive, the harmonic color, the emotional underscore—and I could never see what that architecture produced in the audience above me. I could hear them laugh, gasp, sometimes cry. But the experience I was helping create was invisible to me. I was building the ship. The bottle was on the other side of the stage floor.
This is the foundational problem of experience design—and it is the problem that virtual reality promises, and fails, to solve.
The promise is seductive. If the designer's real material is subjective experience, then surely a technology that surrounds you in a designed world, that tracks your head and hands and responds to your movement, that fills your visual field and replaces your acoustic environment—surely this technology closes the gap between artifact and experience. Surely VR lets you stop building the ship in the bottle and just build the ship.
But it doesn't. And the reasons it doesn't are the subject of this essay. Understanding why VR fails to close the experience gap—and what it does instead, which is something far more interesting—requires taking seriously what embodied cognition actually claims about how minds work.
II. What Embodied Cognition Actually Claims
Embodied cognition is not a single theory but a family of converging claims, and their convergence matters more than any individual formulation. Francisco Varela's enactive approach argues that cognition is not the manipulation of internal representations but the ongoing coupling of an organism with its environment through sensorimotor activity. Andy Clark's extended mind thesis argues that cognitive processes don't stop at the skull but extend through tools, technologies, and environmental structures that participate in thinking. Alva Noë's enactivism argues that perception is not something that happens to you but something you do—a skilled activity of exploration.
And beneath all of these, providing the philosophical ground they grow from, is Maurice Merleau-Ponty. In Phenomenology of Perception, Merleau-Ponty argues that the body is not an object in the world but the subject of experience. The body-subject doesn't have a perspective on the world; it is a perspective. Consciousness is not a ghostly presence observing the body from the inside; consciousness is the body's ongoing, active engagement with its surroundings—reaching, grasping, orienting, anticipating.
The body schema—Merleau-Ponty's term for the pre-reflective sense of what the body can do—is not a map of the body stored somewhere in the brain. It is the lived sense of one's own capacity for action, constantly updated by movement and feedback. When I reach for a glass, I don't calculate the trajectory; my hand arrives because my body schema already includes the glass as within-reach, part of the field of possible action that constitutes my perceptual world.
This has a specific implication for VR that the field has been slow to absorb: presence is not a perceptual state. It is not something produced by visual fidelity, spatial audio, or low latency, though all of these contribute. Presence is a motor state. It is what happens when the body schema extends into the virtual environment—when reaching, looking, stepping, and grasping in virtual space feel continuous with the body's sense of what it can do. Presence is the "I can" that Merleau-Ponty describes: the felt sense that this world affords action, that my intentions have purchase here.
III. The Paradox
Here is where it gets strange.
VR introduces what I call the presence paradox: the technology promises presence by entraining the same sensorimotor loops that bind action to perception in the physical world, while simultaneously creating a condition of dual occupancy, in which the user inhabits a physical body in one space and a virtual body in another. You are standing in your living room and standing on a cliff. Your feet are on carpet and on stone. Your hand grips a plastic controller and a virtual sword.
This dual occupancy is not a problem to be solved. It is the fundamental condition of the medium.
The paradox sharpens as fidelity increases. When the simulation is crude—a wireframe world, a blocky avatar—the body doesn't invest. It treats the virtual environment the way it treats a photograph: interesting to look at, but not a place where action has consequences. But as fidelity increases—as visual rendering approaches photorealism, as haptic feedback becomes more precise, as spatial audio becomes more convincing—the body begins to invest. The sensorimotor loops start to close. The body schema begins to extend.
And then the seams show.
A micro-stutter in hand tracking. A 12-millisecond latency spike. An inconsistency between the weight you see and the weight you feel. These mismatches, which would be invisible in a low-fidelity environment, become more salient as the overall fidelity increases, because the body is now testing the environment with every glance, reach, and step. The more convincing the simulation, the more the body invests, and the more it notices when the simulation fails to reciprocate.
This is the uncanny valley applied not to faces but to entire worlds. And unlike the facial uncanny valley, which is primarily visual, the presence uncanny valley is felt through the whole body—through proprioception, vestibular sense, motor expectation. It produces not just visual unease but a felt wrongness, a dissociation between intention and consequence that registers as something closer to nausea than to aesthetic displeasure.
IV. What the Neuroscience Actually Shows
Frank Rose, in The Sea We Swim In, traces a line of research that illuminates what's happening in the brain during immersive experience—and the findings don't say what the VR industry wants them to say.
Uri Hasson's neural coupling research at Princeton uses functional MRI to observe what happens when people experience a successful narrative. What he finds is that a well-told story literally synchronizes the brains of its audience—activating the same regions in the same sequence across different listeners. In an experiment comparing responses to a Hitchcock thriller, a Sergio Leone western, and a Larry David comedy, Hasson found that the Hitchcock episode—the most tightly controlled narrative—evoked similar responses in more than 65 percent of the neocortex. A master storyteller, Hasson concludes, is able to orchestrate the responses of many different brain regions in many different viewers, "turning them on and off at the same time."
The implication is that presence—the felt sense of being in a narrative, in a world—is not produced by sensory bombardment. It is produced by coherent temporal structure. Hitchcock synchronizes brains not because his images are photorealistic but because his narrative is tightly composed. The what-happens-next is so precisely controlled that every viewer's predictive machinery locks into the same pattern.
This matters enormously for VR design. The industry has invested billions in visual fidelity—higher resolution, wider field of view, more realistic rendering. Hasson's research suggests that the real mechanism of immersion is not sensory resolution but temporal coherence: the degree to which the environment's responses are tightly coupled to the participant's actions and expectations. A lower-fidelity environment with perfect temporal coupling will feel more present than a photorealistic environment with intermittent latency.
Rose also presents Melanie Green and Timothy Brock's transportation research, which demonstrates something the VR field has largely ignored: narrative immersion changes beliefs. In Green's foundational study, participants who were deeply "transported" by a story about a child's murder at a shopping mall came away believing that psychiatric patients should be more closely supervised and that violence in public places was more common—regardless of whether they'd been told the story was true or fictional. The transportation effect was independent of factual status. Fiction and nonfiction produced the same belief changes, as long as the reader was immersed.
And then there's Geoff Kaufman's identity-assumption studies, which may be the most unsettling finding for anyone designing immersive experiences. Kaufman found that readers who encountered a character's identity-defining trait (being gay, being Black) late in a story were significantly more likely to identify with that character and to shift their attitudes than readers who encountered the same trait early. The "gay-late" readers came away with more favorable attitudes toward gay people in general. The "Black-late" readers were less likely to rate the protagonist as hostile and less likely to express subtly racist attitudes.
The mechanism is transportation: once you're immersed—once your predictive machinery has locked onto the character's perspective—you've already accepted the character before the identity marker arrives. The revelation comes too late to trigger the defensive response that would have prevented identification in the first place.
This has direct implications for the presence paradox. It suggests that the most transformative immersive experiences may be the ones in which the participant doesn't fully realize what's happening to them until it's already happened. Not because they've been deceived, but because immersion works by bypassing the critical faculties that would normally filter incoming experience. Transportation doesn't just feel like absorption. It is a temporary restructuring of the boundary between self and not-self.
And if that's true in a novel or a movie, where the body is sitting still in a chair—what happens when the whole body is engaged, when the sensorimotor loops are closed, when the environment is responding to your movement in real time?
V. The Three Disciplines
Schell argues that no single field of study can map the territory of human experience. He identifies three approaches—psychology, anthropology, and design—each of which captures something the others miss.
Psychology gives you mechanisms. The behaviorists can tell you what people do under controlled conditions; the phenomenologists can tell you what experience feels like from the inside. But psychology, in its commitment to scientific rigor, is "obligated to avoid the thing we care about the most—the nature of human experience." The measurable proxy replaces the thing itself.
Anthropology gives you empathy. Cultural anthropologists "live with their subjects of study and try to immerse themselves completely in the world of the people they are trying to learn about." They combine objective observation with introspection, putting themselves in the place of their subjects to imagine what experience feels like from the inside. But anthropology's insights are particular, situated, hard to generalize.
Design gives you craft knowledge—rules of thumb, hard-won intuitions about what works. But designers "seldom publish papers about their discoveries. The very best designers in various fields often know little about the workings of other fields of design."
Abraham Burickson, in Experience Design: A Participatory Manifesto, arrives at a strikingly similar conclusion from the direction of immersive theater rather than game design. His "rigorous empathetic research" requires three simultaneous modes: experiential research (putting yourself in the conditions you're designing for), relational engagement (being physically present with the people who will participate), and inclusion (involving community members in the development of the experience itself). No single mode is sufficient. Empathy, for Burickson, is not a feeling but a methodology—and it requires the body.
What both Schell and Burickson are describing, without using the term, is what Merleau-Ponty would call intersubjectivity: the capacity to understand another's experience not by reasoning about it but by inhabiting a shared perceptual field. The body doesn't infer another's experience; it resonates with it, through gesture, tempo, rhythm, facial expression, the thousand micro-signals that pass between co-present bodies.
Peter Meineck, in Theatrocracy, shows that the ancient Greeks already understood this. The Theater of Dionysus was not designed for spectatorship alone. It was designed for intersubjective resonance—the bowl shape that let fifteen thousand citizens see each other watching, the masks that created a coupling between performer and audience, the chorus that gave collective voice to collective feeling. The architecture produced intersubjectivity through design. Meineck is describing what Hasson measures: neural coupling as the mechanism, designed space as the technology.
This is the epistemological foundation for everything that follows in this book. We cannot directly observe the experiences we design. We cannot measure presence with a questionnaire. We cannot close the gap between artifact and experience by increasing visual fidelity. What we can do is develop a practice—part phenomenological, part empirical, part embodied—that takes seriously the fact that presence is negotiated by the whole body, that transportation changes people whether they know it or not, and that the designer's ignorance of the participant's experience is not a limitation to be overcome but a condition to be respected.
VI. Presence as Negotiation
With this framework in place, we can describe presence more precisely.
Presence, phenomenologically, is a dynamic negotiation among three elements: intention, affordance, and prediction. Intention organizes the next possible action—I am reaching for the door handle. Affordance specifies what the environment allows—the handle is graspable, the door can be opened. Prediction closes the loop by anticipating the sensory consequences of movement—when I pull, I expect to feel resistance, hear the latch, see the door swing.
Visuomotor contingencies—how visual change maps to bodily motion—either reinforce agency or produce dissociation. When head turns produce immediate, coherent parallax and audio rotates accordingly, the user experiences ownership and control. When the mapping slips—subtle drift in hand pose, variable controller inertia, off-tempo audio—the user's forward models fail, and attention shifts from the world to the interface.
Experientially, the locus of attention migrates from task to tool: grasping gives way to troubleshooting, exploration to self-monitoring. The "I can" that Merleau-Ponty describes becomes an "I must manage," and the system is felt as external rather than incorporated. This is what Heidegger calls the shift from ready-to-hand to present-at-hand—and it is the phenomenological signature of presence failure.
But here is where the standard account goes wrong: it treats presence failure as a technical problem with a technical solution. Fix the latency. Increase the resolution. Add haptic feedback. The assumption is that if the technology were perfect, presence would be continuous.
This assumption is false, and it's false for the reason the dual occupancy paradox identifies: the user always has a physical body in physical space, and no amount of technological improvement can erase that body. The physical body continues to breathe, to feel gravity, to sense temperature, to notice the weight of the headset. These signals don't disappear when the virtual world becomes more convincing—they become the ground against which the virtual world is measured.
What good VR design does is not eliminate this tension but compose with it. The most effective immersive experiences don't try to make you forget your body. They give your body something meaningful to do—something that creates continuity between physical sensation and virtual consequence. This is why room-scale VR works better than seated VR for presence. Not because it's more visually immersive, but because walking, reaching, crouching, and turning engage the body schema in ways that sitting and pointing cannot. The metaphor I will return to throughout this book is the thaumotrope—a Victorian disc with a bird on one side and a cage on the other that, when spun, produces the image of the bird inside the cage. Presence works the same way. It is not on either side of the disc. It is in the spinning.
VII. The Essential Experience
Schell introduces a concept he calls "essential experience"—the idea that you don't need to perfectly replicate a real experience to create a compelling designed one. What you need is to identify the elements of an experience that are load-bearing—the ones that define it and make it what it is—and find ways to deliver those elements through your medium.
His example: if you're designing a snowball fight game, you don't need real snow. You need the feeling of cold (sound effects, visual breath), the feeling of rivalry (competitive mechanics), the feeling of exhilaration (fast pacing, unexpected hits). You strip away everything that doesn't carry experiential weight—the corduroy pants, the mints in your pocket—and amplify everything that does.
This is phenomenological reduction applied to design. Husserl brackets the natural attitude to reveal the essential structures of consciousness. Schell brackets the incidental features of experience to reveal the essential structures of engagement. Both are asking: what is the invariant core of this experience? What can you remove without destroying it?
For VR presence, the essential experience is not visual realism. It is reciprocity—the felt sense that the environment responds to your body in ways that are lawful, consistent, and meaningful. The systems that feel most "real" are not the ones with the most polygons. They are the ones that treat the body's signals with the most respect, closing the loop between intention and consequence with the tightest, most reliable coupling.
The implications for design are concrete. Invest in temporal coherence before visual fidelity. Ensure that every tracked movement produces a consequence the body can predict. Design interaction grammars that are learnable—that let the body develop expectations it can rely on. And accept—as a feature, not a bug—that the physical body will always be present, always sensing, always testing.
VIII. What This Book Will Argue
The essays that follow build on this foundation. Each one takes a different aspect of embodied experience—affordance, narrative structure, constraint, memory, tool use, distributed consciousness, ritual, music, movement—and asks what happens when that aspect meets immersive technology. The answers are sometimes hopeful and sometimes alarming, and they are always grounded in the same claim:
The body is not a peripheral to the mind. It is the site where experience becomes structure—where sensation becomes memory, where repetition becomes skill, where designed encounters become lasting change. Every immersive experience is an act of bodily inscription. Every VR session writes something on the participant's body that was not there before: a new motor habit, a recalibrated sense of space, an emotional association, a shifted threshold of arousal or discomfort.
Jane Alison would add that the shape of that inscription matters—that the temporal-spatial pattern traced by the body as it moves through the experience determines what kind of change is possible. A spiral inscription returns the body to what it has already passed through, deepening with each pass. A radial inscription circles an absence, never arriving, always approaching. These are not metaphors. They are the morphological vocabulary of experiential design, and this book will develop them in detail.
This is the thaumotrope I described in the introduction, stated as a research program. The body on one side of the disc, the designed environment on the other, and presence—the felt sense of being there, the experience that Schell can never directly see—produced by the spinning between them. Not in the body alone. Not in the technology alone. In the coupling. In the spin.
Rose's transportation research shows that narrative immersion changes beliefs. Schell's essential experience framework shows that designed artifacts create experiences the designer can never fully see or control. Merleau-Ponty's phenomenology shows that the body is the subject of experience, not its container. Put these together and you arrive at the thesis of this book: immersive experience design is the practice of inscribing bodies with experiences that the designer cannot directly observe, using a technology that bypasses the participant's critical faculties, in a medium whose effects are cumulative, embodied, and irreversible.
That is both the extraordinary potential and the profound responsibility of this work. The body remembers.