Psilocybin

Developing Methods for Observing Awe Narration in Psilocybin-Assisted Therapy

This observational methods study analysed 32 psilocybin-assisted therapy session recordings from eight people with advanced cancer to test a coding system for spotting moments of awe narration. The system appeared feasible and reasonably reliable, and coders were more confident when the narration included a sense of vastness.

Authors

  • Manish Agrawal

Published

Healthcare
meta Study

Abstract

Background

Understanding the benefits of psychedelic-assisted therapy (PAT) will require scientific attention to the causal interaction between the therapeutic context and process. Measuring what actually happens during PAT in large-scale studies will be an essential component of this work.

Objective

We aim to develop and preliminarily evaluate the feasibility and reliability of a direct observation coding system for narrations of awe experiences during PAT, one hypothesized therapeutic mechanism.

Methods

We analyzed 32 PAT clinical trial encounter recordings involving eight participants from a Phase 2 clinical trial study of psilocybin-assisted therapy in advanced cancer. Using a conceptually grounded structured codebook, two human coders independently identified start and stop times for moments exhibiting definitional characteristics of awe narration, including expressions of vastness, need for accommodation and ineffability. We used coder agreement and degree of confidence to refine the coding system.

Results

During 16,760 total minutes of video, coders collectively recorded 246 moments of awe narration. Of those moments, 42% (104/246) were identified by one coder and 58% (142/246) by two coders. Coders felt substantially more confident in their judgments about a moment of awe when vastness was present compared to when vastness was absent (OR: 4.3; 95% CI: 2.4, 7.8). Iterative refinement of the coding system led to accommodation being operationalized as two distinct components: an initial cognitive disruption followed by variable engagement in the process of accommodation.

Conclusions

Awe narration is directly observable using explicit definitional criteria. This work provides the empirical foundation for scalable coding systems of awe narration during PAT.

Unlocked with Blossom Pro

Research Summary of 'Developing Methods for Observing Awe Narration in Psilocybin-Assisted Therapy'

Editorial

βBlossom's Take

Awe might be one of the acute perceptive processes driving benefits in psychedelic treatments. This study takes one step toward developing (and/or validating, as one already exists) a measure of awe.

Introduction

Tarbi and colleagues frame psychedelic-assisted therapy as a field that still needs clearer methods for understanding what actually happens during treatment sessions and how the therapeutic context may shape outcomes. They argue that large-scale, direct observation will be important for identifying causal pathways and note that emerging digital recording infrastructure and machine-learning methods make this increasingly feasible. Within that broader challenge, they focus on awe as one plausible therapeutic mechanism, especially because awe experiences may be linked to improved quality of life and may overlap with mystical-type experiences that are thought to contribute to the effects of psychedelic-assisted therapy. The paper’s aim is to develop and preliminarily evaluate a direct observation coding system for narrations of awe during psilocybin-assisted therapy. The researchers set out to create explicit, human-readable criteria that could identify moments when participants verbally described awe, both to improve understanding of the phenomenon itself and to provide manually labelled examples that could later support computational detection. In this way, the study is presented as a methodological foundation for future mechanistic research in psychedelic-assisted therapy rather than as an efficacy trial.

Methods

This was a descriptive study using video and audio recordings from a recent Phase II, open-label psilocybin-assisted therapy trial in people with cancer and major depression. The sample included 8 participants, and the researchers purposefully selected participant-therapist dyads to maximise gender identity combinations among 7 therapists and 8 participants. All participants received psilocybin. Recordings covered preparation, dosing, and integration encounters across the three-week course of therapy. The researchers developed a conceptually grounded structured codebook based on a foundational definition of awe as involving perceived vastness and a need for cognitive accommodation. They initially included three candidate features of awe narration: vastness, need for accommodation, and ineffability. Vastness referred to expressions of something being larger than the self in physical, social, or conceptual terms. Need for accommodation referred to speech suggesting a disruption of ordinary understanding that would require mental adjustment. Ineffability referred to difficulty putting the experience into words, such as long pauses, restarts, or explicit statements of expressive difficulty. Two human coders independently reviewed all encounter videos and identified the start and stop times of moments that met the coding definition. After the first four participants, they paused to review coder feedback and refine the definitions. Coders rated their confidence in each identified moment as low, moderate, or high. Moments were counted as co-identified when the observed time periods overlapped, including boundary cases where one coder’s start matched the other’s end. The researchers also marked adjacent moments that felt like awe but did not meet the criteria as boundary moments, to improve transparency and help refine the codebook. They calculated the frequency and distribution of awe narration across preparation, dosing, and integration sessions, estimated inter-rater agreement with Cohen’s kappa, and used unadjusted odds ratios with 95% confidence intervals to examine whether coder confidence was associated with vastness. Mixed-effects logistic regression was used to assess whether clustering by participant experience affected the findings. The study had ethics approval from the relevant institutional review boards, and participants gave separate consent for analysis of the recorded trial sessions.

Results

The participant sample had a median age of 59 years, with a range from 30 to 78 years. Half were female. All participants identified as white, and one also identified as Asian American/Pacific Islander. Cancer diagnoses included breast, kidney, lymphoma, and colon cancer. One quarter of the sample had an expected survival of less than two years. Across 16,760 total minutes of video, the coders identified 246 moments of awe narration. Inter-rater reliability was substantial, with Cohen’s kappa of 0.73 (95% CI: 0.68, 0.78). Of the 246 moments, 42% (104/246) were identified by only one coder and 58% (142/246) were identified by both. Among the moments coded with high confidence, 30% (31/104) were identified by one coder and 70% (73/104) by both. Very few awe narration events occurred during preparation encounters in the first half of coding, so after the first four participants the researchers focused more heavily on dosing and integration encounters. Vastness emerged as the most salient and useful feature for identifying awe narration. Coders were substantially more confident in identifying a moment as awe when vastness was present than when it was absent (OR: 4.3; 95% CI: 2.4, 7.8). This association remained similar after adjustment for clustering by participant experience (adjusted OR: 4.1; 95% CI: 2.1, 8.0) and did not differ meaningfully across dosing versus preparation-integration encounters. Ineffability was common: more than half of all awe narrations (130/237; 55%) included it, and most of these occurred during the dosing session or the initial integration encounter the following morning (75%; 97/130). The text does not present a separate numeric summary for need for accommodation, but it reports that coders refined this feature into two parts: an initial cognitive disruption and a later, variable possibility of accommodation. The researchers also describe boundary moments that showed related qualities such as beauty or metaphor but were not coded as awe because they lacked vastness.

Discussion

The authors conclude that narration of awe during psychedelic-assisted therapy is directly observable when it is defined using explicit criteria. They argue that the findings support two practical recommendations for future coding systems. First, they suggest treating vastness as the key entry point for identifying candidate awe moments, with ineffability or accommodation used afterwards to confirm the judgement. They present this as both empirically supported by the coder data and consistent with earlier conceptual work in which vastness is central to awe. Second, they recommend operationalising accommodation as a two-step process: first identifying evidence of an initial cognitive disruption, and then examining whether the participant subsequently appears to accommodate that disruption. The authors suggest that this refinement may better capture the complexity of accommodation, including cases where participants do not try to make sense of the experience. They link this interpretation to broader psychedelic-assisted therapy frameworks, including ideas from neuroscience about temporary disruption followed by new neural pathways, and to the integration model in which post-dosing sessions help participants reframe and integrate insights. The authors also emphasise that the primary contribution of the study is methodological: a direct observation codebook that could support future human coding and, eventually, automated detection in large psychedelic-assisted therapy datasets. They note that machine-learning systems require accurate human-labelled training data and that human oversight remains important for quality control and ethical reasons. The main limitations they acknowledge are the small sample size, the lack of sufficient ethnic, racial, and linguistic diversity, and the fact that the study assessed narrated awe rather than the internal subjective state of awe itself. They state that awe narration often occurred at the same time as, or shortly after, the experience, making it a plausible observable proxy, but they call for further work in larger and more diverse samples to test validity and examine how narrated awe relates to internal experience and therapeutic outcomes.

View full paper sections

OVERVIEW

This is a descriptive study of 32 PAT clinical trial encounter recordings to identify directly observable moments in which participants narrate experiences of awe. Using a conceptually grounded structured codebook, two human coders independently identified start and stop times for moments exhibiting definitional characteristics of awe narration. We used co-identification and coder degree of confidence about each moment's overall expression of awe to refine the coding system.

PARTICIPANTS AND SETTING

This sample includes participants with cancer and major depression who completed a recent Phase 2, open-label study of psilocybin-assisted therapy in which all participants received psilocybin. Data include audio and video recordings of all preparation, dosing and integration encounters for each of the 8 participants over the three-week course of PAT. We purposefully selected participant-therapist dyads to maximize gender identity combinations among therapist (n = 7) and participant (n = 8) pairs.

CONCEPTUAL DEFINITION OF AWE

Keltner and Haidt (2003) foundationally define awe as a complex emotion that involves an experience of perceived vastness and a need for cognitive accommodation to adjust to overwhelmed mental structures.

CANDIDATE FEATURES OF AWE NARRATION

The overwhelm and adjustment inherent to awe's impact on cognitive processing can present communicative challenges for those attempting to explain it. Therefore, our initial direct observation coding system included the following three key features of what the speaker said and how they said it. (1) Vastness: We defined vastness as expressions relating to anything being much larger than the self, with physical, social, and conceptual sub-types. The "self" during PAT may be experienced at any size dimension, including infinitely small; thus, coder judgments about the presence of vastness reflect the expressed comparative relationship to self. (2) Need for Accommodation: We considered expressions of the "need for accommodation" to relate to discussions that suggested a violation to one's normal understanding of the world that would require a shift in mental structures to make sense of this new experience. (3) Ineffability: Expressions of "ineffability" could be recognized when participant descriptions suggested something was too extreme to be expressed in words. Manetta and Bhatiadefine ineffability in serious narratives as the temporary inability of the narrator to express themselves adequately due to the profound nature of the experience and their affective response. They identify ineffability as characterized by long pauses (+2 s), word-and sentence-level restarts, and dedicated discourse markers (i.e., ineffability just). This stands in sharp contrast to typical disfluencies, which are often attributable to normal utterance planning and cognitive burden and tend to go unnoticed/unmentioned by both conversational partners. The following participant quote contains all primary features of awe, which have been bolded and [bracketed] for emphasis: "It was one of the most profound and um meaningful experiences of my life. Shocking of its revelations, daunting at times, sometimes unbearably beautiful [need for accommodation], and mystical, divine, and it opened up a whole world of ideas and thoughts. Any metaphor I could use would be inadequate to describe it." (Participant T, Integration.)

ANALYSES

Coders independently double coded all participant-therapist encounter videos, pausing after the first four participants to evaluate coder feedback and refine operational feature definitions. Coders identified start and stop times for each instance of awe narration, defined inclusively as conversational moments containing at least one of the following: vastness, need for accommodation and ineffability. Our coding workflow is diagramed in Figure. For each identified moment, coders also responded to the following question, "How confident am I that this moment meets our coder definition? (low, moderate, high)". This gradient approach to coding, from low to high confidence, allowed us to incorporate the ambiguity of codingwith the complex and socially constructed concept of awe, as we have done previously in PAT settings. When coders encountered a moment that "felt like an expression of awe" but exhibited none of the candidate features, we marked those separately as "boundary moments". Consider the moment below, which exhibited many secondary features in our system (metaphor, beauty), but was not coded as an awe expression because it lacked vastness. "It's so many things in life. It's the taste of your favorite ice cream. It's the smell of your favorite flower. It's the sight of your favorite child, your wife, your loved ones." (Participant T, Integration.) Examining these events helped to ensure that our definition was sufficient for capturing awe narration as we had intended and to help with scientific transparency regarding the types of adjacent moments that would not be captured by our coding system. Throughout our coding process, real-time notes about coders' experience observing moments of awe were documented as an audit trail for review at regular research meetings. Coder debriefing involved informal prompts about the overall coding experience, such as aspects of the coding process that were interesting or challenging. Clarifications needed to improve operational definitions and suggestions for improving the recognition of narration of awe experiences were also discussed. The frequency and distribution of awe narration features were calculated for preparation, dosing, and integration encounters. We considered a moment of awe narration to be co-identified when any portion of the event duration observed by one coder overlapped with that of the other coder. We included situations of a common border (e.g., start time in minutes and seconds of video time identified by one coder is identical to the end time for the other coder) to count as overlap. We calculated Cohen's kappa for inter-rater agreement and used the average duration of identified awe moments as the unit of analysis. When presenting measures of association, we use unadjusted odds ratios (ORs) and 95% confidence intervals (CIs). We used mixed-effects logistic regression modeling to evaluate potential confounding by clustering of unmeasured phenomena for the 8 participant experiences. Institutional review boards at Advarra (parent clinical trial) and the University of Vermont (analyses of trial video data) approved all study protocols. Participants completed separate informed consent for analysis of clinical trial video recordings.

RESULTS

The median age for participants in this sample was 59 years (range: 30-78); half were female (50%), and all identified as white (100%), with one individual also identifying as Asian American/Pacific Islander (12.5%). Cancer diagnosis varied, including breast (37.5%), kidney (25%), lymphoma (25%), and colon cancer (12.5%). Twenty-five percent had an expected survival of less than two years. During 16,760 total minutes of video, coders collectively recorded 246 moments of awe narration with substantial inter-rater reliability (Cohen's kappa: 0.73; 95% CI: 0.68, 0.78). Of those moments, 42% (104/246) were identified by one coder and 58% (142/246) by two coders. Of those moments coders designated with "high confidence," 30% (31/104) were identified by one coder and 70% (73/104) by two coders. Very few awe narration events happened during preparation encounters during the first half of coding (5 of 168 total events), so we focused on dosing and integration encounters after coding the first four participants.

VASTNESS

Upon reflection, coders determined that vastness was highly salient and an obvious indicator for awe expressions. Coders felt substantially more confident in their judgments about a moment of awe when vastness was present compared to when vastness was absent (OR: 4.3; 95% CI: 2.4, 7.8). This association was not attenuated when adjusting for potential clustering by specific participant experience (OR adj : 4.1; 95% CI: 2.1, 8.0), nor did it differ substantially for dosing encounters and preparation-integration encounters.

INEFFABILITY

Awe narration frequently included intervals in which the participant seemed to struggle to find the right words to describe their experience. More than half of all awe narrations (130/237; 55%) featured ineffability, and most of these occurred during either the dosing session or the initial integration encounter the morning after the dosing experience (75%; 97/130).

NEED FOR ACCOMMODATION

Observed expressions of accommodation comprised two fundamental elements. The first involves description of an initial cognitive disruption from one's normal understanding of the world, and the second, the status of its potential accommodation. This second element raised challenges for coders, having encountered instances in the data in which participants seemed to resist or be uninterested in meaning-making or coming to an understanding of an experience outside of their mental structures. Coders were frequently uncertain about whether expressions indicated a "need" or even a "want" for accommodation. Therefore, we included situations regardless of participant intention or desire to attempt such accommodation. One participant discusses their experience in this way: "Well, I don't have to be concrete about it and say, define it. I don't have to. I could just experience it and feel it and remember it. Um, I don't, I don't have to put it in words." Final coding definitions and illustrative examples of vastness, need for accommodation and ineffability are described in Table. Our final coding process is diagramed in Figure.

VASTNESS

Expressions relating to anything being much larger than the self (i.e., physically, socially, conceptually). Lexical items referring to size or relative size of the physical world or iconically large things (e.g., NYC, Redwood Trees, Ocean), similes or metaphors tied to expansive concepts (e.g., time, space, infinity, the universe). "I was in that space where there was this, in front of me, like this cascade, and it was a cascade of stories, it was the infinite possibilities of all the stories that could ever exist." (Participant E, Integration.)

COGNITIVE DISRUPTION

Discussions related to an experience that is radically new or outside the current cognitive framings of the participant.

INEFFABILITY

When a participant is unable to articulate some aspect of their experience because of its profound nature and due to the strength of their affective response. Long pauses (+2 s), wordand sentence-level restarts, dedicated discourse markers (i.e., ineffability just), explicit acknowledgement of expressive difficulty. "I don't know how-it's hard to put into words. . ." (Participant I, Dosing.)

DISCUSSION

Our findings suggest that the narration of awe experiences in psychedelic-assisted therapy is directly observable and support two empirical recommendations for coding systems. First, our findings support recommending the expression of vastness as the crucial "point of entry" for coders to confidently identify candidate awe narration moments in need of subsequent review and confirmation. Though it is possible that vastness emerged as a key feature partially due to its relative salience and ease of identification by coders, thisapproach aligns with Keltner and Haidt's original proposition that vastness is foundational to the experience of awe, while preserving the particularity of the PAT encounter through the additional requirement of ineffability or accommodation. Second, we suggest operationalizing the notion of accommodation by first identifying narrative evidence of an initial substantial cognitive disruptionand, if present, then proceeding to consider the potential subsequent accommodation of that disruption. This may help to disentangle the complex and challenging-to-measure construct of "need for accommodation". This nuanced characterization of the accommodation process acknowledges that some experiences of awe may never be accommodated, and that some PAT participants may be unable or uninterested in trying to do so during their therapeutic process. Conceptually, this representation of the accommodation process is aligned with newer understandings of PAT based in neuroscience, in which neural networks are first disrupted so that new neural pathways may then be mapped. It is also in keeping with the integration model advanced in this therapeutic approach to PAT, which holds that the post-dosing sessions offer opportunities to restructure or reframe insights and integrate the experience. The primary outcome of our study is the development of a direct observation coding system for use by human coders. Given the size of PAT datasets (containing thousands of hours of recorded clinical trials), our codebook may also be deployed in the future to inform automated detection of moments of awe narration. While automated machine learning can enhance the efficiency and scalability of identifying patterns in therapeutic settings, these tools-particularly those that leverage supervised learning-struggle with ambiguous or complex training data. Robust and reliable automation requires learning from a subset of accurate "training" data manually labeled by humans. Thus, human oversight and refined codebooks, such as the one developed here, are essential for quality control, validation, and ensuring the ethical considerations of the labeled data. This study has important limitations. First, our observations arise from a small sample of PAT participants. Second, our sample lacked sufficient ethnic, racial, and linguistic representations to understand how awe narration may differ across populations and languages. It may well be that this coding system requires adaptation for deployment in a larger, more diverse sample. Third, our analyses do not attempt to quantify nor qualify the degree of awe experienced by participants (awe as an affective state) but instead investigate the nature of awe narration. We do note that in this study, awe narration often occurred simultaneous to or only shortly following the experience of awe, better positioning narration as an observable proxy for experience. Further work will be required to evaluate the validity in larger populations and the degree to which narrated awe is associated with both the internal experience of awe and the beneficial outcomes of PAT. analysis and writing-review and editing. M.R.: methodology and writing-review and editing. D.M.R.: conceptualization, formal analysis, methodology, and writing-review and editing. R.G.: conceptualization, formal analysis, methodology, and writing-review and editing. M.A.: conceptualization, investigation, methodology, and writing-review and editing. E.M.: conceptualization, formal analysis, methodology, writing-original draft, and writing-review and editing. All authors have read and agreed to the published version of the manuscript.

Full Text PDF

Full Paper PDF

Create a free account to open full-text PDFs.

Study Details

References (5)

Papers cited by this study that are also in Blossom

Long-term benefits of single-dose psilocybin in depressed patients with cancer

Agrawal, M., Roddy, K., Jenkins, B. et al. · Cancer (2025)

2 cited
Patient Experiences of Psilocybin-Assisted Psychotherapy: An Interpretative Phenomenological Analysis

Belser, A. B., Agin-Liebes, G. I., Swift, T. C. et al. · Journal of Humanistic Psychology (2017)

300 cited
Serotonin and brain function: a tale of two receptors

Carhart-Harris, R. L., Nutt, D. J. · Journal of Psychopharmacology (2017)

727 cited
Pivotal Mental States

Brouwer, A., Carhart-Harris, R. L. · Journal of Psychopharmacology (2020)

145 cited