Mixed-methods analysis on psychedelic-augmented... — Research Summary & Context | Blossom

Healthy VolunteersSet & SettingEquity and EthicsAyahuasca

Mixed-methods analysis on psychedelic-augmented meditation experiences from a randomized controlled mindfulness retreat

This secondary analysis of a randomised controlled trial of DMT/harmine in experienced meditators found that meditation under DMT-harmine differed from placebo in its thematic and experiential profile while still sharing some semantic features. Using NLP-based analysis of phenomenological interviews, the study also suggested overlap between meditative and psychedelic states and highlighted the strong influence of context and spiritual framing on subjective experience.

Authors

Milan Scheidegger
Daniel Meling

Published

March 18, 2026

Scientific Reports

individual Study

Links

Read Paper DOI Google Scholar

Abstract

The acute subjective effects (ASEs) of psychedelic substances are assumed to play a critical role for their therapeutic and well-being enhancing benefits. However, recent work voiced critique regarding the validity and adequacy of conventional measures and modalities utilized to study ASEs of psychedelics, and call for data-driven, unbiased, and experience-based research approaches. The emergence of advanced Natural Language Processing techniques as an enabler of data-driven qualitative research holds promise for addressing the current biases and limitations in the investigation of ASEs of psychedelics. In the present study, we employed an NLP-driven, multi-method analytical paradigm to study the subjective experiences of participants in an ecologically valid RCT examining the effect of DMT/harmine on meditative states in experienced meditators using phenomenological interviews. Our analysis showed differences in the thematic landscape and experiential diversity of meditation under placebo and meditation under DMT-harmine while showing overlap in their semantic topographies. The mixed-modal analysis successfully identified a wide range of well-established primary subjective effects while also detecting subtle, patterned regularities in language that traditional hypothesis-driven approaches alone may overlook. It revealed a pronounced use of Buddhist concepts and spiritual jargon to describe and integrate the subjective experience, independent of the experimental condition. Findings suggested shared experiential features between meditative and psychedelic states, a strong drug-context interconnection and potential synergistic effects of meditation and psychedelics. We advocate for using NLP-augmented, data-driven paradigms to deepen the understanding of psychedelic subjectivity and emphasize the importance of extra-pharmacological factors in shaping therapeutic outcomes.

Unlocked with Blossom Pro

Research Summary of 'Mixed-methods analysis on psychedelic-augmented meditation experiences from a randomized controlled mindfulness retreat'

Related Clinical Trial

CompletedPhase I

Mindfulness and Psychedelics

This double-blind, placebo-controlled trial (n=40) aims to investigate the potential neurophysiological synergy effects between mindfulness meditation and psychedelics, particularly ayahuasca, on experienced meditators.

Started: February 20, 2023
Type: interventional
Blinding: quadruple
Randomized: Yes
Registry ID: NCT05780216

Related Papers

Open Accessindividual

Editorial

βBlossom's Take

Another recent study dispelled the notion that psychedelic and psychosis experiences are similar. This study makes the case that ayahuasca (or 'pharmahuasca') experiences have significant overlap with what long-term mediators experience. Two other papers (so far) also report on other aspects of this trial.

Introduction

The authors frame the study around a longstanding debate in psychedelic science: acute subjective effects are widely considered important, and they correlate with later therapeutic benefits, but it remains uncertain whether these experiences are causally instrumental to change or simply accompany other drug effects. They note that standard questionnaires such as the Mystical Experience Questionnaire and 5-Dimensions Altered States of Consciousness scale have been criticised for being too fixed, conceptually limited, and unable to capture the full richness of first-person meaning-making. They also point to emerging evidence that meditation can produce altered states resembling psychedelic experiences, and that combining meditation with psychedelics may be informative, particularly because previous work has largely relied on psychometric tools or large public report databases rather than structured experimental narratives. The study set out to test whether natural language processing, specifically BERTopic topic modelling, could be used alongside manual qualitative analysis to examine phenomenological interview data from a double-blind, placebo-controlled randomised controlled trial conducted during a 3-day mindfulness retreat. The researchers aimed to compare the experiential landscapes of DMT/harmine and placebo-enhanced meditation, identify shared and divergent themes, and determine whether unsupervised NLP could detect latent phenomenological patterns that might be missed by conventional theory-driven approaches. The paper is presented as an ecologically valid analysis of structured interview data from experienced meditators in a controlled retreat setting, with relevance for understanding psychedelic phenomenology and the role of extra-pharmacological factors such as set and setting.

Methods

The study analysed qualitative interview data from a double-blind, placebo-controlled, between-subject randomised controlled trial conducted in 2023. Forty healthy participants were randomly assigned to DMT-harmine or placebo, with the groups matched for gender and reported to be similar at baseline. They attended one of two structurally identical 3-day meditation retreats at the Zen meditation centre Stiftung Felsentor in the Swiss Alps. The retreat consisted of preparation on day 1, drug or placebo administration on day 2, and integration on day 3. Participants practised sitting meditation, walking meditation, mindful physical work, short breaks and meals throughout the retreat. On the second day, participants received incremental sublingual doses of DMT-harmine or matched placebo in double-blind fashion while continuing meditation practice. Each active tablet contained 30 mg of DMT and 30 mg of harmine, and four tablets were given at 30-minute intervals for a total dose of 120 mg of each compound. The placebo tablets were taste-matched with sucralose, menthol and peppermint. Acute subjective experience was also assessed repeatedly with psychometric scales at eight time points during that day, although the present paper focuses on the interview data rather than those scale outcomes. The qualitative dataset came from phenomenological interviews inspired by the Micro-Phenomenological Interview method, which is designed to elicit detailed chronological descriptions of a specific lived experience while minimising retrospective distortion. Interviews were conducted after the second retreat phase. Because of staff and time constraints, 28 participants were interviewed in total, drawn randomly from the pool: 13 from the verum condition and 9 from placebo are reported here for the NLP analyses, while all 24 transcripts appear to have been used in the manual analysis. Some interviews were conducted in person shortly after the experience, and others online up to 11 days later. All interviews were audio recorded and the study was double-blind at the time of data collection. For the automated analysis, transcripts were cleaned and segmented in Python using spaCy and sentence-based segmentation. The researchers used BERTopic, an unsupervised topic-modelling approach built around sentence embeddings, UMAP dimensionality reduction, HDBSCAN clustering, CountVectorizer tokenisation, class-based term frequency-inverse document frequency weighting, and maximal marginal relevance to refine topics. Nouns and adjectives were emphasised, stop words were removed, and very incoherent or non-informative clusters were merged into a single noise topic. The model was applied first to the full corpus and then separately to the verum and placebo transcripts. For the manual component, two researchers independently coded the transcripts using a hybrid approach based on predefined codes informed by established psychometric scales and the team’s assumptions, while also allowing inductive new codes to emerge. Sentence-level coding was used, and intercoder disagreement was iteratively resolved with mediation until overall agreement was about 90%. A frequency-based thematic analysis then focused on seven main codes and related subcodes, including psychometrics, control, sensations, meaningfulness, well-being/life satisfaction, side effects and valence, together with contextual factors such as placebo versus substance. The manual and automated findings were compared and discussed within the research team.

Results

Across the full corpus, the BERTopic analysis yielded 28 distinct topics plus one noise topic. The researchers reported a complex semantic landscape with both shared and divergent themes across the two conditions. About 82.1% of topics were centred on subjective effects. The intertopic distance map suggested both overlap and separation: some topics formed dense clusters that were semantically close, while four more distant topic islands indicated condition-specific or more distinct themes. The authors describe this as a large thematic diversity score and interpret it as evidence of meaningful experiential differentiation. When the verum transcripts were analysed alone, BERTopic produced 24 distinct topics plus one noise topic, with 20 topics shared with the across-group model. The placebo-only analysis produced 11 distinct topics plus one noise topic. The paper states that the verum group showed greater thematic overlap with the across-group analysis, and that both analyses displayed two main hubs around subjective effects. The placebo analysis also surfaced a placebo-only topic labelled “Placebo effect”. In the manual coding analysis, all 24 transcripts were included. The most frequent top-level codes were Control, Mental Awareness and Psychometrics, with global counts of 346, 312 and 164 respectively. Four new top-level codes emerged inductively, leading to a final code system of 39 codes and subcodes. The manual analysis produced nearly three times as many coded sentences in the verum group as in placebo: 886 versus 320. This difference was especially marked for codes reflecting subjective effects, including the psychometrics subcodes and sensations. Mental Awareness appeared in 96% of interviews. The authors report that awareness of mind and body was salient in both groups, but the verum condition contributed more of the codes for visual, auditory and cognitive aspects. Emotional codes were present in 85% of interviews, with 76.2% of all emotional codes in the verum group. For cognition, the verum-to-placebo count ratio was particularly pronounced at 189 to 39. Bodily Awareness appeared in 92% of interviews and was also almost twice as frequent in the verum group. Sensory alterations were more common under DMT-harmine. Across all interviews, 71% contained auditory codes and 54% contained visual codes. Auditory experiences were more than twice as frequent in the verum group, ranging from heightened or “pure” hearing to synaesthesia and auditory pareidolia. Visual experiences were likewise much more common in the verum condition, with only 3 of 38 visual codes appearing in placebo. These ranged from increased light sensitivity to visual synaesthesia, pseudo-hallucinatory phenomena and complex imagery with eyes open or closed. The text also reports that positive and negative emotional valence were coded, but the extracted passage breaks off before giving the full numerical result for this section. Later in the Results, the authors state that 82.5% of participants guessed their group correctly after the retreat day-2 experience. They also note that verum participants tended to report more intense, diverse and acute subjective effects, whereas placebo participants reported more body-centric, familiar and context-shaped experiences. The placebo narratives contained more references to coping, containment and retreat-context factors, while verum narratives converged more strongly on acute psychedelic-like effects. The authors characterise this contrast as a focusing effect under DMT-harmine, with vivid and sometimes overwhelming experiences concentrating attention and language around acute effects.

Discussion

The authors argue that the study shows the value of combining unsupervised NLP with manual qualitative analysis to study psychedelic subjectivity in a small, structured trial dataset. They say the automated topic models captured both established acute psychedelic effects and context-specific themes tied to the retreat setting, and that the findings demonstrate the sensitivity of NLP methods to experiential details that may fall outside traditional questionnaire-based frameworks. They also emphasise that the mixed-method approach allowed triangulation: the manual coding and BERTopic outputs converged on a strong group effect, supporting the view that the observed differences were not merely investigator artefacts. The authors place their findings in the context of earlier work suggesting that language can reveal aspects of psychedelic experience that psychometric scales may miss, and that meditative and psychedelic phenomenology can overlap. They interpret the strong use of Buddhist and spiritual language across both conditions as evidence that participants drew on a contemplative framework to make sense of their experiences, which they suggest may have been shaped by the participants’ existing meditation backgrounds and the retreat context. They also note that such language may serve as a bridge for articulating ineffable states, and may be relevant for therapeutic integration. At the same time, they stress experiential divergence between conditions. In their interpretation, DMT-harmine produced a more intense, diverse and acute subjective effect profile, whereas placebo-enhanced meditation was less heterogeneous and more closely tied to familiar meditative and bodily experiences, often framed through coping and containment rather than surrendering to novelty. They suggest that the placebo condition still contained meaningful contextual and experiential effects, and that environmental influences may have been active in both groups even if they were expressed differently. The authors acknowledge several limitations. They note that some topic clusters were less coherent and that unsupervised topic modelling can yield less interpretable outputs when datasets are small. They caution that effectiveness depends on sample quality and size, and recommend care when applying BERTopic to limited corpora. They also point out that the manual coding process was potentially susceptible to investigator bias, especially because coders were aware of experimental assignments and because verum interviews were longer and more vivid. Finally, they state that the retreat setting, while increasing ecological validity, makes it difficult to separate pharmacological effects from contextual influences, so the findings should be treated as exploratory and specific to this experimental context rather than broadly generalisable. The authors suggest that future work should continue to use data-driven, multi-method approaches and should pay close attention to extra-pharmacological factors, including set and setting, when studying psychedelic outcomes.

Conclusion

The conclusion states that this study is an early step towards more qualitative, less biased, and more data-driven research on psychedelic subjective effects, and argues that transformer-based NLP methods such as BERTopic can complement traditional qualitative approaches by detecting experiential features that theory-driven methods may miss while also reducing time and labour demands. It also emphasises that extra-pharmacological factors appear to strongly shape subjective effects, supports the idea that experiences induced by psychedelics or mindfulness-based practices may be central to therapeutic and wellbeing benefits, and calls for stronger phenomenological framing and improved psychometric approaches in future research.

View full paper sections

STUDY PROCEDURES AND SETTING

The main study was designed as a double-blind, placebo-controlled, between-subject trial. Forty healthy participants were randomly assigned to one of two groups (DMT-harmine or placebo) which were matched for gender. The two groups were similar across several baseline variables (Supplementary data I). Participants took part in one of two structurally identical 3-day meditation retreats held at the Zen meditation center Stiftung Felsentor in the Swiss Alps. Throughout the retreat, participants practiced sitting meditation in structured daily group sessions. The meditation practice was interleaved with walking meditation, mindful physical work, short breaks and meals. The meditation retreat consisted of three phases: preparation (day 1), placebo or Drug administration (day 2), and integration (day 3). Each evening participants filled out structured questionnaires. On the second day of the retreat, participants received incremental doses of DMT-harmine or placebo while continuing their meditation practice. Study drugs were administered in a double-blind fashion. Participants were encouraged to adhere to the formal retreat structure only to the extent to which it remained comfortable to them and to take a gentle approach to meditation. Even more, the retreat schedule was adapted on that day, containing additional elements, such as guitar play and a gong ceremony, and relaxation periods after the last dose administration. In addition, participants acute subjective experience was assessed through psychometric scales (quantitative) eight times during that day: 90 minutes prior to the first dose administration, and 30, 60, 90, 120, 180, 240 and 360 minutes after the first dosing. Please find the original publicationfor further information on psychometric assessment and study design.

SUBSTANCE AND DOSING

The pharmacological intervention utilized sublingual tablets containing DMT and harmine as described elsewhere. Each tablet contained 30 mg of DMT and 30 mg of harmine. A total of four tablets was administered at 30-min intervals, resulting in a total dose of 120 mg of DMT and harmine. Placebo tablets were matched in taste and appearance using sucralose, menthol, and peppermint flavor for taste masking. Both, the placebo tablets and the DMT/harmine sublingual tablets were prepared for double-blind administration by a certified pharmacist at the University of Zurich and administered in randomized order based on our previous studies with DMT and harmine in healthy human subjects (BASEC-Nr. 2018-01385). For more information on the pharmaceutical formulation of the study drug please find Egger et al..

DATA COLLECTION

The qualitative data considered in the study at hand was collected using phenomenological interviews inspired by the MPI (Micro-Phenomenological Interview) method. MPI is an interview technique meant to obtain detailed chronological accounts of subjective experiences, while considerably reducing the influence of subjective biases otherwise associated with experiential reports. MPIs follow a pre-defined iterative three-steps procedure. First, the interviewer guides the subject towards a single, temporally and spatially precisely situated particular experience. The evocation is considered successful when this particular experience becomes more vivid for the participant than the present situation. Second, the interviewer facilitates acts of épochè -suspensions of conceptual superimpositions on lived experience -as he or she brings the subject back to that experience whenever their attention shifts away from it. Third, with each successive iteration, the interviewer gains progressively more detailed synchronic and diachronic descriptions of the same experience. Interviews were conducted after the second phase of the retreat. Neither the participants nor the interviewer or study team present at the study site were informed about the group allocations (double-blind). Due to time and staff constraints, we interviewed 28 participants -20 from the first and 8 from the second retreat -which were randomly selected from the participant pool. Interviews were conducted in a quiet and separate location to create a safe and pleasant atmosphere for the participant. Only 7 participants from retreat 1 were interviewed in-person, within 2 days of their experience. For logistical reasons, the residue of participants was interviewed online, no later than 11 days after their experience. All 8 participants of retreat 2 had their interview in-person and on the same day of their experience. All in all, we interviewed 13 people of the verum and 9 of the control condition. All interviews were audio recorded.

DATA PREPROCESSING

All data preprocessing was performed in Python (version 3.9 and 3.

TEXT DATA FOR NLP MODEL

For automated text annotation with BERTopic, interview transcripts had to be segmented into smaller analysis units. In line with the literature, we considered both participant's and interviewer's speaking parts for analysis. Unspecific or isolated text segments, for example silence notations or another person's comment within a person's speaking part, were removed from the text corpus. Advanced transcript cleaning and segmentation was performed using spaCy's German transformer pipeline "de_dep_news_trf". We opted for a sentence-based segmentation of the transcripts. The procedure yielded 6636 individual analysis units for all 24 interviews. For the individual experimental conditions, this led to 4609 and 2026 sentences for verum and placebo group respectively. All python scripts used for cleaning and segmentation were written by JTTS (see Supplementary data II for code example).

NLP TEXT ANALYSIS

To explore subjective effects and common themes present during participants' psychedelic or meditative experience, respectively, we utilized the state-of-the-art Python library BERTopic. BERTopic leverages transformer-based language models in several steps along the modeling process and has a modular structure that by default comprises four main steps which are run in sequence: sentence-transformers, UMAP, HDBSCAN, and c-TF-IDF. Our topic modeling procedure is informed by the work of Haag and colleaguesand was adapted to accommodate the unique characteristics of the dataset at hand. The individual steps of the procedure are outlined in Fig.(see Supplementary data III for an implementation in python code). We opted for an unsupervised topic modeling approach, which means that the topic model and its constituent sub-models were not trained on domainspecific data, resulting in a more explorative and unbiased ("unsupervised") modeling procedure. First, we transformed the text data into numerical representations, resulting in so-called "embeddings". This was done using the well-established multilingual sentence transformer model paraphrase-multilingual-mpnet-base-v2. Next, we reduced the dimensionality of the data to save memory and computational resources. The target dimensionality was defined based on the two quantitative metrics Nearest-Neighbor-Preservation (NNP)and Distance Preservation Error (DPE). Dimensionality reduction was done using the built-in UMAP algorithm. Next, we clustered the data using BERTopic's default HDBSCAN clustering algorithm. Afterwards, we tokenized the data using the well-established CountVectorizer from Scikit-Learn library. We combined Tokenization with Lemmatization; a process that converts a word to its root word (e.g., hearing to hear). Nouns and adjectives were found most informative for our research aim; hence we extracted all lemmatized nouns and adjectives from the text corpus using Part-of-Speech Tagging. Also, we provided a list of stop words that would be neglected for the topic modeling procedure. The list of all stop words can be found in Supplementary data III. We performed all four steps using the python natural language analysis package Stanza. Next, we let BERTopic assign importance scores to each unique word within the topic representations based on the statistical metric class-based term frequencyinverse document frequency. Finally, we fine-tuned the model output using Maximal Marginal Relevance (MMR). Topics with substantial thematic overlap were merged. For the sake of simplicity, all topics that did not sufficiently contribute to the understanding of participants' subjective experience, because they exhibited an unusual degree of incoherence or unspecific mentions, were labeled as "noise" and manually merged into one single noise topic. Following Lezhnina, an interdisciplinary team (JTTS, EV, MS) combined qualitative and quantitative metrics to assess model outputs (see Table). We tried to strike a balance between meaningfulness of a topic and appreciation of experiential diversity. For the final outcomes, we inspected each topic individually and assigned a custom label which seemed to best capture the content of the assigned sentences. Labels were created using the Ollama-embedding of the large language reasoning model DeepSeek-R1and manually fine-tuned. We visualized the final topic representations using the Python WordCloud libraryand the built-in Intertopic Distance Map (IDM) functionality of BERTopic. An IDM is a simplified mapping of the total number of topics and visualizes the overlap or separation between all identified topics as well as each topic's size in a 2-dimensional vector space. The described procedure was first applied to the full text corpus comprising all 24 interview transcripts. Afterwards, we investigated the interview transcripts from the verum and the placebo group individually.

MANUAL INTERVIEW CODING

Two researchers (RG and DV) reviewed all 24 transcripts to prepare for manual text annotation. We utilized a hybrid coding approach, following a pre-defined coding system which was derived from established psychometric scales used in the main study and the team's researchinformed assumptions about psychedelic and meditative experiences. The initial code system comprised six main codes (see coding system in Supplementary data IV). Meanwhile, due to limited empirical evidence on psychedelic use in experienced meditators in the context of a meditation retreat, we allowed for the emergence of new codes during the coding process. We used sentence-based coding to avoid excessive loss of information and to maintain comparability with the results of the NLP analysis. We refined the code system iteratively, adding new codes as they arose from the data. A team consisting of the two coders and at least one mediator (JTTS or EV) reflected on the coding process. Among other things, the team evaluated the intercoder reliability, using the built-in advanced analysis functionalities of MAXQDA Analytics Pro. We carved out the codes and interviews, respectively, with the highest intercoder disagreement and had them recoded by both coders. This was repeated until we reached an overall intercoder agreement of about 90%, considering the codes used in each document. The manual coding of the interviews was performed independent from the results of the NLP analysis. The two researchers coding the interviews were not involved with the NLP analysis.

MANUAL QUANTITATIVE ANALYSIS

Upon completion of the coding process, JTTS performed a frequency-based thematic analysis of the interview transcripts. We focused our analysis on the following seven main codes, and their associated sub-codes: Psychometrics, Control, Sensations, Meaningfulness, Well-being/Life Satisfaction, Side effects and Valence. In addition, we included the study's setting factors, especially the code Placebo or Substance, in our analysis, which was informed by the bottom-up results of the NLP analysis. For each code and sub-code, we looked at total and co-appearances across both groups and performed a between-group comparison. As a last step, we compared the results from the manual to the outputs of the automated analysis. All findings and their possible implications were discussed in the research team.

NLP ANALYSIS

Descriptive statistics for each analysis are presented as either frequencies (in integers) or proportions (in %). Topic descriptions, sample quotes and topic prevalence across analyses are provided in Supplementary data V. For visualizations of topic frequency distributions see Supplementary data VI.

ACROSS-GROUP FINDINGS

The topic modeling analysis yielded 28 distinct topics plus one noise topic across all 23 interview transcripts. On average, participants referenced 23 outlier sentences (37.8%). We found a complex semantic landscape with both shared and divergent themes emerging across participant groups. Some topics showed significant semantic overlap, as indicated by two vertically arranged dense clusters of topics close to each other in the upperright corner of the Intertopic Distance Map (Fig.). This was supported by our subjective topic interpretability analysis which showed that a great share of topics had formed around subjective effects (82.1%). Meanwhile, the map showed four isles of topics -two in the upper-right and two in the lower-left corned -positioned further apart from the clusters, indicating that the procedure discovered semantically distinct and group-specific themes. This was also reflected in the large TD score.

BETWEEN-GROUP FINDINGS -VERUM

The topic modeling procedure resulted in 24 distinct topics plus one noise topic across the 1 interview transcripts from the verum group. We identified We found 20 shared topics between the outputs of the verum group and across-group analysis, indicating a significant thematic overlap. There were also clear parallels in the semantic topographies of the two outputs. Both showed two main thematic hubs addressing the prominent theme of subjective effects. Group-specific (across-group) or sub-themes of the dominant topic (verum-only) were covered by a few moderately prevalent outlier topics. Comparison of the Intertopic Distance Maps (IDM) reinforced this finding (see Fig.and Fig.). In both cases we found small distances within-hubs and greater separation from outlier topics. Clustering for verum group-only showed lower heterogeneity compared to the across-group

BETWEEN-GROUP FINDINGS -PLACEBO

The topic modeling analysis on the 9 interviews from the placebo group resulted in 11 distinct topics and one noise topic. We classified 7 acute and one reflective and integrative effect(s). (verum), 453 (across) and 107 (placebo). Another latent topic 'Placebo effect' stood out because it appeared in the placebo-only but not the verumonly analysis. An overview of all latent topics and their prevalence is provided in Table. Table. Latent topics automated topic modeling procedure. The table shows the custom label assigned to a given latent topic and its prevalence across the outputs of all three NLP Analyses. In brackets: Total number of coding units assigned to a given topic for a given form of analysis. All 24 interview transcripts were included in the manual analysis. The most frequently used top-level codes were 'Control', 'Mental Awareness' and 'Psychometrics', with global counts of 346, 312 and 164. Four new top-level codes arose from the inductive coding of the text data. Data-driven subcodes were iteratively added to the code system, resulting in a total of 39 (sub)codes. We refer to Supplementary data IV for an overview of the final code system, a description of, and sample text data for each of the codes.

DESCRIPTIVE ANALYSIS -DISCREPANCY VERUM VS. PLACEBO

The manual analysis yielded almost three times more coded sentences in the verum than in the placebo group -a respective total of 886 and 320 sentences. The bias was especially pronounced among codes representative of subjective effects; a) the four 'Psychometrics' sub-codes and b) sub-codes of the top-level code 'Sensations'. Tableshows absolute counts for all 9 sub-codes and illustrates the discrepancies in code appearances between the two experimental conditions. Table. Between-group comparison of main code 'Psychometry' and main code 'Sensations' from manual thematic analysis. The table shows the group-wise and the total counts for sub-codes of the two primary deductive codes 'Psychometry' and 'Sensations' used to code for (acute) subjective effects in participants' experience reports.). In brackets: Percentage contribution of each sub-code to the total number of Psychometrics codes within the placebo and verum condition. Percentages are normalized within condition. Awareness of mind and body ('Mindfulness') saliently stood out in reports from both groups (Table). Code Mental Awareness appeared in 96% of all interviews. The sense of increased mindfulness seemed heavily driven by an awareness of mental processes in the verum group, indicated by greater counts for sub-codes 'Visual' and 'Auditory'. Sub-code 'Emotion' was present in 85% of the interviews but 76.2% of all codes fell into the verum group. For 'Cognition', the effect was even more pronounced with a ratio of 39 / 189 counts (placebo / verum). Mental Awareness was followed by Bodily Awareness (92% of all interviews) with almost twice as many codes in the verum group.

ALTERED SENSORY PERCEPTION

Across all interview transcripts, 71% contained code Auditory and 54% code Visual. Total counts for 'Auditory' were more than twice as many in the verum condition, reinforcing the notion of a treatment effect (Table). Auditory effects ranged from a "perfect' or "pure" hearing experience to experiences of synaesthesia (e.g., seeing sounds or colors changing with the breath) and auditory pareidolia (imposing meaningful patterns on random sounds or noise). Similarly, visual effects seemed starkly more present for the verum group. Only 3 out of 38 codes fell in the placebo group (Table). The spectrum of experienced effects ranged from high sensitivity to light, visual synaesthesia and (pseudo-) hallucinations, to complex visual imagery with open and closed eyes.). In brackets: Percentage contribution of each sub-code to the total number of Sensations codes within the placebo and verum condition. Percentages are normalized within condition.

EMOTIONAL VALENCE AND AFFECT

We coded for positive and negative emotional valence in the manual analysis. The results showed 78 codes across 21 interviews for positive and experience under placebo and a dominance of acute subjective effects under DMT-harmine. Our findings support the growing promise of data-driven, multi-method approaches for investigating psychedelic subjective effects. In particular, we demonstrate the utility of NLP techniques to complement qualitative analysis methods. Despite the unsupervised nature of our topic modeling approach and the relatively small data set, the topic models effectively captured both the diversity and nuance of participants' subjective experiences when compared to the independent manual analysis. The analyses identified not only well-established acute subjective effects of psychedelics, and similar states induced by meditationbut context-specific and extra-pharmacological topics tied to the retreat setting. These results highlight the unique sensitivity of NLP methods to aspects of experience that often fall outside traditional theory-driven frameworks. Similar advances were recently reported by Bzdok et al., who used large language models (LLMs) to distinguish the experiential profiles of 30 psychoactive substances based on user reports. Relatedly, Noah and colleagues used an embedding model from OpenAI to uncover compound-specific variations in reported visual effects of psychedelics. Our results extend these insights to the setting of a controlled RCT and underscore the applicability of NLP even in smaller, structured datasets. Several topics including the high-ranked topics 'Acute phenomenology: Dynamic self-perception shifts and state transitions' (verum), 'Acute phenomenology: Somatic overload, emotional release, and regulation' (placebo) and 'Contrasts and similarities between ordinary and enhanced meditation experience' (across all three analyses), exhibited notable thematic heterogeneity. This was reflected by incoherent key word representations of these topics, a reduced within-topic coherence and a smaller between-topic diversity. Such outcomes are characteristic of unsupervised topic modeling approaches, which, while powerful for exploratory analyses, may produce less targeted outputs without domainspecific guidance. As a result, some topics were less interpretable, with diminished thematic clarity. The effect was least pronounced in the across- group analysis. Based on these findings, we recommend caution when applying NLP techniques, such as BERTopic, to small-sized data sets. In line with previous findings, we found that the effectiveness of NLP techniques strongly depends on both the quality and the size of the data sample. Second, our results highlight the value of combining a data-and a theorydriven analysis approach. While traditional qualitative methods are prone to subjectivity bias, this risk was mitigated by triangulating a manual coding process with NLP-based topic modeling. Both approaches independently revealed a strong group-effect for topic prevalence, particularly regarding code (topic) frequency and diversity of acute subjective effects. This was further supported by the high degree of thematic overlap between outputs of the automated across-group and the verum-group analysis. The convergence suggests that the observed group differences were not merely an artifact of investigator bias but likely reflect genuine experiential divergence between the two conditions. At the same time, the manual procedure enriched and corroborated the exploratory NLP analyses, including the prominence of spiritual jargon in both experimental groups and of contextual factors and their contributions to participants' subjective experiences in the placebo condition. These results illustrate the complementary strengths of manual and automated methods in capturing the complexity of psychedelic experiences. Our mixed-method approach uncovered a set of less recognized yet theoretically compelling themes. These latent topics, including themes like control, equanimity, impermanence or embodied wisdom, have largely remained underrepresented in mainstream psychedelic literature but gained increased attention in recent studies exploring the convergence of psychedelics and mindfulness practices. We use the term latent not merely in the statistical sense, but to highlight how these themes have remained hidden in plain sight: present in anecdotal data yet rarely foregrounded in empirical research. Their emergence in our data reflects the capacity of automated topic modeling to detect subtle, patterned regularities in language that traditional hypothesis-driven approaches alone may overlook. These findings broaden the interpretive scope of psychedelic science and invite further inquiry into underexplored dimensions of psychedelic experiences. A notable finding from the across-group NLP analysis was the high semantic similarity between reports from the placebo and the verum group, indicating that participants drew on similar vocabulary to describe their experience. This aligns with previous research highlighting similarities between psychedelic and meditative phenomenologyand pointing toward potential convergence between these modalities. Even more, the acute and post-acute subjective effects reported by placebo participants seemed to mirror those of the verum group, which corresponded to well-established features of psychedelic experiences. In support of these findings, Dikovskaya et al.'s natural language analysis revealed that reports from floating tank and meditation experiences most closely resembled those from 5-MeO-DMT, and N,N-DMT, and ketamine sessions, suggesting that these pharmacological methods of inducing altered states of consciousness are likely to produce subjective effects that are generalizable across ASCs, rather than being limited to drug-specific artifacts. Our quantitative investigation pointed towards a semantically homogenous conceptual framework for reports from both groups. Manual analysis revealed that many participants used Buddhist philosophy or spiritual concepts to describe and integrate their experiences; a pattern mirrored in NLP-derived topics such as 'Personal meaning and embodied insight', 'Transitions, fluctuations, and impermanence of experiential states' or 'Interactions between meditation and psychedelics'. Some participants stated how their experience helped them to truly understand and embody the teachings from their meditation tradition ("Two days later, I remembered the Heart Sutra and unpacked it. I really understood half of it." -participant 09; "And when I first experienced its effects, it was clear to me that this was the path to the heavens, but not where Buddha wanted to get to the root of suffering." -participant 05). Others used their tradition's jargon to express experiential differences ("Um, and I feel like my psychedelic experience wasn't about somehow deconstructing the experience, it wasn't about seeing in every sound, in every, in every sensory perception, the impermanence and how it ends and re-emerges and is not me." -participant 25; "So during normal meditation, I don't usually experience these strong energetic phenomena that I just demonstrated during the session." ), or how the two modalities seem to intersect, support or even interfere with one another ("And I feel that the meditative state, depending on the type of meditation, is probably the important thing, as it can be both beneficial and keep you firmly in the here and now." -participant 22; "That it may take several units to achieve certain effects that may be possible more quickly with psychedelics. This means that psychedelics could speed up processes that take longer with meditation." -participant 17). The finding is supported by previous research emphasizing how language under psychedelics serves as 'a window into the psychedelic mind', and how spiritual or philosophical frameworks can serve as conceptual containers for meaning-making and integration of psychedelic experiences. It has been repeatedly pointed out that the phenomenology of psychedelic experiences is profoundly shaped by cultural and religious beliefs and that individuals often connect their subjective effects to culturally specific or spiritual and religious interpretations. Given our RCT design and the participants' extensive meditation background in various Buddhist traditions, a contemplative lens seems to have naturally shaped their narratives regardless of group assignment. However, we should assume to find different themes when conducting the same study in a population with different religious backgrounds (e.g. no contemplative background at all or trained in other traditions, for example shamanic practices) or embedding it in a different cultural framework (e.g. performing the study in an African or South American country). Nonetheless, spiritual and philosophical language may have offered participants a linguistic bridge to articulate ineffable or non-ordinary states that otherwise lay beyond linguistic habits; awareness of such meaning-making processes may therefore be relevant in therapeutic settings. Despite strong similarities, we found notable experiential divergence between the two groups. In the verum group, 75% of topics from the automated analysis pertained to ASEs compared to 63.4% in the placebo group. Reported acute effects were often intense and exhibited great diversity, including perceptual, emotional, cognitive and often self-related domains. In contrast, placebo participants' subjective experiences showed less heterogeneity, effects were largely body-centric, and participants' reports focused on coping and containment than allowing and experiencing the effects. Moreover, two of the seven topics from the placebo-only analysis contained a substantial number of references to contextual factors. This imbalance was mirrored by differences in thematic landscapes: while placebo participants distributed attention across a broader range of distinct topics, verum participants' narratives converged on fewer, more dominant and related experiential themes. We interpret this as a kind of focusing effect, in which the vivid, intense, and sometimes overwhelming nature of the DMT-harmine experience foregrounded acute subjective effects in participants' awareness and language. This led to high thematic diversity within a relatively narrow conceptual domain. In contrast, placebo-group participants reported effects that were often familiar, accessible and modulated by their meditation experience, seldom entirely novel ("So all the elements that I know anyway, equanimity, focus, concentration, um, it also was much easier for me to meditate" -participant XY; "[…] That is the energetic phenomenon, which you are also familiar with, but to a lesser degree, from [your] meditation practice" -participant 16). 82.5% of all participants guessed their group correctly when asked in a survey after their study experience at the end of retreat day 2. Nonetheless, both the manual and the automated analysis highlighted a sensitivity to environmental factors under psychedelics, and other non-psychedelic psychoactive drugs, implying that such influences may have been equally active but less verbally articulated. Contextual factors have recently regained recognition as essential determinants of the intensity and quality of psychedelic experiences. Structured, supportive settings were found to be conducive to positive and therapeutically valuable experiences, while also buffeting against the negative impact of challenging experiences. Moreover, agents, but rather as context-sensitive catalysts whose therapeutic potential emerges through complex interactions between drug, the immediate and extended environment and participants mindsetLimitations The present study has several limitations that should be considered. Fourth, our manual text analysis was prone to potential investigator's bias. Although coders were not involved in NLP analyses, they were aware of experimental assignments, and verum interviews tended to be longer and more vivid. These factors may have influenced coding salience and perceived thematic prominence. Lastly, the retreat setting itself shaped the content and framing of participants' experiences. While this enhances ecological validity, it also makes it difficult to disentangle the influence of pharmacological from contextual factors. Thus, our findings should be viewed as exploratory and specific to this experimental context, rather than universally generalizable.

CONCLUSION

This study presents a first step to answer toward more qualitative, unbiased, and data-driven investigations of the subjective effects elicited by psychedelics. Transformer-based NLP methods (such as BERTopic) complement traditional qualitative methods as they offer a unique sensitivity to experiential facets that often fall outside traditional theory-driven frameworks. These methods can reduce the time and labor demands of conventional approaches substantially, even when applied to smaller, structured datasets. Our findings underscore the salient interrelation between extra-pharmacological factors and subjective effects, calling for dedicated studies on how contextual influences shape psychedelic experiences, and their potential therapeutic or well-being effects. Although the present results do not allow for conclusions regarding the therapeutic relevance of specific subjective effects, they do support the hypothesis that such experiences – whether induced by psychedelic substances or mindfulness-based modalities – may be central to their therapeutic and wellbeing potential. More broadly, our findings highlight the importance of embedding subjective experience more firmly within phenomenological frameworks and refining psychometric

Company

Product

Legal