Method

This page describes the two-level analysis architecture in WhatYouSay.

1) Overview

  • WhatYouSay uses a two-level analysis pipeline for exported text conversations.
  • Level A produces deterministic, reproducible metrics from raw text.
  • Level B generates a narrative interpretation via a large language model, grounded in Level A metrics.
  • The levels are separated by design: Level A is independently verifiable; Level B is interpretive and should be read as pattern description, not diagnosis.

2) Level A: Lexicon-Based Analysis

Input: raw .txt file parsed into individual messages attributed to speakers.

Metrics produced:

  • Message counts, character counts, and estimated word counts per speaker.
  • Speaker participation distribution (percentage of corpus).
  • Average message length (words per message).
  • Question rate (percentage of messages containing question marks).
  • Emoji usage rate.
  • Overall tone valence (composite score).
  • Emotion distribution via NRC Word-Emotion Association Lexicon (Mohammad and Turney, 2013).
  • Moral/value framing via LibertyMFD (Araque et al., 2022) and MoralStrength (Araque et al., 2019).

Reliability thresholds:

  • Individual analysis is most reliable with at least 250 messages from the selected user.
  • Group comparisons require at least 5% contribution to total corpus.
  • Below these thresholds, outputs are flagged as potentially noisy or incomplete.

All Level A processing is deterministic: identical input yields identical Level A metrics.

3) Level B: Narrative Synthesis

Level B uses a large language model to generate a reflective narrative summary of the selected user's communication patterns in group context.

Input to Level B includes Level A metrics plus contextual anonymised excerpts from the conversation.

Output sections include:

  • Your role in the group (in context).
  • Emotional tone and pressure points.
  • Ideas, complexity, and conversational style.
  • Values and moral framing (light touch).
  • Practical upgrades (small, doable, non-cringy).
  • Closing note.

Level B is interpretive, not deterministic. Re-running may produce different wording, while core pattern anchors remain tied to Level A data.

All speaker names except the user handle are anonymised in Level B output.

4) What this is not

  • WhatYouSay is not a clinical, psychiatric, or diagnostic tool.
  • It does not assess personality, mental health, intelligence, or character.
  • Outputs describe language-use patterns and do not evaluate individuals.
  • The Practical upgrades section offers observational suggestions, not therapeutic advice.

5) References

  • Mohammad, S. M., and Turney, P. D. (2013). Crowdsourcing a Word-Emotion Association Lexicon. NRC Emotion Lexicon resource.
  • Mohammad, S. M., and Turney, P. D. (2010). Emotions Evoked by Common Words and Phrases. Resource page and references.
  • Araque, O., Gatti, L., and Kalimeri, K. (2022). LibertyMFD: A Lexicon to Assess the Moral Foundation of Liberty. DOI: 10.1145/3524458.3547264.
  • Araque, O., Gatti, L., and Kalimeri, K. (2019). MoralStrength. arXiv:1904.08314.
  • Vasse, BR. (2026). Measuring Within-Person Variation in Written Communication Patterns Across Social Contexts. Zenodo:18890804.