We Don't Have Earlids: The Science of Sonic Branding and Memory Encoding
- ErikWalterThompson
- 1 day ago
- 10 min read
By Erik W. Thompson, Founder & CEO, Walter Audio
You spent real money on that media placement. A TV buy timed to reach high-value households. A CTV campaign running across premium streaming inventory. A cinema spot in front of a captive audience.
And in the moment your ad played, a meaningful share of those viewers glanced down at their phone.
This is the dirty secret of the attention economy: the channel most brands have optimized for is quietly degrading. According to eMarketer, 82% of the US population will be second-screen users by 2027, and the sharpest growth isn't happening with Gen Z. It's with Gen X, the high-spending demographic that used to sit and watch television. The screen they were raised on has lost their undivided attention.
Most brands respond to this the way they respond to every media efficiency problem: they optimize the visual creative, they tighten the first three seconds, they A/B test the thumbnail. All of that is fine. None of it addresses the actual problem.
The actual problem is that you've been optimizing the wrong channel. Sound, the one most brands treat as an afterthought, is the only one that keeps working when the eyes are elsewhere.
The Two Pathways Into Memory
The brain processes sensory information through different systems, and those systems are not created equal when it comes to brand recall.
Iconic memory is the brain's visual sensory buffer: the system that holds what your eyes just saw for approximately 250–500 milliseconds before it fades. Echoic memory is its auditory equivalent: the system that holds what your ears just heard for two to four seconds before it decays. Sound persists in sensory memory six to eight times longer than visual information.
Visual information, what your customer sees when they look at your ad, enters through iconic memory. Iconic memory is fast and impressionistic, but it fades within 500 milliseconds. Half a second. Before the analytical brain has a chance to do anything with what it just saw, it's gone.
Auditory information operates differently. Sound enters through echoic memory, which persists for two to four seconds, six to eight times longer than its visual counterpart. That extended window gives sound more time to transfer into short-term memory, and from there into the long-term storage where brand associations actually live.

This isn't a marginal advantage. It's a structural biological edge that almost no brand is deliberately building into its strategy.
The gap gets even wider when emotion enters the equation. In July 2025, researchers at Rice University and UCLA published a study in the Journal of Neuroscience examining exactly how music affects memory consolidation. Their finding was precise: music-induced emotional arousal uniquely modulates memory consolidation, with moderate arousal producing the strongest effect on detailed recall, via the hippocampus and basolateral amygdala, the brain structures responsible for long-term memory formation. Critically, this effect was specific to music. Neutral sounds at similar arousal levels didn't produce the same result.
Translation: deliberately engineered brand audio doesn't just accompany an impression. At the right emotional level, it shapes what gets remembered about the entire surrounding experience.
When I present this finding to clients, it reframes the conversation immediately. Most brand teams think about sound as something that fills the silence in their ads. The Clark and Leal research suggests it's actually doing something far more consequential: it's influencing the memory architecture of everything happening around it. That's not a support role. That's a lead role most brands have never cast intentionally.
This is the mechanism behind what we call memory structures. Brands exist in the mind as networks of associations (sounds, images, feelings, moments) that fire together when a distinctive asset is encountered. Sound builds those structures faster and more durably than any visual asset because it routes directly through the brain's emotional core, often before the conscious mind has registered anything at all.
Sound is not a support channel for brand memory. It is the primary biological mechanism through which brand memory is built.
Professor Charles Spence of Oxford University's Crossmodal Research Laboratory put it plainly: "All perception is influenced by multiple senses. Even though it feels like we're seeing or hearing or feeling, in fact much more goes into our judgement and perception than we realise."
We Don't Have Earlids
Here is the part of the argument that should stop you cold.
You spend millions on ad placements. The moment your ad plays, your customer looks down at their phone. Their eyes are on Instagram. Their hands are on the remote. They are, by every visual metric, not watching.
But we don't have earlids.
The human auditory system has no off switch. Sound reaches the brain whether or not the listener is paying conscious attention, making audio the only brand channel that continues building memory when a customer's eyes are elsewhere.

This is why you can be deep in a conversation across a crowded room and still hear your name. Your brain was processing the ambient sound the entire time, just below the threshold of conscious awareness.
For brands, this is either a wasted asset or a compounding one. Every ad impression you buy has two channels running simultaneously: the visual your customer may or may not see, and the audio they cannot turn off. If the audio is random, unmemorable stock music, that impression builds nothing. The neural pathway fires and dies.
But if that audio is a consistent, deliberately engineered sonic signal, something that sounds unmistakably like your brand, built from your emotional DNA and validated against real consumer data, then every impression, seen or not, makes a deposit into the memory bank.
This is the principle Home Depot has been quietly executing for years. Ad after ad, across television, radio, podcast, and streaming, the same music. The same sonic signal. Consistent enough that when it went viral on TikTok, with users dancing to it and meme-ifying the audio, the reaction wasn't confusion. It was recognition. The memory structure was already there. The culture just surfaced it.
That's not an accident. That's what intentional sonic branding looks like at scale.
What Brands Are Actually Building (Without Knowing It)
I'm constantly having the same conversation with brand marketers, CMOs, and marketing leaders across the country. It starts the same way almost every time.
I ask if they've heard of sonic branding. Most say no, or maybe. They're aware of the concept abstractly, but don't think it applies to them in any meaningful way.
Then I ask them to hum the McDonald's jingle. Every single person can do it. I mention the Pillsbury Doughboy giggle. The Ricola yodel. The first four notes of the State Farm sonic logo, which started as a Barry Manilow-penned jingle and has been evolving, compressing, and hardwiring itself into American memory structures for over forty years.
Something shifts. They don't just recognize the assets. They start retrieving memories: family dinners, road trips, specific moments in childhood kitchens. The sound didn't just remind them of the brand. It pulled entire emotional episodes out of storage, moments that had been encoded alongside those sounds for years.
The memory structures were already there, built over decades of consistent, repeated exposure. Nobody built them deliberately. The brands that own those assets often don't fully understand what they have.
This is the gap. Most brands have accidentally built something with their audio. A few have intentionally built something extraordinary. The distance between those two outcomes is almost entirely a question of strategy.
Jenni Romaniuk of the Ehrenberg-Bass Institute argues in Building Distinctive Brand Assets that memory is in a constant state of decline. Every memory your brand has built is eroding. Not slowly. Continuously. What keeps it alive is consistent, repeated exposure. The regular refresh of the neural pathway that keeps your brand top of mind when your customer is standing in the aisle ready to buy.
Nothing refreshes a memory structure more efficiently than sound. A two-second sonic logo in a radio ad your customer only half-heard is still a deposit. Their eyes were elsewhere. Their ears were not.
The Question Most Brands Have Never Asked
Here is what the consumer data reveals when you test sonic assets the way we do, running assets through 1,200+ responses across multiple variants, measuring appeal, brand attribute alignment, recall, and emotional resonance with real consumers:
The musical bed alone, without any brand identifier, can score impressively on appeal. In a recent engagement we ran through SoundOut's methodology, the standalone audio version of a CPG sonic logo scored close to the "Strong" benchmark on consumer appeal out of the gate.
But here's where it gets instructive: when we measured BrandMatch™, the emotional alignment between the music's personality and the brand's personality, the audio-only version landed at the 52nd percentile. Essentially average. The music was pleasant. It was not distinctively theirs.
The moment the brand name was integrated, through visual, through audio, or both, BrandMatch™ jumped to the 82nd percentile. Excellent. The same underlying music, anchored to the brand it was built for, performing at an entirely different level.
The attribution data from SoundOut's broader research base tells the same story in starker terms: sonic logos without the brand name embedded in them show 5% actual attribution. With the brand name: 46%. A 9x lift in actual attribution from one strategic decision.
Most brands and most creative directors choosing music on feel don't know this data exists. They're optimizing for what sounds right to them in the room. That's one person's taste standing in for an entire target audience. Walter Audio's process solves for exactly this gap. Before a single note is composed, our SONICfoundation methodology uses SoundOut's OnBrand AI model, trained on half a million consumer tests, to identify the musical territory that mathematically aligns with a brand's emotional DNA. The LIMBICprofiler then brings the brand's own stakeholders into the selection process. And consumer testing validates the final assets before they go into the world.
The result is a sonic identity that isn't built on instinct. It's built on evidence. And it's built to compound.
Sonic Brand Cues Outperform Everything Else
According to Ipsos research, sonic brand cues generate 8.53x more branded attention than the next best performing distinctive asset (characters at 6.01x) across all media. The assets that get the vast majority of a brand's attention budget (logos, fonts, brand colors) land at 1.17x, 12%, and negative 33% respectively.

System 1's research on short-form video advertising found that when a sonic brand cue appears in the first two seconds of a video ad, brand awareness lifts by 191% compared to video without sonic branding. Not 19%. 191%.
And yet Ipsos data shows sonic brand cues are deployed in only 6% of brand communications. The highest-performing distinctive asset, used least. That gap is not a creative problem. It's a strategic blind spot, and it's costing brands every time their ad plays to an audience that isn't watching.
Sonic brand cues deliver 8.53x more branded attention than any other distinctive asset, yet appear in only 6% of brand communications. No other asset class has a wider gap between performance and deployment. (Source: Ipsos, The Power of You, 2020)
The brands that close that gap first, that stop treating audio as a production line item to be briefed out to whoever is making the campaign this quarter, will build memory structures their competitors cannot replicate. Because owned sonic IP, unlike a logo refresh or a new campaign direction, compounds. Every impression, heard or unheard, adds another layer. Another deposit. Another neural connection between your brand and the emotion you're building toward.
The Question Worth Asking Before Your Next Campaign
What happens to our brand when our customer's eyes are on their phone?
If your audio is random stock music chosen by a creative director on deadline, the answer is: nothing. The impression fires and dies. The memory structure doesn't refresh.
If your audio is a data-backed, consumer-validated sonic signal built from your brand's emotional DNA, deployed consistently across every touchpoint, owned by you, hardwired into the subconscious over time, the answer is different.
The impression still fires. The eyes are still elsewhere. But the memory structure refreshes. The neural pathway strengthens. The brand gets a little more impossible to forget.
That's not magic. That's sensory architecture. And most brands haven't built it yet.
Frequently Asked Questions About Sonic Branding and Memory
What is sonic branding? Sonic branding is the strategic practice of developing and deploying consistent, owned audio assets (including sonic logos, brand anthems, and UI sounds) to build brand recognition and memory structures in the consumer's mind. Unlike visual branding, sonic branding works through the auditory system, which processes sound subconsciously and encodes it directly in the brain's emotional memory centers, regardless of whether the listener is actively paying attention.
Why does sound build stronger brand memories than visuals? Sound persists in sensory memory (echoic memory) for two to four seconds, compared to 250–500 milliseconds for visual information (iconic memory). This six-to-eight-times advantage gives audio more time to transfer into long-term memory. Research published in the Journal of Neuroscience (Clark & Leal, 2025) found that music-induced emotional arousal uniquely modulates memory consolidation via the hippocampus and amygdala, an effect not replicated by neutral sounds or visual stimuli.
What is a sonic logo, and how does it work? A sonic logo is a short audio mnemonic, typically two to five seconds, engineered to trigger instant brand recognition. It functions like a visual logo but through the auditory system: consistent repetition across media touchpoints gradually hardwires the sound-to-brand association into the listener's subconscious memory. According to SoundOut research, sonic logos that include the brand name generate 46% actual attribution from listeners, compared to 5% for sonic logos without the brand name.
How do sonic brand cues perform compared to other brand assets? According to Ipsos research (The Power of You, 2020), sonic brand cues generate 8.53x more branded attention than any other distinctive brand asset. In short-form video advertising, sonic cues placed in the first two seconds of an ad produce a 191% lift in brand awareness compared to ads without branded audio (System 1, The Long and Short(form) of It, 2023). Despite this performance advantage, sonic brand cues appear in only 6% of brand communications, a significant deployment gap.
What is the Audio Identity Gap? The Audio Identity Gap is the strategic disconnect between the visual consistency brands enforce (consistent logos, colors, typography across all touchpoints) and the complete absence of intentional audio strategy. Most brands demand pixel-perfect visual consistency while allowing entirely random, unmanaged stock music across their media mix, a condition Walter Audio calls Sonic Schizophrenia. Closing the Audio Identity Gap means developing owned sonic assets that are deployed as consistently as visual brand elements.
How does Walter Audio build a sonic identity? Walter Audio's process begins with the SONICfoundation, a consumer-validated strategy phase that maps a brand's emotional DNA using SoundOut's OnBrand AI model (trained on over 500,000 consumer tests) and the LIMBICprofiler stakeholder voting platform. This phase produces three SONIC MOODboards: curated audio collages that define the brand's sonic direction. All final assets are tested on real consumers before deployment to ensure they resonate with the right audience at the right emotional register.
Walter Audio builds hi-fidelity branded audio ecosystems that hardwire brands into the subconscious. Our process begins with consumer-validated strategy and ends with assets built to compound. Heard or unheard, every impression counts.
Ready to close your Audio Identity Gap? Talk to the CEO →
Sources
Clark, K.R. & Leal, S.L. (2025). Fine-Tuning the Details: Post-encoding Music Differentially Impacts General and Detailed Memory. Journal of Neuroscience, 45(31). https://doi.org/10.1523/JNEUROSCI.0158-25.2025
eMarketer (March 2026). Second-screen viewing goes mainstream, raising ad recall risks. https://www.emarketer.com/content/second-screen-viewing-goes-mainstream-raising-ad-recall-risks
Ipsos (2020). The Power of You. Branded attention and asset deployment data. https://www.ipsos.com/en/power-you-why-distinctive-brand-assets-are-driving-force-creative-effectiveness
Romaniuk, J. (2018). Building Distinctive Brand Assets. Oxford University Press. https://global.oup.com/academic/product/building-distinctive-brand-assets-9780190311506
SoundOut (2025). Sonic Branding Effectiveness Data. SoundOut Index / BrandMatch methodology.
System 1 (2024). The Long and the Short(form) of It. Brand awareness lift data. https://system1group.com/creative-effectiveness-tiktok
Spence, C. & Keller, S. (2024). Psychology & Marketing. https://psycnet.apa.org/record/2024-69013-001

