"Say when" for AI in music

How far can a song lean on AI before it becomes unlistenable?

Elias Goodman

Nov 01, 2025

If you haven’t checked out the test from last week, give that a go first.

Experiment 01: Trust your Ear

Elias Goodman

October 21, 2025

Read full story

picture that - original track (Version A)

0:00

-1:48

On an overcast October afternoon, over Kona drafts and disappointing New York football, our collective gaze fell from the TVs to the table, to the phone in Alex’s hand.

What a way to enjoy retirement! A culturally relevant act by a nationally beloved figure. Everyone wants to be a DJ, even (especially) Obama.

I so badly wanted this to be real, but a pitch-black background for a DJing president would never be the case. Sadly, slop.

Our conversation shifted towards the slop we’re being fed by platforms originally meant to connect us.

How do you know when you’ve seen AI? What throws you off?

Alex (videographer/tri-linguist at Bloomberg) said something to the tune of an inconsistency in image clarity. Gordon (guitarist/producer/mRNA at Sanofi) echoed with “you know it when you see it.”

How about when you’ve heard AI?

A Bavarian pretzel and a puzzled silence arrived at the table. Alex and Gordon knew it was here, but hadn’t heard anything as obvious as what they’d seen. A collective shrug of shoulders signaled a non-resolution as we panned back to the Giants-Broncos game that had finally gotten interesting in the 4th quarter.

Beers finished, but questions loomed.

If our ears can’t reliably detect AI’s presence in music, what does that mean for trust in what we listen to?

And if a song attracts you but was shaped by a machine, does it matter, or does knowing its artificial influence change the value we assign to the music?

Is that music actually “slop”?

Perceived “slop” is as attitudinal as it is sensory. Several studies conducted across the arts conclude that adding an “AI-made” label depresses aesthetic ratings and authenticity, even when stimuli are otherwise indistinguishable. This is more obvious in the appearance of text and image, but it’s the same dance in music, where simply believing creative work is AI-generated can shift our evaluations of it. The point at which listeners say “this is too much AI” depends not only on sound but also on disclosure and expectation.

But music is tougher to decipher. Repeating what Gordon said: you know it when you see it with visual work. There, AI influence is more black-and-white; you observe a person move strangely, a blur of a logo, the em-dash we’ve become more sensitive to in writing. Once you see it, the work is written off as AI, and it has now been grouped as some inhuman “other”.

But with music, AI may have made its mark on composition, sound design, mixing, mastering, or performance.

And to make things harder, we are, at best, just okay at detecting synthetic audio. Our hearing is predictive and integrative; we group notes, tones, phrases, timbres over time. So rather than honing in on a single static instance, edits are smoothed out by our hearing systems. We effectively get in our own way. Classic results show listeners hear missing sounds when they’re masked and perceive interrupted tones or textures as continuous. If there’s an unnatural hiccup in a track, we’ll definitely notice it, but then it’s gone by the next few bars.

This mostly explains why small manipulations are easy for us to miss, and why AI influence on music exists on a more slippery spectrum than visual media.

Experiment 01: Setting the Stage

With Suno’s cover/remix feature, I took my original track and ran it through their audio engine, controlling the effect of AI-influence using the parameters provided. To bypass major stylistic influence, I was able to put a period in the lyrics section, which satisfied the program requirements for remixing (they’ve since fixed this).

I worked within the Advanced Options dropdown, where the sliders allowed adjustment to the level of influence of Suno’s model on my original track.

Weirdness: encourages less typical generations
Style Influence: how strongly the model enforces tagged style genres (bypassed this)
Audio Influence: how closely the output aligns with the sound of my original audio.

In the UI, higher = closer to original (0,0,100 ≈ minimal model sway)

With weirdness and style influence at 0 and audio influence at 100, Suno’s output should wholly maintain the sonic character and qualities of my original track.

This generated Version B in the experiment. All versions are below again for reference.

Version A (baseline) - my work