#+TITLE: Experiments with WebAudio
#+DATE: <2016-05-03>
#+NOCOMMENTS: true
#+MACRO: button @@html:<button onclick="$2">$1</button>@@
#+COMMENT: unfortunately the macro can't be used when the function call has more than one parameter, because the comma is parsed for the macro

I read the [[https://www.html5rocks.com/en/tutorials/webaudio/intro/][HTML5Rocks article]] and made some notes for myself. I also read parts of [[https://webaudio.github.io/web-audio-api/][the spec]]. I have yet to read [[https://padenot.github.io/web-audio-perf/][this guide to performance]].

An =AudioContext= is the entry point into everything. You use it to create /sound sources/, connect them to the /sound destination/ (speakers), and play the sounds.

* Simplest example
:PROPERTIES:
:CUSTOM_ID: simplest-example
:END:

It's node based. The sound source is a node. You /connect/ it to a sound destination like the speakers. If you instead connect the sound source to a =GainNode= and then connect that to the speakers, you can control the volume. A =BiquadFilterNode= does low/high/band pass filters.

#+begin_src js :tangle yes
var context = new AudioContext();

function tone() {
    var osc440 = context.createOscillator();
    osc440.frequency.value = 440;
    osc440.connect(context.destination);
    osc440.start();
    osc440.stop(context.currentTime + 2);
}
#+end_src

{{{button(Run,tone())}}}

* Timed start/stop
:PROPERTIES:
:CUSTOM_ID: timed-start-stop
:END:

When playing a sound you can tell it /what time to start/ (or stop). This way you can schedule several sounds in sequence without having to hook into a timer for each one.

** Serial
:PROPERTIES:
:CUSTOM_ID: serial
:END:

#+begin_src js :tangle yes
function two_tones() {
    var osc440 = context.createOscillator();
    osc440.frequency.value = 440;
    osc440.connect(context.destination);
    osc440.start();
    osc440.stop(context.currentTime + 1 /* seconds */);

    var osc880 = context.createOscillator();
    osc880.frequency.value = 880;
    osc880.connect(context.destination);
    osc880.start(context.currentTime + 1.1);
    osc880.stop(context.currentTime + 2.1);
}
#+end_src

{{{button(Run,two_tones())}}}

** Parallel - telephone tones
:PROPERTIES:
:CUSTOM_ID: parallel-telephone-tones
:END:

I was hoping to use the same oscillator and start/stop it repeatedly, but an oscillator can only be started once, so I created multiple oscillators. An alternative structure, sketched after the code below, would be to hook both oscillators up to a gain node, use a custom piecewise curve to switch the gain between 0 and 1, and then use a =setTimeout= to stop the oscillators at the very end.

#+begin_src js :tangle yes
function touch_tone(freqs, time_on, time_off) {
    var T = context.currentTime;
    var total_time = time_on + time_off;
    for (var i = 0; i < 4; i++) {
        var t = T + i * total_time;
        freqs.forEach(function(freq) {
            var osc = context.createOscillator();
            osc.frequency.value = freq;
            osc.connect(context.destination);
            osc.start(t);
            osc.stop(t + time_on);
        });
    }
}
#+end_src
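Here's a minimal sketch of that alternative. The function name and the 5 ms ramp times are my own choices; the oscillators run the whole time and a piecewise linear gain curve gates them on and off:

#+begin_src js
// Sketch: gate two continuously-running oscillators with a gain curve
// instead of creating fresh oscillators for every beep.
function touch_tone_gated(freqs, time_on, time_off) {
    var T = context.currentTime;
    var gate = context.createGain();
    gate.gain.setValueAtTime(0, T);
    for (var i = 0; i < 4; i++) {
        var t = T + i * (time_on + time_off);
        // piecewise linear curve: quick ramp up, hold, quick ramp down
        gate.gain.linearRampToValueAtTime(0, t);
        gate.gain.linearRampToValueAtTime(1, t + 0.005);
        gate.gain.linearRampToValueAtTime(1, t + time_on - 0.005);
        gate.gain.linearRampToValueAtTime(0, t + time_on);
    }
    gate.connect(context.destination);
    var oscs = freqs.map(function(freq) {
        var osc = context.createOscillator();
        osc.frequency.value = freq;
        osc.connect(gate);
        osc.start(T);
        return osc;
    });
    // clean up: stop the oscillators after the last beep
    setTimeout(function() {
        oscs.forEach(function(osc) { osc.stop(); });
    }, 4 * (time_on + time_off) * 1000);
}
#+end_src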
There are some other frequency combinations and timings listed [[https://physics.info/beats/][here]].

* Parameter control
:PROPERTIES:
:CUSTOM_ID: parameter-control
:END:

Parameters like =frequency= are objects with a =value= field that you can read/write, and also functions to [[https://www.w3.org/TR/webaudio/#AudioParam][set that value programmatically]], using linear or exponential curves, or a custom piecewise linear curve. I needed to call the ramp function (=exponentialRampToValueAtTime= here) twice, once to set the beginning point and once to set the end point:

#+begin_src js :tangle yes
function tone_rising() {
    var T = context.currentTime;
    var osc = context.createOscillator();
    osc.type = "square";
    osc.frequency.exponentialRampToValueAtTime(440, T);
    osc.frequency.exponentialRampToValueAtTime(880, T + 2);
    osc.connect(context.destination);
    osc.start();
    osc.stop(T + 4);
}
#+end_src

{{{button(Run,tone_rising())}}}
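The custom piecewise curve option is =setValueCurveAtTime=, which takes an array of values and spreads them evenly across a duration, interpolating between them. A hypothetical example, not one of the demos on this page:

#+begin_src js
// Sketch: frequency follows a custom piecewise curve over 2 seconds.
// The curve points are spread evenly over the duration.
function tone_curve() {
    var T = context.currentTime;
    var osc = context.createOscillator();
    var curve = new Float32Array([440, 880, 440, 660, 440]);
    osc.frequency.setValueCurveAtTime(curve, T, 2);
    osc.connect(context.destination);
    osc.start();
    osc.stop(T + 2);
}
#+end_src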
* ADSR
:PROPERTIES:
:CUSTOM_ID: adsr
:END:

I used the =linearRampToValueAtTime= function to make ADSR (attack-decay-sustain-release) envelopes, which are just piecewise linear curves of volume levels:

#+begin_src js :tangle yes
function adsr(T, a, d, s, r, sustain) {
    var gain = context.createGain();
    function set(v, t) { gain.gain.linearRampToValueAtTime(v, T + t); }
    set(0.0, -T);             // anchor the curve at time 0
    set(0.0, 0);
    set(1.0, a);              // attack
    set(sustain, a + d);      // decay to the sustain level
    set(sustain, a + d + s);  // hold
    set(0.0, a + d + s + r);  // release
    return gain;
}

function note(freq, offset) {
    var T = context.currentTime;
    var env = {a: 0.1, d: 0.2, s: 0.4, r: 0.2, sustain: 0.2};
    var osc = context.createOscillator();
    osc.type = "sawtooth";
    osc.frequency.value = freq;
    var gain = adsr(T + offset, env.a, env.d, env.s, env.r, env.sustain);
    osc.connect(gain);
    gain.connect(context.destination);
    osc.start(T + offset);
    osc.stop(T + offset + env.a + env.d + env.s + env.r + 3);
}

function knock(freq, offset) {
    var T = context.currentTime;
    var env = {a: 0.025, d: 0.025, s: 0.025, r: 0.025, sustain: 0.7};
    var osc = context.createOscillator();
    osc.frequency.value = freq;
    var gain = adsr(T + offset, env.a, env.d, env.s, env.r, env.sustain);
    osc.connect(gain);
    gain.connect(context.destination);
    osc.start(T + offset);
    osc.stop(T + offset + env.a + env.d + env.s + env.r + 3);
}

function tweet(freq, offset) {
    var T = context.currentTime;
    var gain = adsr(T + offset + 0.03, 0.01, 0.08, 0, 0, 0);
    var osc = context.createOscillator();
    osc.frequency.value = freq;
    osc.frequency.setValueAtTime(freq, T + offset);
    osc.frequency.exponentialRampToValueAtTime(freq * 2, T + offset + 0.1);
    osc.connect(gain);
    gain.connect(context.destination);
    osc.start();
    osc.stop(T + offset + 0.15);
}

function tweets() {
    for (var i = 0; i < 10; i++) {
        tweet(1000 * (1 + 2*Math.random()), i*0.2);
    }
}
#+end_src

#+begin_src js :tangle yes :exports none
function aliens() {
    note(784, 0);
    note(880, 0.8);
    note(698, 1.6);
    note(349, 2.4);
    note(523, 3.2);
}
#+end_src

{{{button(Tweets,tweets())}}} {{{button(Aliens,aliens())}}}
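I didn't wire up a button for =knock= above; a hypothetical wrapper in the same style as =tweets()=:

#+begin_src js
// Hypothetical wrapper for knock(), analogous to tweets();
// the frequency and spacing are guesses
function knocks() {
    for (var i = 0; i < 3; i++) {
        knock(80 + 10 * Math.random(), i * 0.4);
    }
}
#+end_src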
The Tweets example was supposed to be bubbles from [[https://mitpress.mit.edu/9780262014410/designing-sound/][this book]] but it ended up sounding like tweets instead.

John L tells me that brass instruments have more frequency modulation during the 10-50ms attack, and that it tapers off during the sustain. It's /not/ the simple model where the envelope shapes only the gain of the waveform.
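A sketch of that idea, with guessed numbers: modulate =detune= strongly for the first ~30 ms, then taper off into the sustain. It reuses the =adsr= helper from above.

#+begin_src js
// Sketch of John L's description: pitch modulation that's strong during
// the attack and tapers off in the sustain (all values are guesses)
function brassy(freq) {
    var T = context.currentTime;
    var osc = context.createOscillator();
    osc.type = "sawtooth";
    osc.frequency.value = freq;
    var lfo = context.createOscillator();            // modulates the pitch
    lfo.frequency.value = 30;
    var depth = context.createGain();                // modulation depth, in cents
    depth.gain.setValueAtTime(50, T);                // strong during the attack
    depth.gain.linearRampToValueAtTime(5, T + 0.3);  // tapers in the sustain
    lfo.connect(depth);
    depth.connect(osc.detune);
    var gain = adsr(T, 0.03, 0.1, 0.5, 0.2, 0.4);
    osc.connect(gain);
    gain.connect(context.destination);
    osc.start(T);
    lfo.start(T);
    osc.stop(T + 1);
    lfo.stop(T + 1);
}
#+end_src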
* Simpler envelope
:PROPERTIES:
:CUSTOM_ID: simpler-envelope
:END:

On [[http://outputchannel.com/post/tr-808-cowbell-web-audio/][this page]] there's a simpler envelope: a single =exponentialRampToValueAtTime= that decays toward zero. (An exponential ramp can't reach zero exactly, so the code stops at 0.01.)

#+begin_src js :tangle yes
function cowbell() {
    var T = context.currentTime;
    var osc1 = context.createOscillator();
    osc1.type = "square";
    osc1.frequency.value = 800;
    var osc2 = context.createOscillator();
    osc2.type = "square";
    osc2.frequency.value = 540;

    var gain = context.createGain();
    osc1.connect(gain);
    osc2.connect(gain);
    gain.gain.setValueAtTime(0.5, T);
    gain.gain.exponentialRampToValueAtTime(0.01, T + 1.0);

    var filter = context.createBiquadFilter();
    filter.type = "bandpass";
    filter.frequency.value = 800;
    gain.connect(filter);
    filter.connect(context.destination);

    osc1.start(T);
    osc2.start(T);
    osc1.stop(T + 1.1);
    osc2.stop(T + 1.1);
}
#+end_src

{{{button(Run,cowbell())}}}

* Modulating parameters
:PROPERTIES:
:CUSTOM_ID: modulating-parameters
:END:

We can vary the volume (tremolo) or the frequency (vibrato) by connecting an oscillator to another node's parameter.

** Tremolo
:PROPERTIES:
:CUSTOM_ID: tremolo
:END:

We want to vary the gain (volume) with an oscillator: set the output to =(k1 + k2*sin(freq2*t)) * sin(freq1*t)=. WebAudio doesn't allow writing that expression directly, but we can compose it:

- A parameter's value is the *sum* of its =value= field and everything connected to it. Set =gain.value= to =k1= and connect the varying part using =.connect()=.
- The varying part has to go through another gain node to be scaled by =k2=.
- The varying part's oscillator runs at frequency =freq2=.

So the graph would be

#+begin_src dot :cmd dot :file webaudio-tremolo.png :exports results
digraph {
    rankdir=LR;
    node [fontname=Avenir, fontsize=9, shape=rect, style=filled, color="#aaaaaa", fillcolor="#eeeeee"];
    edge [fontname=Avenir, fontsize=8];
    k1 -> gain2 [label="gain"];
    osc2 -> gain1;
    gain1 -> gain2 [label="gain"];
    gain2 -> output;
    osc1 -> gain2;
    osc1 [label="osc(freq1)"];
    osc2 [label="osc(freq2)"];
    gain1 [label="gain(k2)"];
    gain2 [label="gain(+)"];
    k1 [label="const(k1)"];
}
#+end_src

#+results:
[[file:webaudio-tremolo.png]]

#+begin_src js :tangle yes
function tremolo() {
    var k1 = 0.3, k2 = 0.1;
    var T = context.currentTime;
    var osc2 = context.createOscillator();  // modulator, freq2
    osc2.frequency.value = 10;
    var osc1 = context.createOscillator();  // carrier, freq1
    osc1.frequency.value = 440;

    var gain1 = context.createGain();       // gain(k2): scales the modulator
    gain1.gain.value = k2;
    osc2.connect(gain1);

    var gain2 = context.createGain();       // gain(+)
    gain2.gain.value = k1;                  // plays the role of const(k1)
    gain1.connect(gain2.gain);
    osc1.connect(gain2);
    gain2.connect(context.destination);

    osc1.start(T);
    osc2.start(T);
    osc1.stop(T + 3);
    osc2.stop(T + 3);
}
#+end_src

{{{button(Run,tremolo())}}}
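In the code above, =gain2.gain.value= plays the role of the diagram's =const(k1)= node, because a parameter's own value sums with its connected inputs. To build the diagram literally you can use =ConstantSourceNode=, which was added to the API after I wrote this page; a sketch:

#+begin_src js
// Same tremolo graph, with an explicit const(k1) node
// (ConstantSourceNode arrived in browsers after this page was written)
function tremolo_const() {
    var k1 = 0.3, k2 = 0.1;
    var T = context.currentTime;
    var osc1 = context.createOscillator();   // carrier
    osc1.frequency.value = 440;
    var osc2 = context.createOscillator();   // modulator
    osc2.frequency.value = 10;
    var gain1 = context.createGain();        // gain(k2)
    gain1.gain.value = k2;
    var gain2 = context.createGain();        // gain(+)
    gain2.gain.value = 0;                    // all of the gain comes from inputs
    var k1node = context.createConstantSource();
    k1node.offset.value = k1;                // const(k1)
    osc2.connect(gain1);
    gain1.connect(gain2.gain);
    k1node.connect(gain2.gain);
    osc1.connect(gain2);
    gain2.connect(context.destination);
    osc1.start(T); osc2.start(T); k1node.start(T);
    osc1.stop(T + 3); osc2.stop(T + 3); k1node.stop(T + 3);
}
#+end_src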
** Vibrato
:PROPERTIES:
:CUSTOM_ID: vibrato
:END:

An oscillator connected to the pitch (the =detune= parameter): =sin(freq2*t + k*sin(freq1*t))=. Since =detune= is measured in cents, =k= = 100 swings the pitch by a semitone either way.

#+begin_src dot :cmd dot :file webaudio-vibrato.png :exports results
digraph {
    rankdir=LR;
    node [fontname=Avenir, fontsize=9, shape=rect, style=filled, color="#aaaaaa", fillcolor="#eeeeee"];
    edge [fontname=Avenir, fontsize=8];
    osc1 -> gain;
    gain -> osc2 [label="detune"];
    osc2 -> output;
    gain [label="gain(k)"];
    osc1 [label="osc(freq1)"];
    osc2 [label="osc(freq2)"];
}
#+end_src

#+results:
[[file:webaudio-vibrato.png]]

#+begin_src js :tangle yes
function vibrato() {
    var T = context.currentTime;
    var osc1 = context.createOscillator();
    osc1.frequency.value = 10;
    var osc2 = context.createOscillator();
    osc2.frequency.value = 440;
    var gain = context.createGain();
    gain.gain.value = 100.0;
    osc1.connect(gain);
    gain.connect(osc2.detune);
    osc2.connect(context.destination);
    osc1.start(T);
    osc2.start(T);
    osc1.stop(T + 3);
    osc2.stop(T + 3);
}
#+end_src

{{{button(Run,vibrato())}}}
* Fill AudioBuffer directly
:PROPERTIES:
:CUSTOM_ID: fill-audiobuffer-directly
:END:

An =AudioBuffer= is normally loaded by decoding a WAV/MP3/OGG file, but I can fill it directly:

#+begin_src js :tangle yes
function make_buffer(fill, env) {
    var count = context.sampleRate * 2;  // 2 seconds of samples
    var buffer = context.createBuffer(1, count, context.sampleRate);
    var data = buffer.getChannelData(0 /* channel */);
    var state = {};  // lets the fill function carry state between samples
    for (var i = 0; i < count; i++) {
        var t = i / context.sampleRate;
        data[i] = fill(t, env, state);
    }
    var source = context.createBufferSource();
    source.buffer = buffer;
    return source;
}

function fill_thump(t, env, state) {
    var frequency = 60;
    // pow() makes the phase grow sublinearly, so the pitch drops over time
    return Math.sin(frequency * Math.PI * 2 * Math.pow(t, env.s));
}

function fill_snare(t, env, state) {
    var prev_random = state.prev_random || 0;
    var next_random = Math.random() * 2 - 1;
    var curr_random = (prev_random + next_random) / 2;
    state.prev_random = next_random;
    return Math.sin(120 * Math.pow(t, 0.05) * 2 * Math.PI)
        + 0.5 * curr_random;
}

function fill_hihat(t, env, state) {
    var prev_random = state.prev_random || 0;
    var next_random = Math.random() * 2 - 1;
    var curr = (3*next_random - prev_random) / 2;
    state.prev_random = next_random;
    return curr;
}

function drum(fill, env) {
    var source = make_buffer(fill, env);
    var gain = adsr(context.currentTime, env.a, env.d, env.s, env.r, env.sustain);
    source.connect(gain);
    gain.connect(context.destination);
    source.start();
}
#+end_src
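The buttons for this section called =drum= with specific envelopes; here are hypothetical values (my guesses, not the originals):

#+begin_src js
// Hypothetical envelope values for the three drums (guesses, not the
// values behind the original buttons). Note that fill_thump also reads
// env.s as the exponent for its pitch drop.
function demo_drums() {
    drum(fill_thump, {a: 0.005, d: 0.05, s: 0.1, r: 0.1, sustain: 0.3});
    setTimeout(function() {
        drum(fill_snare, {a: 0.005, d: 0.05, s: 0.05, r: 0.1, sustain: 0.3});
    }, 400);
    setTimeout(function() {
        drum(fill_hihat, {a: 0.005, d: 0.03, s: 0.02, r: 0.05, sustain: 0.2});
    }, 800);
}
#+end_src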
The drum functions are from [[https://tinyrave.com/tracks/2-car-seat-drum-kit][this code]].

** Wave shaper
:PROPERTIES:
:CUSTOM_ID: wave-shaper
:END:

Instead of making a buffer for a given function, use =createWaveShaper()= and give it a shape. *This would especially be useful for 0-1 phasors* by using =[0,1]= as the shape. But it can also be useful for /clipping/ a signal that exceeds the -1 to +1 range, by using =[-1,+1]= as the shape. (No demo - just a note to myself, but a sketch of the clipping idea follows.)
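The clipping version is only a few lines (a sketch, not a demo wired to this page):

#+begin_src js
// Sketch of the clipping note above: with the two-point curve [-1, +1],
// inputs inside -1..+1 pass through unchanged (linear interpolation
// between the endpoints) and inputs outside that range are clamped.
function make_clipper() {
    var shaper = context.createWaveShaper();
    shaper.curve = new Float32Array([-1, 1]);
    return shaper;
}
#+end_src

A hot signal could then be routed =gain.connect(clipper); clipper.connect(context.destination);= instead of straight into the destination.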
* Switch sounds
:PROPERTIES:
:CUSTOM_ID: switch-sounds
:END:

#+begin_src js :tangle yes
function start_switch(offset, a, d, freq) {
    var T = context.currentTime + offset;
    var noise = make_buffer(fill_hihat, {a: a, d: d, s: 0, r: 0, sustain: 0});
    var gain1 = adsr(T, a, d, 0, 0, 0);
    noise.connect(gain1);

    var filter1 = context.createBiquadFilter();
    filter1.type = "bandpass";
    filter1.frequency.value = freq;
    filter1.Q.value = 12;
    gain1.connect(filter1);

    var delay = context.createDelay();
    delay.delayTime.value = 0.050;
    filter1.connect(delay);

    // feedback loop: delay -> filter2 -> gain2 -> back into delay
    var filter2 = context.createBiquadFilter();
    filter2.type = "bandpass";
    filter2.frequency.value = 700;
    filter2.Q.value = 3;
    delay.connect(filter2);
    var gain2 = context.createGain();
    gain2.gain.value = 0.01;
    filter2.connect(gain2);
    gain2.connect(delay);

    delay.connect(context.destination);
    noise.start(T);
    noise.stop(T + 0.1);
}

function multi_switch() {
    start_switch(0, 0.001, 0.020, 5000);
    start_switch(0.030, 0.001, 0.050, 3000);
    start_switch(0.140, 0.001, 0.020, 4000);
    start_switch(0.150, 0.001, 0.050, 7000);
}
#+end_src

{{{button(Run,multi_switch())}}}

Why does this "ring"? I'm following the Switches chapter of /Designing Sound/ but I don't understand why my results are different from his.

* Motor
:PROPERTIES:
:CUSTOM_ID: motor
:END:

The Motors chapter of /Designing Sound/ divides the motor into a rotor and stator. The drive waveform (a 0-1 phasor raised to a power) modulates the amplitude of the rotor and brush signals.

#+begin_src js :tangle yes
function get_slider(id) {
    return Number(document.getElementById(id).value);
}

function fill_one(t, env, state) {
    return 1.0;
}

function fill_phasor_power(t, env, state) {
    var phase = (t * env.freq) % 1.0;  // 0-1 phasor at env.freq Hz
    return Math.pow(phase, env.power);
}

function rotor() {
    var T = context.currentTime;

    var noise = make_buffer(fill_hihat, {});
    var filter1 = context.createBiquadFilter();
    filter1.type = "bandpass";
    filter1.frequency.value = 4000;
    filter1.Q.value = 1;
    noise.connect(filter1);
    var gain1 = context.createGain();
    gain1.gain.value = get_slider('brush-level');
    filter1.connect(gain1);

    var constant = make_buffer(fill_one, {});
    var gain2 = context.createGain();
    gain2.gain.value = get_slider('rotor-level');
    constant.connect(gain2);

    var gain3 = context.createGain();
    gain3.gain.value = 0;  // the drive waveform supplies the gain
    gain1.connect(gain3);
    gain2.connect(gain3);

    var freq = get_slider('motor-drive');
    var drive = make_buffer(fill_phasor_power, {power: 4, freq: freq});
    drive.loop = true;
    drive.connect(gain3.gain);
    gain3.connect(context.destination);

    noise.start(T);
    drive.start(T);
    constant.start(T);
    noise.stop(T + 1);
    drive.stop(T + 1);
    constant.stop(T + 1);
}
#+end_src

#+begin_export html
Drive: <input type="range" id="motor-drive">
<br>Brush: <input type="range" id="brush-level">
<br>Rotor: <input type="range" id="rotor-level">
<br>Set parameters then click <button onclick="rotor()">Run</button>
#+end_export
This procedurally generated sound was used in http://trackstar.glitch.me/intro! ([[https://github.com/joegaffey/trackstar][source]])

* Karplus-Strong algorithm
:PROPERTIES:
:CUSTOM_ID: karplus-strong-algorithm
:END:

[[https://blog.demofox.org/2016/06/16/synthesizing-a-pluked-string-sound-with-the-karplus-strong-algorithm/][See this explanation]] and also [[http://sites.music.columbia.edu/cmc/MusicAndComputers/chapter4/04_09.php][this explanation]]. There's a WebAudio demo [[http://amid.fish/javascript-karplus-strong][here]] that uses hand-written asm.js (!) but it doesn't work well when it's in a background tab. There's a much fancier demo [[https://chinmay.audio/castro/][here]] using the same library (which isn't open source licensed). There's another demo [[https://tinyrave.com/tracks/67/remix][here]] that uses very little code.

(I didn't write my own demo, but a minimal sketch follows.)
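The core of the algorithm is short: start a delay line with a burst of noise, then keep feeding back the average of adjacent samples, which acts as a gentle low pass filter that damps the "string". A sketch that fills an =AudioBuffer= directly, like the drums above (my own sketch, not taken from the linked demos):

#+begin_src js
// Minimal Karplus-Strong pluck: noise burst + averaging feedback loop
function pluck(freq) {
    var seconds = 2;
    var count = context.sampleRate * seconds;
    var buffer = context.createBuffer(1, count, context.sampleRate);
    var data = buffer.getChannelData(0);
    var N = Math.round(context.sampleRate / freq);  // delay line length
    for (var i = 0; i < N; i++) {
        data[i] = Math.random() * 2 - 1;  // initial burst of noise
    }
    for (var i = N; i < count; i++) {
        // average of the two samples one period ago: a low pass
        // filter that gradually damps the vibration
        data[i] = 0.5 * (data[i - N] + data[i - N + 1]);
    }
    var source = context.createBufferSource();
    source.buffer = buffer;
    source.connect(context.destination);
    source.start();
}
#+end_src

Something like =pluck(220)= gives a guitar-ish string tone; higher frequencies use shorter delay lines, so they pass through the filter more often and decay faster.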
* Other
:PROPERTIES:
:CUSTOM_ID: other
:END:

Like PureData and SuperCollider and probably other DSP software, there are parameters that are updated every audio sample ("a-rate" in WebAudio, "audio rate" in Pure Data) and parameters updated at a lower rate ("k-rate" in WebAudio, "control rate" in Pure Data). This probably shouldn't be visible to me.

There's a =duration= field on an audio buffer -- not sure about this.

Other possibly useful nodes:

- =StereoPannerNode= (2D)
- =PannerNode= (3D space, doppler effects, conical area, requires you to set the listener position)
- =ConvolverNode= (reverb) - convolution parameters generated by firing an "impulse" ping and then recording reflections. Impulse responses can be found on the internet.
- =DelayNode=
- =IIRFilterNode= (more general than =BiquadFilterNode= but less numerically stable?)
- =OscillatorNode= sine/square/triangle/sawtooth, or custom controlled by =PeriodicWave=
- =WaveShaperNode= ("non-linear distortion effects")
- =ScriptProcessorNode= (run your own js function to process each chunk of audio buffer -- may be deprecated)
- =DynamicsCompressorNode= (raises volume of soft parts, lowers volume of loud parts, used in games)

WebAudio provides all this so that it can run on a high priority audio thread, separate from the main javascript execution thread. (Maybe this is why ScriptProcessorNode is deprecated.)

Why do we need all this special processing for audio, which is under 200k/sec of data, but not for video, which can be gigabytes/sec of data? It's about *latency*. (Thanks to [[https://www.mikeash.com/pyblog/why-coreaudio-is-hard.html][this article]] for making me think about this.) Video is played at 60 frames/second (one frame every 16 milliseconds) but audio is played at 44,100 samples/second (one sample every 22 microseconds), a deadline almost a factor of 1000 tighter. Video needs high bandwidth; audio needs low latency.

Operating systems run tasks at around 60-100 times per second, so audio work has to be done in buffers. You want small buffers to be able to react quickly, but large buffers to avoid running out of data: for example, a 1024-sample buffer at 44,100 Hz holds about 23 milliseconds of sound, and if it isn't refilled in time the output glitches.

** More other
:PROPERTIES:
:CUSTOM_ID: more-other
:END:

- Google's using [[https://github.com/magenta/magenta][AI to generate music]].
- Try triangle wave → gain ×10 → waveshaper =[-1,+1]=
- Try a low pass =<5000= to cut out some of the sudden audio-switching distortion
- [[https://djen.co/#/][This procedural audio generator]] makes “metal” music (but I think it's using mp3/wav samples underneath)
- [[https://codepen.io/jakealbaugh/full/qNrZyw/][Musical Chord Progression Generator]]
- [[https://cmc.music.columbia.edu/MusicAndComputers/chapter5/05_02.php][Reverb]] is something you can measure by emitting an "impulse" and then recording what reflects off of all the surfaces. You can then use convolution to apply this to all the sounds you generate.
- [[http://aspress.co.uk/sd/][Downloadable versions of the Designing Sound examples]]

My overall experience playing with WebAudio in 2016 was unpleasant. I've heard the [[https://wiki.mozilla.org/Audio_Data_API][Mozilla API]] was nicer, but I didn't get a chance to use it. I thought maybe I was doing something wrong, but [[https://news.ycombinator.com/item?id=15240762][others have had bad experiences too]]. The whole system seems to have too many specific node types and not enough flexibility to make your own.

By mid-2021, Audio Worklets had become available across modern browsers, but they weren't around when I wrote this page in 2016.

#+begin_export html