Tag: deep learning

Odd Challenges of Using Deep Learning in Designing a Feedback Delay Network Reverb – Wojciech Kacper Werkowicz & Benjamin Whateley

  • Lobby
  • Tag Archives: deep learning

Join Us For ADC24 - Bristol - 11-13 November 2024
More Info: https://audio.dev/
@audiodevcon​

Odd Challenges of Designing a Feedback Delay Network Reverb With Deep Learning - Wojciech Kacper Werkowicz & Benjamin Whateley - ADC 2023

Past lustrum have seen the rise of interest in optimization of audio effects and synthesizer parameters in use cases including parameter inference from audio input, as well as approaches for Differentiable Digital Signal Processing (such as Magenta's DDSP). However, there are still notable limitations in the area, exemplified well by the problems posed by some fundamental DSP units such as IIR filters - issues of stability, interpretability and differentiability.

In this talk, we will take on all of the above. It will be done so in the context of a research endeavour into modelling room Impulse Responses using Feedback Delay Network (FDNs). Covering a range of approaches, from naive to more advanced, we will take multiple detours to look into machine learning challenges in context of direct applications to DSP, such as approximating common transformations, tackling computational efficiency, taming the explosivity of feedback systems, at last, hopefully, differentiating the undifferentiable.
_

Wojciech Kacper Werkowicz

Programmer, computer musician, improviser from Pruszków, Poland. After being introduced to electronic music by "Ishkur's Guide" in early episode of life, his interest persisted over years. Graduated from Music Computing and Technology BSc program at Goldsmiths in 2023, where he studied under Michael Zbyszynski, Seth Horvitz and Lance Putnam. Currently surveying historical and contemporary digital synthesis methods as a part of his Masters research at Institute of Sonology, The Hague, aiming to critically contextualise synthesis technologies through the lens of sound culture and philosophy. Interested in algorithmic music, machine learning, internet culture. Often enjoys mixing lo-fi technologies with the cutting edge.
_

Benjamin Whateley
_

Streamed & Edited by Digital Medium Ltd: https://online.digital-medium.co.uk
_

Organized and produced by JUCE: https://juce.com/
_

Special thanks to the ADC23 Team:

Sophie Carus
Derek Heimlich
Andrew Kirk
Bobby Lombardi
Tom Poole
Ralph Richbourg
Jim Roper
Jonathan Roper
Prashant Mishra

#adc #deeplearning #dsp #audio

Filed under: UncategorizedTagged with: , , , ,

Collaborative Songwriting & Production With Symbolic Generative AI – Sadie Allen & Anirudh Mani – ADC23

  • Lobby
  • Tag Archives: deep learning

https://audio.dev/ -- @audiodevcon​

Collaborative Songwriting and Production With Symbolic Generative AI - Sadie Allen & Anirudh Mani - ADC23

Generative AI has experienced remarkable advancements in various domains, including audio and music. However, despite these breakthroughs, we have yet to reach a stage where musicians can seamlessly incorporate generative AI into their creative processes. In this talk, we will delve into the techniques, proposals, and ongoing work that can facilitate collaborative songwriting and production with machine learning.

During the session, we will explore several key topics:
• Overview of existing tools and models - we will discuss the motivations behind symbolic generation versus raw audio for music production applications. Furthermore, we will highlight the contrasting approaches and techniques that aim to augment the creative process rather than replace it entirely.
• Utilization of AI-generated MIDI as a songwriting tool - this will involve examining different ML architectures for conditional MIDI generation, as well as employing reinforcement learning (RL) to generate MIDI sequences.
• Examples showcasing how speakers and other musicians currently utilize AI-generated MIDI as part of their songwriting/production process.

Attendees will gain insights into cutting-edge techniques and research, paving the way for a future where generative AI becomes an integral part of the creative process for musicians.

Link to Slides: https://drive.google.com/file/d/15qYW-SbgmodMZ_wiMKKvH8pXmrDCZQpY/view?usp=sharing
_

Sadie Allen
_

Anirudh Mani

I build creative AI tools for artists.I am the co-founder of Lemonaide Music. https://www.lemonaide.ai/ https://www.linkedin.com/in/anirudh-mani-1796934b/ https://twitter.com/anirudh3
_

Streamed & Edited by Digital Medium Ltd: https://online.digital-medium.co.uk
_

Organized and produced by JUCE: https://juce.com/
_

Special thanks to the ADC23 Team:

Sophie Carus
Derek Heimlich
Andrew Kirk
Bobby Lombardi
Tom Poole
Ralph Richbourg
Jim Roper
Jonathan Roper
Prashant Mishra

#adc #ai #dsp #audio #generativeai

Filed under: UncategorizedTagged with: , , , ,

Real-Time Inference of Neural Networks: A Guide for DSP Engineers – Valentin Ackva & Fares Schulz

  • Lobby
  • Tag Archives: deep learning

https://audio.dev/ -- @audiodevcon​

Real-time Inference of Neural Networks: A Practical Approach for DSP Engineers - Valentin Ackva & Fares Schulz - ADC 2023

In upcoming audio processing innovations the intersection of neural networks and real-time environments is set to play a decisive role. Our recent experience of implementing neural timbre transfer technology in a real-time setting has presented us with diverse challenges. Overcoming them has provided us with significant insights into the practicalities of inferencing neural networks inside an audio plugin.

This talk presents a pragmatic approach: Starting with a trained model, we guide you through the necessary steps for inferencing the model in a real-time environment. On our way we delve into the critical aspect of maintaining real-time safety, share proven strategies to ensure a seamless and uninterrupted signal flow. Moreover, we address the delicate balance between latency, performance, and stability. For this we utilize three different inference engines: libtorch, tensorflow-lite and onnxruntime. While the in-house solutions for the popular machine learning frameworks PyTorch and TensorFlow, seem obvious choices, sometimes other engines may be better suited for certain use cases. By contrasting the characteristics of the engines, we hope to simplify your decision-making process.
_

Valentin Ackva

I am an audio programmer and electronic musician based in Berlin. With a background in computer science, I'm currently working towards my master's degree in Audio Communication and Technology at the Technische Universität Berlin. My passion lies at the intersection of music, programming, and technology, especially where artistry meets innovation. For the last 3 years, I have been working as an audio software developer at a speech processing startup in Leipzig. At my position there, I am responsible for the development of audio effects for speech enhancement. This role includes research into the real-time implementation of state-of-the-art neural networks for tasks such as denoising, audio super-resolution, and dereverberation. This year, I have co-founded a collective that combines the fields of DSP and AI, bringing together a group of audio programmers, machine learning engineers, and artists based in Berlin. In March, we released our first software, "Scyclone", an audio plugin that utilizes neural timbre transfer technology, introducing a new approach to automatic layering. Scyclone's innovative design and interaction of DSP and AI led to it winning the Audio Plugin Competition organised by the Audio Programmer.
_

Fares Schulz

Hello! I am a student assistant at the Electronic Studio of Technische Universität Berlin, currently pursuing a master's degree in Audio Communication and Technology. My educational background includes two bachelor's degrees in physics and audio engineering. During this time, my passion for audio software gradually led me from theoretical mathematical equations and abstract artistic concepts to their development as DSP algorithms in Python and their implementation as real-time audio applications in C++. Currently, I am particularly interested in exploring novel applications of neural networks for digital signal processing. Together with like-minded people, I recently developed the open source project Scyclone, which won the Neural Audio Plugin Competition organized by Audio Programmer. In addition to my academic and open source endeavors, I am actively involved in the development of noise reduction algorithms in Python and C++ for medical devices at Miethke.
_

Streamed & Edited by Digital Medium Ltd: https://online.digital-medium.co.uk
_

Organized and produced by JUCE: https://juce.com/
_

Special thanks to the ADC23 Team:

Sophie Carus
Derek Heimlich
Andrew Kirk
Bobby Lombardi
Tom Poole
Ralph Richbourg
Jim Roper
Jonathan Roper
Prashant Mishra

#adc #audiodev #dsp #audio

Filed under: UncategorizedTagged with: , , ,