Tag: AI

Developing an AI-Powered Karaoke Experience – Thomas Hézard & Clément Tabary – ADC23

https://audio.dev/ -- @audiodevcon

Karaoke has been popular for many years, from the first karaoke bars in the 1970s to the karaoke video games of today, and recent progress in deep learning has opened up new horizons. Audio source separation and voice transcription algorithms now make it possible to create a complete karaoke song, with an instrumental track and synchronised lyrics, from any mixed music track. Real-time stem remixing, pitch and tempo control, and singing quality assessment are further audio features that go beyond the traditional karaoke experience. In this talk we will discuss the challenges we had to tackle to provide our users with a fully automatic and integrated karaoke system adapted for both mobile and web platforms.
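
As a rough illustration of stem remixing (not the speakers' implementation), the sketch below assumes a source separator has already produced equal-length stems as lists of samples. The function name `remix_stems`, the stem names, and the sample values are hypothetical. Setting the vocal gain to zero yields an instrumental, karaoke-style mix.

```python
def remix_stems(stems, gains):
    """Mix equal-length stems sample by sample, scaling each by its gain.

    Stems missing from `gains` keep a gain of 1.0.
    """
    names = list(stems)
    length = len(stems[names[0]])
    if any(len(stems[name]) != length for name in names):
        raise ValueError("all stems must have the same length")
    return [
        sum(gains.get(name, 1.0) * stems[name][i] for name in names)
        for i in range(length)
    ]


# Hypothetical three-sample stems standing in for a separator's output.
stems = {
    "vocals": [0.5, -0.25, 0.125],
    "drums": [0.25, 0.25, -0.5],
    "bass": [0.0, 0.5, 0.25],
}

# Mute the vocals and keep everything else at unit gain.
karaoke_mix = remix_stems(stems, {"vocals": 0.0})
```

A real-time system would apply the same gains block by block and smooth gain changes to avoid clicks; that is omitted here.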
_

Thomas Hézard

Thomas leads the Audio Research & Development team at MWM, working with his team on innovative signal processing algorithms and their optimised implementation on various platforms. Before joining the MWM adventure, Thomas completed a PhD on voice analysis-synthesis at IRCAM in Paris. Fascinated by every aspect of sound and music, both artistic and scientific, Thomas is also a musician, a sound engineer, a passionate teacher, and an amateur photographer.
_

Clément Tabary

Clément is a deep-learning research engineer at MWM. He applies ML algorithms to a wide range of multimedia fields, from music information retrieval to image generation. He's currently working on audio source separation, music transcription, and automatic DJing.
_

Streamed & Edited by Digital Medium Ltd: https://online.digital-medium.co.uk
_

Organized and produced by JUCE: https://juce.com/
_

Special thanks to the ADC23 Team:

Sophie Carus
Derek Heimlich
Andrew Kirk
Bobby Lombardi
Tom Poole
Ralph Richbourg
Jim Roper
Jonathan Roper
Prashant Mishra

#adc #audiodev #ai #karaoke


Virtual Studio Production Tools With AI Driven Personalized Spatial Audio for Immersive Mixing

Join Us For ADC24 - Bristol - 11-13 November 2024
More Info: https://audio.dev/
@audiodevcon

Virtual Studio Production Tools With AI Driven Personalized Spatial Audio for Immersive Mixing - Dr. Kaushik Sunder & Krishnan Subramanian - ADCx India 2024

In recent years, spatial audio formats such as Dolby Atmos, Sony 360 Reality, and Auro 3D have been on the rise. As a result, there is an increasing need for multi-channel speaker setups and associated gear in the studio to produce, mix, and master music in such formats. These systems are extremely expensive, take up space, and are time-consuming to set up, which makes them a massive barrier to entry for most mixing engineers. In this talk, we will present some of the latest innovations in enabling an ecosystem of Virtual Studio Production with AI-driven personalized spatial audio. We explore the need for, and integration of, personalized HRTFs, room acoustics modeling, and personalized headphone equalization for such virtual production tools. We will also present our experience leveraging JUCE for building spatial audio plugins, particularly as it pertains to virtualizing real-world acoustic environments. By sharing our insights, this talk aims to provide valuable information to developers interested in building spatial audio plugins that bring down barriers of cost and accessibility, making “immersive for all” a reality for creative professionals.
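
For readers unfamiliar with how HRTFs are used, a common baseline is to convolve a mono source with a left and a right head-related impulse response (HRIR) to get a two-channel headphone signal. The sketch below shows only that baseline, not the speakers' system. The helper names and the toy HRIR values are made up for illustration; real HRIRs are measured or personalized and are far longer.

```python
def convolve(signal, kernel):
    """Direct-form linear convolution of two sample lists."""
    out = [0.0] * (len(signal) + len(kernel) - 1)
    for i, s in enumerate(signal):
        for j, k in enumerate(kernel):
            out[i + j] += s * k
    return out


def binaural_render(mono, hrir_left, hrir_right):
    """Return (left, right) ear signals for one virtual source."""
    return convolve(mono, hrir_left), convolve(mono, hrir_right)


# Toy HRIRs (assumed values) for a source on the listener's left:
# the right ear receives the sound two samples later and at half the level.
mono = [1.0, 0.5]
hrir_left = [1.0, 0.0, 0.0]
hrir_right = [0.0, 0.0, 0.5]

left, right = binaural_render(mono, hrir_left, hrir_right)
```

A plugin would normally use FFT-based partitioned convolution for efficiency, and would add room modeling and headphone equalization on top; those steps are outside this sketch.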

Link to Slides: https://data.audio.dev/talks/ADCxIndia/2024/ai-driven-personalized-spatial-audio-for-immersive-mixing.pdf
_

Edited by Digital Medium Ltd - online.digital-medium.co.uk
_

Organized and produced by JUCE: https://juce.com/
_

Special thanks to the ADC24 Team:

Sophie Carus
Derek Heimlich
Andrew Kirk
Bobby Lombardi
Tom Poole
Ralph Richbourg
Prashant Mishra

#adc #ai #audio #virtualstudio


AI Generated Voices: Towards Emotive Speech Synthesis – Vibhor Saran – ADCx India 2024

Traditionally, machine-generated voices were synthesised by joining the phonemes of a language, which made these voices sound robotic. With the availability of more data and the advent of deep learning, these AI voices started becoming more human and engaging. The next step is to make these AI-generated voices more emotive, so that they can laugh, be sad, or even cry, just as expressive human speech does. In this talk, we touch upon deep learning approaches to making synthetic voices more emotive. Specifically, we will focus on how to manipulate the Mel spectrogram of the speech to make it engaging, removing the dependency on large quantities of data.
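
As background on the Mel spectrogram's frequency axis, the sketch below converts between Hz and mels using the widely used formula m = 2595 * log10(1 + f / 700), and places band edges evenly on the mel scale. It is not the speaker's method. The function names, the number of bands, and the frequency range are assumptions chosen for illustration.

```python
import math


def hz_to_mel(hz):
    """Convert frequency in Hz to mels (2595 * log10(1 + f / 700))."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_band_edges(n_bands, f_min, f_max):
    """Return n_bands + 2 edge frequencies in Hz, evenly spaced in mels."""
    lo, hi = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (hi - lo) / (n_bands + 1)
    return [mel_to_hz(lo + i * step) for i in range(n_bands + 2)]


# Assumed settings: 8 bands between 0 Hz and 8000 Hz.
edges = mel_band_edges(8, 0.0, 8000.0)
```

Because the edges are evenly spaced in mels, the bands are narrow at low frequencies and wide at high frequencies, which is why edits made on the mel axis affect pitch-related detail more finely in the lower range.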

Link to Slides: https://data.audio.dev/talks/ADCxIndia/2024/towards-emotive-speech-synthesis.pdf
_

Edited by Digital Medium Ltd - online.digital-medium.co.uk
_

Organized and produced by JUCE: https://juce.com/
_

Special thanks to the ADC24 Team:

Sophie Carus
Derek Heimlich
Andrew Kirk
Bobby Lombardi
Tom Poole
Ralph Richbourg
Prashant Mishra

#adc #ai #dsp #audio #speechsynthesis


Building AI Music Tools: An Engineer’s Guide to Prototyping – Jamie Pond – ADC23

https://audio.dev/ -- @audiodevcon

Building AI Music Tools for the 99%: An Engineer’s Guide to Prototyping - Jamie Pond - ADC23

How to go from idea, to lo-fi prototype, to validation, to hi-fi prototype, to production.
Exploring the method we used this year to develop and ship three broad-appeal consumer audio apps to millions of users.

Link to Slides: https://data.audio.dev/talks/2023/an-engineers-guide-to-prototyping/slides.pdf
_

Jamie Pond
_

Streamed & Edited by Digital Medium Ltd: https://online.digital-medium.co.uk
_

Organized and produced by JUCE: https://juce.com/
_

Special thanks to the ADC23 Team:

Sophie Carus
Derek Heimlich
Andrew Kirk
Bobby Lombardi
Tom Poole
Ralph Richbourg
Jim Roper
Jonathan Roper
Prashant Mishra

#adc #audiodev #ai #audio
