The Co-Explorer is a software agent that uses deep reinforcement learning to support sound exploration through positive or negative feedback.

It allows people to explore parametric sound spaces by only communicating positive or negative preferences to the machine. A deep reinforcement learning algorithm is used to autonomously explore the space while interactively learning the user’s tastes, in an expressive workflow that we called co-exploration.

The Co-Explorer result from a human-centered design process of deep reinforcement learning with expert sound designers. We first led a pilot study with an early reinforcement learning prototype, called Sarsa, to understand what features would be important for sound designers leading parameter exploration. We then injected their feedback in the design of our final model prototype, based on Deep TAMER, which we evaluated in a creative workshop.

The current version of the software exists as a Python library; the OSC protocol supports connection between the deep reinforcement learning model (coded with the TensorFlow library) and parametric environments.

The Co-Explorer was also practiced within a research-creation project, which resulted in the ægo performance artwork.

The project was developed with Bavo Van Kerrebroeck, Baptiste Caramiaux, and Frédéric Bevilacqua in collaboration with the ISMM group of IRCAM and the ex)situ team of LRI (INRIA), in the context of a PhD thesis at Sorbonne Université.
Article at ACM TOCHI (2021)
Exhibition @ HMC 2022 (Dec.02.2022)
Workshops @ Ircam (Jul.19.2018)
Exhibition @ ISMIR 2018 (Sep.24.2018)

This slideshow requires JavaScript.