Showing papers for
.
Score Reduction for Guitar Through Reinforcement Learning
Christodoulos Benetatos (University of Rochester)*, Zhiyao Duan (Unversity of Rochester)
Poster is presented in-person.
BeatlesFC: Harmonic function annotations of Isophonics' The Beatles dataset
Jiyeoung Sim (The CUNY Graduate Center), Rebecca Moranis (CUNY Graduate Center), Johanna Devaney (Brooklyn College)*
Poster is presented virtually.
Localify.org: Contextualizing Long-Tail Music for Local Artist Discovery
Paul Gagliano (Ithaca College)*, Griffin Homan (Ithaca College), Cassandra Raleault (Ithaca College), Ruth Ayambem (Ithaca College), Bridget Burns (Ithaca College), Douglas Turnbull (Ithaca College)
Poster is presented in-person.
Using feature-based composer classification to test musicological evidence for Josquin attribution
Cory McKay (Marianopolis College)*, Julie Cumming (McGill University)
Poster is presented in-person.
Mamba-Based Model for Automatic Chord Recognition
Chunyu N Yuan (the Graduate Center, CUNY), Johanna Devaney (Brooklyn College)*
Poster is presented virtually.
Language Models for Music Medicine Generation
Emmanouil Nikolakakis (University of California, Santa Cruz), Joann Ching (Johannes Kepler University), Emmanouil Karystinaios (Johannes Kepler University)*, Gabrielle Sipin (Liberty Healthcare Corporation), Gerhard Widmer (Johannes Kepler University), Razvan V Marinescu (UC Santa Cruz)
Poster is presented in-person.
A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument
Kyungsu Kim (Seoul National University)*, Junghyun Koo (Sony AI), Sungho Lee (Seoul National University), Haesun Joung (Seoul National University), Kyogu Lee (Seoul National University)
Poster is presented in-person.
HI-AUDIO ONLINE PLATFORM: OPPORTUNITIES AND CHALLENGES OF COLLECTING VARIED MUSIC DATA ON THE WEB
Jose Gil Panal (LTCI, Telecom Paris, Institut polytechnique de Paris), Aurelien David (LTCI, Telecom Paris, Institut polytechnique de Paris), Gaël Richard (LTCI, Telecom Paris, Institut polytechnique de Paris)*
Poster is presented in-person.
Symbotunes: unified hub for symbolic music generative models
Paweł Skierś (Warsaw University of Technology), Maksymilian Łazarski (Warsaw University of Technology), Michał Kopeć (Warsaw University of Technology), Mateusz Modrzejewski (Warsaw University of Technology)*
Poster is presented virtually.
ITO-Master: Inference-Time Optimization for Music Mastering Style Transfer
Junghyun Koo (Sony AI)*, Marco A Martinez Ramirez (Sony AI), Wei-Hsiang Liao (Sony Group Corporation), Giorgio Fabbro (Sony), Michele Mancusi (Sony Europe), Yuki Mitsufuji (Sony AI)
Poster is presented in-person.
How does the teacher rate? Observations from the NeuroPiano dataset
Huan Zhang (Queen Mary University of London)*, Vincent K.M. Cheung (Sony Computer Science Laboratories, Inc.), Hayato Nishioka (Sony Computer Science Laboratories, Inc), Simon Dixon (Queen Mary University of London), Shinichi Furuya (Sony Computer Science Laboratories Inc.)
Poster is presented in-person.
Multimodal Structured Extraction for Self-Querying Music Video Retrieval and Playlist Generation
Kevin Dela Rosa (Aviary Labs)*
Poster is presented in-person.
ARTIFICIAL ACOUSTIC PIANO RESONANCE WITH A SOUNDBOARD-MOUNTED SHAKER
William A Thompson (University of Southern Mississippi)*, Austin A Franklin (Louisiana State University)
Poster is presented virtually.
Self-Supervised Multi-View Learning for Disentangled Music Audio Representations
Julia Wilkins (New York University)*, Sivan Ding (NYU), Magdalena Fuentes (New York University), Juan P Bello (New York University)
Poster is presented in-person.
CloserMusicDB: A Modern Multipurpose Dataset of High Quality Music
Mateusz Modrzejewski (Warsaw University of Technology)*, Aleksander Tym (Closer Music), Aleksandra Piekarzewicz (Closer Music), Tomasz Sroka (Closer Music)
Poster is presented virtually.
MODELING PREDOMINANT INSTRUMENTATION WITH DIFFUSION
Charis Cochran (Drexel University)*, Youngmoo Kim (Drexel University)
Poster is presented in-person.
Exploring Transformer-Based Music Overpainting for Jazz Piano Variations
Eleanor Row (Queen Mary University of London)*, Ivan Shanin (Queen Mary University of London), George Fazekas (QMUL)
Poster is presented in-person.
Audio Atlas: Visualizing and Exploring Audio Datasets
Luca A Lanzendoerfer (ETH Zurich)*, Florian Grötschla (ETH Zürich), Uzeyir Valizada (ETH Zurich), Roger Wattenhofer (ETH Zurich)
Poster is presented in-person.
Audio Data Defenses: Protecting Music and Speech Data from Targeted Attacks
Julia Barnett (Northwestern University)*, William Agnew (Carnegie Mellon University), Robin Netzorg (UC Berkeley), Patrick O'Reilly (Northwestern University), Ezra Awumey (Carnegie Mellon University), Chris Donahue (Carnegie Mellon University), Sauvik Das (Carnegie Mellon University)
Poster is presented in-person.
Hookpad Aria: A Copilot for Songwriters
Chris Donahue (CMU)*, Shih-Lun Wu (Carnegie Mellon University), Yewon Kim (KAIST), Dave Carlton (Hooktheory), Ryan Miyakawa (Hooktheory), John Thickstun (University of Washington)
Poster is presented in-person.
Conditional piano music generation by flow matching for performance style transfer
Ahyeon Choi (Seoul National University)*, Dohoon Lee (Seoul National University), Kyogu Lee (Seoul National University)
Poster is presented in-person.
Chord Naming for Analysis
Mayank Sanganeria (Independent), Christopher G Leeper (Sharp15 Studios)*
Poster is presented virtually.
S3: A Symbolic Music Dataset for Computational Music Analysis of Symphonies
Zih-Syuan Lin (Academia Sinica)*, Yu-Chia Kuo (McGill University ), TZU-YUN Hung (National Taiwan Normal University), Wei-Yang Lin (National Taiwan University), YA-HSUAN CHU (National Yang-Ming Chiao-Tung University), Ting-Kang Wang (Academia Sinica), Jing-Heng Huang (Academia Sinica), Chien Chang (Academia Sinica), Christofer Julio (University of Malaya), Gloria Hsieh (Academia Sinica), Li Su (Academia Sinica)
Poster is presented in-person.
Piano Concerto Accompaniment Creation
Yigitcan Özer (International Audio Laboratories Erlangen), Simon J Schwär (International Audio Laboratories Erlangen)*, Meinard Müller (International Audio Laboratories Erlangen)
Poster is presented in-person.
OPTIMIZING MUSIC CAPTIONING WITH REINFORCEMENT LEARNING AND RETRIEVAL-AUGMENTED METHODS
Haesun Joung (Seoul National University)*, Jinwoo Lee (Huawei Tech.), Kyogu Lee (Seoul National University)
Poster is presented in-person.
Text2EQ: Human-in-the-Loop Co-Creation Interface for EQ
Annie Chu (Northwestern University)*, Hugo Flores García (Northwestern University), Patrick O'Reilly (Northwestern University), Bryan Pardo (Northwestern University)
Poster is presented in-person.
HARP 2.0: Expanding Hosted, Asynchronous, Remote Processing for Deep Learning in the DAW
Christodoulos Benetatos (University of Rochester), Frank Cwitkowitz (University of Rochester), Nathan Pruyne (Northwestern University), Hugo Flores García (Northwestern University), Patrick O'Reilly (Northwestern University)*, Zhiyao Duan (Unversity of Rochester), Bryan Pardo (Northwestern University)
Poster is presented in-person.
Computationally Validating Synchronisation Between Musical Phrase Arcs and Autonomic Variables
Natalia Cotic (King's College London)*, Vanessa Pope (King's College London), Mateusz Solinski ( King's College London), Pier Lambiase (University College London), Elaine Chew (King's College London)
Poster is presented virtually.
Local Deployment of Large-Scale Music AI Models on Commodity Hardware
Xun Zhou (Carnegie Mellon University)*, Charlie Ruan (Carnegie Melllon University), Zihe Zhao (Carnegie Melllon University), Chris Donahue (Carnegie Mellon University)
Poster is presented in-person.
Musical Source Separation of Brazilian Percussion
Richa Namballa (New York University)*, Giovana V Morais (New York University), Magdalena Fuentes (New York University)
Poster is presented in-person.
ENHANCED FORMULATION OF THE LATENT ORDER LOGISTIC REGRESSION (LOLOG) MODEL FOR ANALYSIS OF AUSTRALIAN MUSICIAN NETWORKS
Lekshmy Hema Nair (Western Sydney University)*, Simon Chambers (Western Sydney University), Roger T. Dean (The MARCS Institute for Brain, Behaviour and Development, Western Sydney University)
Poster is presented in-person.
symusic: A swift and unified toolkit for symbolic music processing
Yikai Liao (Beijing University of Posts and Telecommunications), Zhongqi Luo (Qiyin Technology Co., Ltd.)*, Yue Wang (China Conservatory of Music), Yujie Wu (殷渝杰),
Poster is presented virtually.
Masked Token Modeling for Zero-Shot Anything-to-Drums Conversion
Patrick O'Reilly (Northwestern University)*, Hugo Flores García (Northwestern University), Prem Seetharaman (Adobe), Bryan Pardo (Northwestern University)
Poster is presented in-person.
A Music Information Retrieval Approach to Classify Sub-Genres in Role Playing Games
Daeun Hwang (University of California, Santa Cruz)*, Xuyuan Cai (University of California, Santa Cruz), Edward Melcer (University of California, Santa Cruz), Elin Carstensdottir (University of California, Santa Cruz)
Poster is presented in-person.
Groove Transfer VST for Latin American Rhythms
Anmol Mishra (Universitat Pompeu Fabra)*, Behzad Haki (Universitat Pompeu Fabra), Satyajeet Prabhu (Universitat Pompeu Fabra), Martín Rocamora (Universitat Pompeu Fabra)
Poster is presented in-person.
Zero-shot Crate Digging: DJ Tool retrieval using Speech Activity, Music Structure and CLAP embeddings
Iroro Orife (Netflix)*
Poster is presented in-person.
Boundary Regression for Leitmotif Detection in Music Audio
Sihun Lee (Sogang University)*, Dasaem Jeong (Sogang University)
Poster is presented in-person.
GENERATIVE SINGING STYLE TRANSFER ACROSS GENRES
Saanvi Bhargava (The Harker School), Ethan Chu (Monta Vista High School), Matthew Lee (Riverdale Country School), Chuyang Chen (New York University), Kelvin Walls (New York University), Bea Steers (New York University), Iran R Roman (Stanford University)*
Poster is presented in-person.
The Surprising Effect of Song-Level Demixing for Music Foundation Model Pretraining
Junyan Jiang (New York University Shanghai)*, Akira Maezawa (Yamaha Corporation), Gus Xia (New York University Shanghai)
Poster is presented virtually.
Skip That Beat: Augmenting Meter Tracking Models for Underrepresented Time Signatures
Giovana V Morais (New York University)*, Brian McFee (New York University), Magdalena Fuentes (New York University)
Poster is presented in-person.
Automatic Album Sequencing
Vincent Herrmann (IDSIA/USI/SUPSI)*, Dylan R Ashley (The Swiss AI Lab IDSIA, USI, SUPSI), Jürgen Schmidhuber (IDSIA - Lugano)
Poster is presented virtually.
mshoxxDB - a Versioned Dataset for Electronic Music
Michael Taenzer (Fraunhofer IDMT / UPF)*
Poster is presented in-person.
Collecting & Managing the Metadata on the Data of ISMIR
Ashley Luna (Smith College), Diana Diaz (Smith College), Charis Cochran (Drexel University), Andrew Wiggins (Drexel University), Katherine M. Kinnaird (Smith College)*
Poster is presented in-person.
Zero-Shot Structure Labeling with Audio and Language Model Embeddings
Morgan Buisson (Telecom-Paris)*, Christopher A Ick (New York University), Qingyang Xi (NYU), Brian McFee (New York University)
Poster is presented in-person.
Real-time Flutist Gesture Cue Detection System for Auto-Accompaniment
Jaeran Choi (KAIST)*, Taegyun Kwon (KAIST), Joonhyung Bae (KAIST), Jiyun Park (KAIST), Yonghyun Kim (Georgia Institute of Technology), Juhan Nam (KAIST)
Poster is presented in-person.
MidiTok Visualizer: a tool for visualization and analysis of tokenized MIDI symbolic music
Michał Wiszenko (Warsaw University of Technology), Kacper Stefański (Warsaw University of Technology), Piotr Malesa (Warsaw University of Technology), Łukasz Pokorzyński (Warsaw University of Technology), Mateusz Modrzejewski (Warsaw University of Technology)*
Poster is presented virtually.
LyCon: Lyrics Reconstruction from the Bag-of-Words Using Large Language Models
Haven Kim (University of California San Diego)*, Kahyun Choi (UIUC)
Poster is presented virtually.
Towards Computational Analysis of Pansori Singing
Sangheon Park (Georgia Institute of Technology)*, Danbinaerin Han (KAIST), Dasaem Jeong (Sogang University)
Poster is presented in-person.
Diff-MST^C: A Mixing Style Transfer Prototype for Cubase
Soumya Sai Vanka (QMUL)*, Lennart Hannink (Steinberg Media Technologies GmbH), Jean-Baptiste Rolland (Steinberg Media Technologies GmbH), George Fazekas (QMUL)
Poster is presented in-person.
UNCOVERING THE MICROTONES IN A RAAG FROM NOTE TRANSCRIPTIONS
Neeraja Abhyankar (Unaffiliated)*
Poster is presented in-person.
Emotion-based Piano Score Generation via Two-stage Transformer VAE
Jiahao Zhao (Kyoto University)*, Kazuyoshi Yoshii (Kyoto University)
Poster is presented in-person.
REVAMP: VISUALISATION AND ANALYSIS IN THE DIGITAL AUDIO WORKSTATION
Chris Cannam (QMUL), George Fazekas (QMUL)*
Poster is presented in-person.
VizMuc - Vizualization of Music Corpora
Filip Trplan (University of Ljubljana), Klara Žnideršič (University of Ljubljana), Vid Klopčič (University of Ljubljana), Matevž Pesek (University of Ljubljana)*, Leon Stefanija (University of Ljubljana), Matija Marolt (University of Ljubljana)
Poster is presented in-person.
Optical Music Recognition for Jeongganbo Notation of Korean Court Music
DongMin Kim (Sogang University), Danbinaerin Han (KAIST), Dasaem Jeong (Sogang University)*, Jose J. Valero-Mas (University of Alicante)
Poster is presented in-person.
Do Captioning Metrics Reflect Music Semantic Alignment?
Jinwoo Lee (Huawei Tech.)*, Kyogu Lee (Seoul National University)
Poster is presented virtually.
Perception of ragas is influenced by enculturation and musical training - a pilot study
Vidya Rangasayee (Stanford University)*, Prahlad Saravanapriyan (Washington High School), Takako Fujioka (Department of Music, Center for Computer Research in Music and Acoustics, Stanford University, Stanford, CA, USA)
Poster is presented in-person.
REFFLY: MELODY-CONSTRAINED LYRICS EDITING MODEL
Songyan Zhao (University of California, Los Angeles)*
Poster is presented in-person.
Analysis of the Originality of Gen-AI Song Audio
Rajesh Fotedar (University of Miami), Tom Collins (University of Miami)*
Poster is presented in-person.
Pitch ControlNet: Continuous Pitch Control for Monophonic Instrument Sound Generation
Dabin Kim (Korea Advanced Institute of Science and Technology)*, Junwon Lee (KAIST), Minseo Kim (KAIST), Juhan Nam (KAIST)
Poster is presented in-person.
Matchmaker: A Python library for Real-time Music Alignment
Jiyun Park (KAIST)*, Carlos Eduardo Cancino-Chacón (Johannes Kepler University Linz), Taegyun Kwon (KAIST), Juhan Nam (KAIST)
Poster is presented in-person.
Enhancement of Speech and Language Models through unsupervised Learning with Music Datasets
Eviatar Bas (Independent)*, Iran R Roman (Queen Mary University of London)
Poster is presented in-person.
Interval Mover’s Distance: Melodic Stylistic Analysis Using Theoretical Frameworks
Valeri Sazonov (University of Alabama)*
Poster is presented virtually.
SOURCE-LEVEL PITCH AND TIMBRE EDITING FOR MIXTURES OF TONES USING DISENTANGLED REPRESENTATIONS
Yin-Jyun Luo (Queen Mary University of London)*, Kin Wai Cheuk (Sony AI), Woosung Choi (Sony AI), Toshimitsu Uesaka (Sony Group Corporation), Keisuke Toyama (Sony Group Corporation), Wei-Hsiang Liao (Sony Group Corporation), Simon Dixon (Queen Mary University of London), Yuki Mitsufuji (Sony AI)
Poster is presented in-person.
Demo of Zero-Shot Guitar Amplifier Modelling: Enhancing Modeling with Hyper Neural Networks
Yu-Hua Chen (NTU)*, Yuan-Chiao Cheng (Positive Grid), Yen-Tung Yeh (National Taiwan University), Jui-Te Wu (Positive Grid), Yu-Hsiang Ho (Positive Grid ), Jyh-Shing Roger Jang (National Taiwan University), Yi-Hsuan Yang (National Taiwan University)
Poster is presented in-person.
A New Dataset for Tag- and Text-based Controllable Symbolic Music Generation
Weihan Xu (Duke University)*, Julian McAuley (UCSD), Taylor Berg-Kirkpatrick (UCSD), Shlomo Dubnov (UC San Diego), Hao-Wen Dong (University of Michigan)
Poster is presented in-person.
Exploring Tokenization Methods for Multitrack Sheet Music Generation
Yashan Wang (Central Conservatory of Music)*, Shangda Wu (Central Conservatory of Music), Xingjian Du (University of Rochester), Maosong Sun (Tsinghua University)
Poster is presented in-person.
Enhanced Automatic Drum Transcription via Drum Stem Source Separation
Xavier Riley (C4DM)*, Simon Dixon (Queen Mary University of London)
Poster is presented in-person.
Towards Robust Transcription: Exploring Noise Injection Strategies for Training Data Augmentation
Yonghyun Kim (Georgia Institute of Technology)*, Alexander Lerch (Georgia Institute of Technology)
Poster is presented in-person.
PyNeuralFx: A Python Package for Neural Audio Effect Modeling
Yen-Tung Yeh (National Taiwan University)*, Wen-Yi Hsiao (Indepedent Researcher), Yi-Hsuan Yang (National Taiwan University)
Poster is presented in-person.
Facing the Music: Tackling Singing Voice Separation in Cinematic Audio Source Separation
Karn N Watcharasupat (Georgia Institute of Technology)*, Chih-Wei Wu (Netflix, Inc.), Iroro Orife (Netflix)
Poster is presented in-person.
A DBN-Based Regularization Approach for Training Postprocessing-free Joint Beat and Downbeat Estimator
Yiming Wu (AlphaTheta Corporation)*, Yuya Yamamoto (AlphaTheta Corporation), Shunya Ishikawa (The University of Electro-Communications)
Poster is presented in-person.
PYAMPACT: A SCORE-AUDIO ALIGNMENT TOOLKIT FOR PERFORMANCE DATA ESTIMATION AND MULTI-MODAL PROCESSING
Johanna Devaney (Brooklyn College)*, Daniel McKemie (Brooklyn College), Alexander Morgan (Independent)
Poster is presented virtually.
The Voice of an Instrument: Analysis of X-vectors for Music Emotion Recognition
Mariana Rodríguez Castañeda (UNAM), Iran R Roman (Stanford University)*
Poster is presented virtually.
MusicGen-Chord: Advancing Music Generation through Chord Progressions and Interactive Web-UI
Jongmin Jung (Sogang University)*, Andreas Jansson (Replicate), Dasaem Jeong (Sogang University)
Poster is presented in-person.
MIRFLEX: Music Information Retrieval Feature Library for Extraction
Anuradha Chopra (Singapore University of Technology and Design)*, Abhinaba Roy (SUTD), Dorien Herremans (Singapore University of Technology and Design)
Poster is presented in-person.
Demonstrating OpenMU-LightBench: A benchmark suite for music understanding
Mengjie Zhao (Sony Group Corporation)*, Zhi Zhong (Sony Group Corporation), Zhuoyuan Mao (Sony Group Corporation), Shiqi Yang (Sony), Wei-Hsiang Liao (Sony Group Corporation), Shusuke Takahashi (Sony Group Corporation), Hiromi Wakaki (Sony Group Corporation), Yuki Mitsufuji (Sony AI)
Poster is presented in-person.
AN EXPLORATION OF MUSIC STRUCTURE SEGMENTATION USING EEG DATA AND MSAF ALGORITHMS
Neha Rajagopalan (Stanford University)*, Blair Kaneshiro (Stanford University)
Poster is presented in-person.
SONG REVIEW GENERATION USING ACOUSTIC INFORMATION AND LYRICS
Keita Kawachi (Nagoya Institute of Technology)*, Shinji Sako (Nagoya Institute of Technology)
Poster is presented in-person.
Understanding Human Perception of Music Plagiarism Through a Computational Approach
Daeun Hwang (University of California, Santa Cruz)*, Hyeonbin Hwang (KAIST)
Poster is presented in-person.
Wavespace: A Highly Explorable Wavetable Generator
Hazounne Lee (Seoul National University)*, Kihong Kim (Kyungpook National University), Sungho Lee (Seoul National University), Felix You (The University of Texas at Austin), Kyogu Lee (Seoul National University)
Poster is presented in-person.