- GlobalMood: A Cross-Cultural Benchmark for Music Emotion Recognition. Harin Lee, Elif Celen, Peter M. C. Harrison, Manuel Anglada-Tort, Pol van Rijn, Minsu Park, Marc Schönwiesner, Nori Jacoby. 11-19
- RISE: Music Rearrangement for Realtime Intensity Synchronization With Exercise. Alexander Wang, Chris Donahue, Dhruv Jain. 20-27
- Expanding the HAISP Dataset: AI's Impact on Songwriting Across Two AI Song Contests. Lidia J. Morris, Michele Newman, Xinya Tang, Renee Singh, Marcel A. Vélez Vásquez, Rebecca Leger, Jin Ha Lee 0001. 28-35
- Quantifying Regularity in Music Structure Analysis. Brian McFee. 36-43
- On the De-Duplication of the Lakh MIDI Dataset. Eunjin Choi, Hyerin Kim, Jiwoo Ryu, Juhan Nam, Dasaem Jeong. 44-51
- Conditional Diffusion as Latent Constraints for Unconditional Symbolic Music Generation Models. Matteo Pettenò, Alessandro Ilic Mezza, Alberto Bernardini. 52-59
- Radif Corpus; Symbolic Dataset for Non-Metric Iranian Classical Music. Maziar Kanani, Seán O'Leary, James McDermott. 60-67
- Melodic and Metrical Elements of Expressiveness in Hindustani Vocal Music. Yash Bhake, Ankit Anand, Preeti Rao. 68-74
- Coloring Music: Bridging Music and Color Palettes for Graphic Design. Takayuki Nakatsuka, Masahiro Hamasaki, Masataka Goto. 75-82
- Exploring Network Adaptations for Minimum Latency Real-Time Piano Transcription. Patricia Hu, Silvan Peter, Jan Schlüter, Gerhard Widmer. 83-90
- A Systematic Evaluation of Real-Time Audio Score Following for Piano Performance. Jiyun Park, Carlos Eduardo Cancino Chacón, Suhit Chiruthapudi, Juhan Nam. 91-99
- Predicting Flutist Onset Timing in Duet Performance: A Multimodal Analysis of Gesture and Breath Cues. Jaeran Choi, Taegyun Kwon, Juhan Nam. 100-106
- AI-Generated Song Detection via Lyrics Transcripts. Markus Frohmann, Elena V. Epure, Gabriel Meseguer-Brocal, Markus Schedl, Romain Hennequin. 107-116
- Measuring Sensory Dissonance In Multi-Track Music Recordings: A Case Study With Wind Quartets. Simon J. Schwär, Stefan Balke, Meinard Müller. 117-126
- Reformulating Soft Dynamic Time Warping: Insights Into Target Artifacts and Prediction Quality. Johannes Zeitler, Meinard Müller. 127-133
- ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors. Junghyun Koo, Marco A. Martínez Ramírez, Wei-Hsiang Liao 0001, Giorgio Fabbro, Michele Mancusi, Yuki Mitsufuji. 134-141
- A Multidimensional Approach to Opera Analysis: Harmony, Tempo, and Dramatic Interaction in Wagner's Siegfried Act III. Pascal Schmolenzky, Stephanie Klauk, Rainer Kleinertz, Christof Weiß, Meinard Müller. 142-149
- Exploring the Feasibility of LLMs for Automated Music Emotion Annotation. Meng Yang, Jon McCormack, Maria Teresa Llano, Wanchao Su. 150-157
- An Evaluation Strategy for Local Key Estimation: Exploiting Cross-Version Consistency. Yiwei Ding, Yannik Venohr, Christof Weiß. 158-165
- Tuning Matters: Analyzing Musical Tuning Bias in Neural Vocoders. Hans-Ulrich Berendes, Ben Maman, Meinard Müller. 166-173
- Aligning Text-to-Music Evaluation With Human Preferences. Yichen Huang, Zachary Novack, Koichi Saito, Jiatong Shi, Shinji Watanabe 0001, Yuki Mitsufuji, John Thickstun, Chris Donahue. 174-181
- Investigating Music Track Liking in the Halo of Album Covers. Oleg Lesota, Anna Hausberger, Ivanna Pshenychna, Oleksandr Shvydanenko, Olha Yehorova, Markus Schedl. 182-189
- Phylo-Analysis of Folk Traditions: A Methodology for the Hierarchical Musical Similarity Analysis. Hilda Romero-Velo, Gilberto Bernardes, Susana Ladra, José R. Paramá, Fernando Silva-Coira. 190-197
- dPLP: A Differentiable Version of Predominant Local Pulse Estimation. Ching-Yu Chiu, Sebastian Strahl, Meinard Müller. 198-205
- PeakNetFP: Peak-Based Neural Audio Fingerprinting Robust to Extreme Time Stretching. Guillem Cortès-Sebastià, Benjamin Martin 0001, Emilio Molina, Xavier Serra, Romain Hennequin. 206-214
- Generating Symbolic Music From Natural Language Prompts Using an LLM-Enhanced Dataset. Weihan Xu, Julian J. McAuley, Taylor Berg-Kirkpatrick, Shlomo Dubnov, Hao-Wen Dong. 215-222
- A Survey on Vision-to-Music Generation: Methods, Datasets, Evaluation, and Challenges. Zhaokai Wang, Chenxi Bao, Le Zhuo, Jingrui Han, Yang Yue, Yihong Tang, Victor Shea-Jay Huang, Yue Liao. 223-234
- Emergent Musical Properties of a Transformer Under Contrastive Self-Supervised Learning. Yuexuan Kong, Gabriel Meseguer-Brocal, Vincent Lostanlen, Mathieu Lagrange, Romain Hennequin. 235-246
- Are You Really Listening? Boosting Perceptual Awareness in Music-QA Benchmarks. Yongyi Zang, Sean O'Brien, Taylor Berg-Kirkpatrick, Julian J. McAuley, Zachary Novack. 247-261
- GD-Retriever: Controllable Generative Text-Music Retrieval With Diffusion Models. Julien Guinot, Elio Quinton, George Fazekas. 262-270
- Towards Robust Automatic Music Transcription By Measuring Cross-Version Consistency. Yannik Venohr, Yiwei Ding, Christof Weiß. 271-278
- Beyond Genre: Diagnosing Bias in Music Embeddings Using Concept Activation Vectors. Roman B. Gebhardt, Arne Kuhle, Eylül Bektur. 279-286
- LiLAC: A Lightweight Latent ControlNet for Musical Audio Generation. Tom Baker, Javier Nistal. 287-295
- What Song Now? Personalized Rhythm Guitar Learning in Western Popular Music. Zakaria Hassein-Bey, Yohann Abbou, Alexandre D'Hooge, Mathieu Giraud, Gilles Guillemain, Aurélien Jeanneau. 296-302
- Universal Music Representations? Evaluating Foundation Models on World Music Corpora. Charilaos Papaioannou, Emmanouil Benetos, Alexandros Potamianos. 303-311
- A Theoretical Model of Musical Form. Martin Rohrmeier. 312-319
- Towards Human-in-the-Loop Onset Detection: A Transfer Learning Approach for Maracatu. António Pinto. 320-327
- Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning. Yixiao Zhang 0002, Yukara Ikemiya, Woosung Choi, Naoki Murata, Marco A. Martínez Ramírez, Liwei Lin, Gus Xia, Wei-Hsiang Liao 0001, Yuki Mitsufuji, Simon Dixon. 328-336
- TOMI: Transforming and Organizing Music Ideas for Multi-Track Compositions With Full-Song Structure. Qi He, Ziyu Wang 0008, Gus Xia. 337-345
- Automatic Melody Reduction via Shortest Path Finding. Ziyu Wang 0008, Yuxuan Wu, Roger B. Dannenberg, Gus Xia. 346-353
- Expotion: Facial Expression and Motion Control for Multimodal Music Generation. Fathinah Asma Izzati, Xinyue Li, Gus Xia. 354-362
- When Voices Interleave: Timing Deviations in Six Performances of Telemann's Fantasias for Solo Flute. Patrice Thibaud, Mathieu Giraud, Yann Teytaut. 363-372
- Audio Synthesizer Inversion in Symmetric Parameter Spaces With Approximately Equivariant Flow Matching. Ben Hayes, Charalampos Saitis, György Fazekas. 373-381
- SLAP: Siamese Language-Audio Pretraining Without Negative Samples for Music Understanding. Julien Guinot, Alain Riou, Elio Quinton, George Fazekas. 382-390
- PianoBind: A Multi-Modal Joint Embedding Model for Pop-Piano Music. Hayeon Bang, Eunjin Choi, Seungheon Doh, Juhan Nam. 391-398
- Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification. Recep Oguz Araz, Guillem Cortès-Sebastià, Emilio Molina, Joan Serrà, Xavier Serra, Yuki Mitsufuji, Dmitry Bogdanov. 399-406
- Beyond Notation: A Digital Platform for Transcribing and Analyzing Oral Melodic Traditions. Jonathan Myers, Dard Neuman. 407-415
- CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following. Yinghao Ma, Siyou Li, Juntao Yu, Emmanouil Benetos, Akira Maezawa. 416-425
- Lose the Frames: Exact Metrics for More Responsible Music Structure Analysis Evaluations. Qingyang Xi, Brian McFee. 426-432
- Unifying Continuous and Discrete Compressed Representations of Audio. Marco Pasini, Stefan Lattner, George Fazekas. 433-441
- Improving BERT for Symbolic Music Understanding Using Token Denoising and Pianoroll Prediction. Jun-You Wang, Li Su 0004. 442-450
- Scaling Self-Supervised Representation Learning for Symbolic Piano Performance. Louis Bradshaw, Alexander Spangher, Honglu Fan, Stella Biderman, Simon Colton. 451-459
- The Rhythm In Anything: Audio-Prompted Drums Generation With Masked Language Modeling. Patrick O'Reilly, Julia Barnett, Hugo Flores García, Annie Chu, Nathan Pruyne, Prem Seetharaman, Bryan Pardo. 460-468
- Count the Notes: Histogram-Based Supervision for Automatic Music Transcription. Jonathan Yaffe, Ben Maman, Meinard Müller, Amit Bermano. 469-476
- Joint Transcription of Acoustic Guitar Strumming Directions and Chords. Sebastian Murgul, Johannes Schimper, Michael Heizmann. 477-483
- Enabling Empirical Analysis of Piano Performance Rehearsal With the Rach3 MIDI Dataset. Alia Morsi, Suhit Chiruthapudi, Silvan Peter, Ivan Pilkov, Laura Bishop, Akira Maezawa, Xavier Serra, Carlos Eduardo Cancino Chacón. 484-491
- From Discord to Harmony: Consonance-Based Smoothing for Improved Audio Chord Estimation. Andrea Poltronieri, Xavier Serra, Martín Rocamora. 492-502
- Keyboard Temperament Estimation From Symbolic Data: A Case Study on Bach's Well-Tempered Clavier. Peter van Kranenburg, Gerben Bisschop. 503-510
- Refining Music Sample Identification With a Self-Supervised Graph Neural Network. Aditya Bhattacharjee, Ivan Meresman Higgs, Mark Sandler 0001, Emmanouil Benetos. 511-517
- Video-Guided Text-to-Music Generation Using Public Domain Movie Collections. Haven Kim, Zachary Novack, Weihan Xu, Julian J. McAuley, Hao-Wen Dong. 518-527
- PianoVAM: A Multimodal Piano Performance Dataset. Yonghyun Kim, Junhyung Park, Joonhyung Bae, Kirak Kim, Taegyun Kwon, Alexander Lerch 0001, Juhan Nam. 528-535
- LoopGen: Training-Free Loopable Music Generation. Davide Marincione, Giorgio Strano, Donato Crisostomi, Roberto Ribuoli, Emanuele Rodolà. 536-546
- Enhancing Music Recommender Systems With Multimedia Content: A Context-Aware Approach. Oleg Lesota, Veronica Clavijo, Attia Rizwani, Markus Schedl, Bruce Ferwerda. 547-554
- CultureMERT: Continual Pre-Training for Cross-Cultural Music Representation Learning. Angelos-Nikolaos Kanatas, Charilaos Papaioannou, Alexandros Potamianos. 555-564
- Adaptive Path of Prediction: An Unsupervised Method for Modeling Note-Level Informational Hierarchy of Polyphony. Xiaoxuan Wang, Martin Rohrmeier. 565-572
- Versatile Music-for-Music Modeling via Function Alignment. Junyan Jiang, Daniel Chin, Xuanjie Liu, Liwei Lin, Gus Xia. 573-581
- Understanding Performance Limitations in Automatic Drum Transcription. Philipp Weyers, Christian Uhle, Meinard Müller, Matthias Lang. 582-588
- High-Resolution Sustain Pedal Depth Estimation From Piano Audio Across Room Acoustics. Hanwen Zhang, Kun Fang, Ziyu Wang 0008, Ichiro Fujinaga. 589-595
- Investigating an Overfitting and Degeneration Phenomenon in Self-Supervised Multi-Pitch Estimation. Frank Cwitkowitz, Zhiyao Duan. 596-603
- Sheet Music Benchmark: Standardized Optical Music Recognition Evaluation. Juan C. Martinez-Sevilla, Joan Cerveto-Serrano, Noelia N. Luna-Barahona, Greg Chapman, Craig Sapp, David Rizo, Jorge Calvo-Zaragoza. 604-611
- Fx-Encoder++: Extracting Instrument-Wise Audio Effect Representations From Mixtures. Yen-Tung Yeh, Junghyun Koo, Marco A. Martínez Ramírez, Wei-Hsiang Liao 0001, Yi-Hsuan Yang, Yuki Mitsufuji. 612-622
- MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling. Jingjing Tang, Xin Wang 0037, Zhe Zhang, Junichi Yamagishi, Geraint A. Wiggins, George Fazekas. 623-630
- Playability Prediction in Digital Guitar Learning Using Interpretable Student and Song Representations. Manuel Müllerschön, Anssi Klapuri, Marcelo Rodriguez, Christian Cardin. 631-637
- Gregorian Melody, Modality, and Memory: Segmenting Chant With Bayesian Nonparametrics. Vojtech Lanz, Jan Hajic Jr. 638-646
- IdolSongsJp Corpus: A Multi-Singer Song Corpus in the Style of Japanese Idol Groups. Hitoshi Suda, Junya Koguchi, Shunsuke Yoshida, Tomohiko Nakamura, Satoru Fukayama, Jun Ogata. 647-654
- GOAT: A Large Dataset of Paired Guitar Audio Recordings and Tablatures. Jackson Loth, Pedro Sarmento, Saurjya Sarkar, Zixun Guo, Mathieu Barthet, Mark Sandler 0001. 655-662
- STAGE: Stemmed Accompaniment Generation Through Prefix-Based Conditioning. Giorgio Strano, Chiara Ballanti, Donato Crisostomi, Michele Mancusi, Luca Cosmo, Emanuele Rodolà. 663-670
- Do Music Source Separation Models Preserve Spatial Information in Binaural Audio? Richa Namballa, Agnieszka Roginska, Magdalena Fuentes. 671-678
- Estimating Musical Surprisal From Audio in Autoregressive Diffusion Model Noise Spaces. Mathias Rose Bjare, Stefan Lattner, Gerhard Widmer. 679-687
- Improving Neural Pitch Estimation With SWIPE Kernels. David Marttila, Joshua D. Reiss. 688-695
- Optical Music Recognition of Jazz Lead Sheets. Juan Carlos Martinez-Sevilla, Francesco Foscarin, Patricia García-Iasci, David Rizo, Jorge Calvo-Zaragoza, Gerhard Widmer. 696-702
- Human Vs. Machine: Comparing Selection Strategies in Active Learning for Optical Music Recognition. Juan Pedro Martinez-Esteso, Alejandro Galán-Cuenca, Carlos Pérez-Sancho, Francisco J. Castellanos 0001, Antonio Javier Gallego 0001. 703-709
- Assessing the Alignment of Audio Representations With Timbre Similarity Ratings. Haokun Tian, Stefan Lattner, Charalampos Saitis. 710-718
- Simple and Effective Semantic Song Segmentation. Filip Korzeniowski, Richard Vogl. 719-726
- MusGO: A Community-Driven Framework for Assessing Openness in Music-Generative AI. Roser Batlle-Roca, Laura Ibáñez-Martínez, Xavier Serra, Emilia Gómez, Martín Rocamora. 727-738
- A Fourier Explanation of AI-Music Artifacts. Darius Afchar, Gabriel Meseguer-Brocal, Kamil Akesbi, Romain Hennequin. 739-746
- Modeling the Difficulty of Saxophone Music. Simon Librický, Jan Hajic Jr. 747-754
- The Jam_bot, a Real-Time System for Collaborative Free Improvisation With Music Language Models. Lancelot Blanchard, Perry Naseck, Stephen Brade, Kimaya Lecamwasam, Jordan Rudess, Cheng-Zhi Anna Huang, Joseph A. Paradiso. 755-762
- Fretboardflow: A Dual-Model Approach to Optimize Chord Voicings on the Guitar Fretboard. Marcel A. Vélez Vásquez, Mariëlle Baelemans, Jonathan Driedger, John Ashley Burgoyne. 763-770
- The Florence Price Art Song Dataset and Piano Accompaniment Generator. Tao-Tao He, Martin E. Malandro, Douglas Shadle. 771-778
- Adding Temporal Musical Controls on Top of Pretrained Generative Models. Sarah Nabi, Nils Demerlé, Geoffroy Peeters, Frédéric Bevilacqua, Philippe Esling. 779-786
- Quantize & Factorize: A Fast Yet Effective Unsupervised Audio Representation Without Deep Learning. Jaehun Kim, Matthew C. McCallum, Andreas F. Ehmann. 787-796
- Identification and Clustering of Unseen Ragas in Indian Art Music. Parampreet Singh, Adwik Gupta, Aakarsh Mishra, Vipul Arora 0001. 797-804
- MAIA: An Inpainting-Based Approach for Music Adversarial Attacks. Yuxuan Liu, Peihong Zhang, Rui Sang, Zhixin Li, Shengchen Li. 805-812
- Joint Object Detection and Sound Source Separation. Sunyoo Kim, Yunjeong Choi, Doyeon Lee, Seoyoung Lee, Eunyi Lyou, Seungju Kim, Junhyug Noh, Joonseok Lee. 813-820
- User-Guided Generative Source Separation. Yutong Wen, Minje Kim, Paris Smaragdis. 821-829
- Singing Voice Separation From Carnatic Music Mixtures Using a Regression-Guided Latent Diffusion Model. Genís Plaja-Roglans, Xavier Serra, Martín Rocamora. 830-838
- Looking Beyond Averaged Metrics in Music Source Separation. Saurjya Sarkar, Victoria Moomijan, Basil Woods, Emmanouil Benetos, Mark Sandler 0001. 839-846
- Barwise Section Boundary Detection in Symbolic Music Using Convolutional Neural Networks. Omar Eldeeb, Martin E. Malandro. 847-854