Music Information Retrieval (MIR)

Marc Groenewegen, HKU Music & Technology, 2025

MIR is a research field that tries to understand sound such that it can be described with certain features.

Levels of features

Physical and mathematical features

High-level/second order features

Techniques for feature extraction

YIN

https://www.youtube.com/watch?v=W585xR3bjLM

YINFFT

Cepstrum

Inverse FFT (or just FFT with different scaling) of log magnitude spectrum. The highest peak (excluding the DC component) indicates the fundamental frequency.

{ IFFT (log( FFT(f(t))) ^ 2) } ^ 2

AI

basic pitch

Applications

Libraries & tools

Paper: AN EVALUATION OF AUDIO FEATURE EXTRACTION TOOLBOXES --> tables on page 4 and 5 about feature extraction tools

Demos

Books

Meinard Mueller: "Fundamentals of Music Processing" (available in Mediatheek IBB-laan)

Research assignment

Using sources like websites, books and papers, see what MIR can mean to one of your projects. Can you use it as a controller? Try not to discard anything at first but keep your mind open for new insights. When you are ready to use a technique that is the time to do a reality check and see whether your ideas are feasible.

Some starters: