Conference Paper, Publication »

[14 Jul 2014 | Comments Off | 191 views]
Abstract. This paper presents a comparative study of classification performance in automatic audio chord recognition based on three chroma feature implementations, with the aim of distinguishing effects of frame size, instrumentation, and choice of chroma feature. Research in automatic chord recognition has to date focused on the development of complete systems. While results have remarkably improved, the understanding of the error sources remains low. In order to isolate sources of chord recognition error we create a corpus of artificial instrument mixtures and investigate (a) the influence of different chroma frame sizes and (b) the impact of instrumentation and pitch height. We show that recognition performance i…

Seen and Liked »

[14 Jul 2014 | Comments Off | 201 views]
I went to a talk on the Science of Singing by David Howard recently. He’s a fascinating talker, a bit self-indulgent at times, but highly entertaining. He talked a lot about singing, and a lot of it I already knew, but there are two things I took away which I’d like to share. First, the reason why you should drink plenty (water) when singing — yes, it’s so your voice stays nicely “lubricated”, but what I had not appreciated is this: the lubrication does not work locally, i.e. it’s not that the water going past your vocal cords keeps them in shape. Instead, your body keeps a global water household and distributes it where it’s needed most. The problem is that the voice comes pretty far down the lis…

from me to you »

[4 Jun 2014 | Comments Off | 444 views]
[Edit: the survey is now closed.] You could help us improve pitch and note annotation tools by filling in a short survey. If you do music or speech research related to pitch and notes in audio, then you are probably aware of software to aid the manual creation of (ground truth) annotations. We (QMUL and NYU) are developing open source software to simplify the (monophonic) pitch annotation process — and we would like to get it right! So we were wondering: what software is currently used by you, the professionals, and what do you love/hate about it? It would be great if you could tell us in our 7-question mini-survey here. Thank you so much in advance!

Done and Liked, Featured, Journal Paper, Publication »

[19 Mai 2014 | Comments Off | 376 views]
http://schall-und-mauch.de/artificialmusicality/wp-content/uploads/2014/05/happy_birthday.png The preprint of our singing intonation paper is now available! Enjoy! Abstract:
This paper presents a study on intonation and intonation drift in unaccompanied singing and proposes a simple model of reference pitch memory that accounts for many of the effects observed. Singing experiments were conducted with 24 singers of varying ability under 3 conditions (Normal, Masked, Imagined). Over the duration of a recording, approximately 50 seconds, a median absolute intonation drift of 11 cents was observed. While smaller than the median note error (19 cents), drift was significant in 22% of recordings. Drift magnitude did not correlate with other measures of singing accuracy, singing experience or with the presence of conditio…

Done and Liked »

[29 Apr 2014 | Comments Off | 1,493 views]
Well, it’s not really news, but I thought I might say it again, since I hadn’t really written a dedicated blog post on Segmentino: Segmentino segments songs into segments. Also I finally got round to making a lil Segmentino page on Isophonics: http://www.isophonics.net/segmentino. The repository, software builds and all can still be found on the Segmentino page on SoundSoftware. That’s it. Check it out!

Done and Liked »

[11 Apr 2014 | Comments Off | 600 views]
I had this code lying around for a long time, and nothing happened to it, so I thought I might just as well give it away so that people can try it out. It is based on my thesis work and employs the Dynamic Bayesian Network (DBN) described in this paper and my PhD thesis). It is not a masterpiece of software engineering and it uses unspeakable amounts of memory (songs of about 6 minutes can use >10GB!, but your standard 4 minute pop song should be doable on today’s laptops). But it still works. You need Matlab to run it, and the matlab binary should be in your path. To test, run ./doChordID-osx.sh testFileList.txt testout/ on OS/X or ./doChordID.sh testFileList.txt testout/. So for people who missed the first link, the code can be downloade…

Done and Liked, Featured »

[3 Apr 2014 | Comments Off | 585 views]
http://schall-und-mauch.de/artificialmusicality/wp-content/uploads/2014/04/screen-shot-2014-04-03-at-140203.png Today I presented Tony, the melody annotation software developed by Chris Cannam, George Fazekas and myself, at this year’s Conference of the Society for Education, Music and Psychology Research. Excitingly, this is now the much improved version 0.5, and Chris managed to put new builds of Tony up on the project’s Download page. I had a relatively small audience, maybe 25 people, but reception was generally good, with some people being very excited. Not sure what this means. I hope some people will start using Tony soon. I uploaded my slides here. The conference itself is different from what I’m used to usually. Obviously, it’s focused on education, and there’s actually quite a lot of technology going on. So it’s quite funny to be at a music technology conference where there is essentially no-one I know! Thanks to Chris and George, but also, for their invaluable input: Rachel Bittner, Justin Salamon and Juan Pablo Bello at NYU.…

Conference Paper, Publication »

[28 Jan 2014 | Comments Off | 449 views]
Abstract: We introduce the Audio Degradation Toolbox (ADT) for the controlled degradation of audio signals, and propose its usage as a means of evaluating and comparing the robustness of audio processing algorithms. Music recordings encountered in practical applications are subject to varied, sometimes unpredictable degradation. For example, audio is degraded by low-quality microphones, noisy recording environments, MP3 compression, dynamic compression in broadcasting or vinyl decay. In spite of this, no standard software for the degradation of audio exists, and music processing methods are usually evaluated against clean data. The ADT fills this gap by providing Matlab scripts that emulate a wide range of degrada…

Conference Paper, Publication »

[28 Jan 2014 | Comments Off | 423 views]
Abstract: In the past decade, non-negative matrix factorisation (NMF) and probabilistic latent component analysis (PLCA) have been used widely in automatic music transcription. Despite their successes, these methods only guarantee that the decomposition converges to a local minimum in the cost function. In order to find better local minima, we propose to extend an existing PLCA-based transcription method with the deterministic annealing EM (DAEM) algorithm. The PLCA update rules are modified by introducing a “temperature” parameter. At higher temperatures, general areas of the search space containing good solutions are found. As the temperature is gradually decreased, distinctions in the data are sharpened, resu…

from me to you »

[7 Jan 2014 | Comments Off | 274 views]
Just a quick reminder that you can still apply for a funded PhD position in Signal Processing and Data Mining Tools for the Analysis of Musical Evolution, as previously posted. Job description is here (at jobs.ac.uk).