Index of /~echew/papers/ISMIR6

Icon  Name                    Last modified      Size  Description
[DIR] Parent Directory - [   ] cc-ismir2005.pdf 19-Oct-2005 09:43 390K [TXT] reference.txt 09-Aug-2005 02:19 414
In this directory are the PDF files for the paper titled

"Fuzzy Analysis in Pitch Class Determination for Polyphonic Audio Key Finding"
by Ching-Hua Chuan (chinghuc@usc.edu) and Elaine Chew (echew@usc.edu)

The results were presented at the 
Sixth International Conference on Music Information Retrieval
London, UK.  September 11-15, 2005.

The paper is published in the
Proceedings of the 6th ISMIR Conference
Click on reference.txt for the BibTeX reference.

The conference website is at 
http://ismir2005.ismir.net

THE COMPLETE PAPER, text with figures, can be viewed as a PDF document.
Click on cc-ismir2005.pdf if you wish to view the paper in PDF format.

--'--,--'--,--'--,--'--,--'--,--'--,--'--,--'--,--'--,--'--,--'--,--'--

"Fuzzy Analysis in Pitch Class Determination for Polyphonic Audio Key Finding"
by Ching-Hua Chuan (chinghuc@usc.edu) and Elaine Chew (echew@usc.edu)

ABSTRACT: This paper presents a fuzzy analysis technique for pitch
class determination that improves the accuracy of key finding from
audio information. Errors in audio key finding, typically incorrect
assignments of closely related keys, commonly result from imprecise
pitch class determination and biases introduced by the quality of the
sound. Our technique is motivated by hypotheses on the sources of
audio key finding errors, and uses fuzzy analysis to reduce the errors
caused by noisy detection of lower pitches, and to refine the biased
raw frequency data, in order to extract more correct pitch classes. We
compare the proposed system to two others, an earlier one employing
only peak detection from FFT results, and another providing direct key
finding from MIDI. All three used the same key finding algorithm
(Chewbnd the same 410 classical music pieces (ranging from Baroque to
Contemporary). Considering only the first 15 seconds of music in each
piece, the proposed fuzzy analysis technique outperforms the peak
detection method by 12.18% on average, matches the performance of
direct key finding from MIDI 41.73% of the time, and achieves an
overall maximum correct rate of 75.25% (compared to 80.34% for MIDI
key finding).

Keywords: audio key finding, pitch classes, fuzzy analysis, key proximity.