
BPM Measurement of Digital Audio by Means of Beat Graphs & Ray Shooting

Werner Van Belle1* - werner@yellowcouch.org, werner.van.belle@gmail.com

1- Yellowcouch;

Abstract:  In this paper we present a) a novel audio visualization technique, called beat-graphs, and b) a fully automatic algorithm to measure the mean tempo of a song with very high accuracy. The algorithm itself is an easily implementable offline search algorithm that looks for the tempo that best describes the song. For every investigated tempo, it steps through the song and compares the similarity of consecutive pieces of information (bass drum, hi-hat, ...). Its accuracy is twice as high as that of other fully automatic techniques, including Fourier analysis, envelope-spectrum analysis and autocorrelation.

Keywords:  bpmdj; meta data extraction; tempo measurement; beats per minute; BPM measurement
Reference:  Werner Van Belle; BPM Measurement of Digital Audio by Means of Beat Graphs & Ray Shooting; December 2000
See also:
Presentation given at the Chaos Communication Camp
BpmDj homepage
A number of strategies to speed up tempo measurement
Biasremoval from BPM Counters and an argument for the use of Measures per Minute instead of Beats per Minute [article | slides]
Basic Autodifferencing Explained


Table Of Contents
Introduction
Required Accuracy
Beat Graphs
Ray Shooting
        Algorithmic Performance
        Theoretical Accuracy
        Selectivity
Searching: Selective Descent
    Resampling
    Algorithm
    Performance
Experiments
    Setup
    Comparing against related work
        Peak Detectors
        Fourier Analysis of the Energy Envelope
        Autocorrelation
    Experiment
Results & Discussion
    Harmonics
    Results
    Key + Period = Tempo = Key/Period
Conclusion
Full Result Listing
Bibliography
Footnotes

Introduction

Automatically measuring the tempo of a digital soundtrack is crucial for DJ-programs such as BpmDJ [1, 2], and other similar software. The ability to mix two (or more) pieces of music, depends almost entirely on the availability of correct tempo information. All too often this information must be manually supplied by the DJ. This results in inaccurate tempo descriptions, which are useless from a programming point of view [3]. Therefore we will investigate a technique to measure the tempo of a soundtrack accurately and automatically. In the upcoming discussion we assume that the audio fragments that need to be mixed have a fixed1 and initially unknown tempo.

The tempo of a soundtrack is immediately related to the length of a repetitive rhythm pattern encoded within the audio-fragment. This rhythm pattern itself can be virtually anything: a 4/4 dance rhythm, a 3/4 waltz, a tango, salsa or anything else. We assume that we have no information whatsoever about this rhythm pattern.

This paper is structured in four parts. First we elaborate on the required tempo accuracy. Second, we explain our beat graph visualization technique. This technique flows naturally into the ray shooting algorithm, which is the third part. Last, we elaborate on our experiments and compare our technique with existing techniques.

Required Accuracy

Without accurate tempo information two songs will drift out of synchronization very quickly. This often results in a mix that degrades into chaos, which is not what we want. Therefore we must ask ourselves how accurately the tempo of a song should be measured to be useful. To express the required accuracy, we will calculate the maximum error on the tempo measurement that keeps the synchronization drift after a certain timespan below a certain value.

We assume that the tempo of the song is measured with an error $\epsilon $. If the exact tempo of the song is $f$ BPM then the measurement will report $f\pm\epsilon$ BPM. Such a wrong measurement leads to a drift $\gamma $ after a certain timespan $t$. The timespan we consider, is the time needed by the DJ to mix the songs. Typically this can be 30'', 60'' or even up to 2 minutes if (s)he wants to alternate between different songs. We will express the drift $\gamma $ in beats (at the given tempo), not in seconds. We chose not to use absolute time values because slow songs don't suffer as much from the same absolute drift as fast songs. E.g, a drift of 25 ms is barely noticeable in slow songs (E.g, 70 BPM), while such a drift is very disturbing for fast songs (E.g, 145 BPM). Therefore, we consider a maximum drift of $\frac{1}{b}$ beat, with $b$ being the note-divisor. For instance, we will calculate the required accuracy to keep the drift below $\frac{1}{8}$, $\frac{1}{16}$ and $\frac{1}{32}$ beat. Given the exact tempo $f$ (in BPM) and the measurement error $\epsilon $ (in BPM), we can now calculate the drift $\gamma $ (in beats) after timespan $t$ (in seconds).

To do so, we must first know how many beats are contained within the timespan $t$. Based on the standard relation between period and frequency2, this is $\frac{f.t}{60}$ beats. Because the tempo is measured with an accuracy of $\epsilon $ BPM, we get a drift (expressed in beats) of $\gamma=\frac{(f\pm\epsilon).t}{60}-\frac{f.t}{60}=\pm\frac{\epsilon.t}{60}$. To keep this drift below $\frac{1}{b}$ beat, we must keep $\left\vert\gamma\right\vert<\frac{1}{b}$. Because $\left\vert\gamma\right\vert=\frac{\epsilon.t}{60}$, we obtain a required accuracy of


\begin{displaymath}
\epsilon<\frac{60}{b.t}
\end{displaymath} (1)

Equation 1 describes how accurately one song should be measured to have a maximum possible drift of $\frac{1}{b}$ beats after a time of $t$ seconds. In practice, at least two songs are needed to create a mix. If all involved songs are measured with the same accuracy $\epsilon $, then the maximum drift after $t$ seconds will be the sum of the maximum drifts of all involved songs. From this, it becomes clear that we need to divide the required accuracy by the number of songs. Table 2 gives an overview of the required accuracies. As can be seen, an accuracy of 0.0313 BPM is essential, while an accuracy of 0.0156 BPM is comfortable.


Table 2: This table presents an overview of the required accuracy $\epsilon $ (in BPM) given a number of songs, a maximum allowable drift $\gamma $ (in beats) and a timespan $t$ (in seconds).
# songs $\gamma(\textrm{beat})$ 30 s 60 s 90 s 120 s
  $1/32$ 0.0625 0.0313 0.0208 0.0156
1 $1/16$ 0.125 0.0625 0.0416 0.0313
  $1/8$ 0.25 0.125 0.0833 0.0625
  $1/32$ 0.0313 0.0156 0.0104 0.0078
2 $1/16$ 0.0625 0.0313 0.0208 0.0156
  $1/8$ 0.125 0.0625 0.0417 0.0313
  $1/32$ 0.0208 0.0104 0.00693 0.0052
3 $1/16$ 0.0417 0.0208 0.0139 0.0104
  $1/8$ 0.0833 0.0417 0.0278 0.0208
  $1/32$ 0.0157 0.00783 0.0052 0.0039
4 $1/16$ 0.0313 0.0156 0.0104 0.00783
  $1/8$ 0.0625 0.0313 0.0208 0.0156
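
As a small illustrative sketch (the helper name and its use are ours, not part of BpmDj), equation 1 divided by the number of songs can be evaluated directly; the two calls below reproduce entries of table 2.

#include <cstdio>

// Required measurement accuracy (in BPM) from equation 1,
// divided by the number of songs involved in the mix.
// b = note divisor (8, 16 or 32), t = mixing timespan in seconds.
double required_accuracy(int b, double t, int songs)
{
  return 60.0 / (b * t) / songs;
}

int main()
{
  // Reproduces two entries of table 2: 2 songs, 1/32 beat drift.
  std::printf("%.4f\n", required_accuracy(32, 60, 2));   // 0.0156
  std::printf("%.4f\n", required_accuracy(32, 120, 2));  // 0.0078
}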


Beat Graphs

To explain our ray shooting technique we will first introduce the concept of beat-graphs. Not only can they help in manually measuring the tempo of a soundtrack, they also naturally extend to our ray-shooting technique.

Alien Pump by Tandu at 148.011 BPM. Lsd by Hallucinogen at 136.98 BPM.

Figure 1: Beat graphs of two songs.

A beat graph visualizes an audio-stream (denoted as a series of samples $a_{i}$) under a certain period $p$. In the remainder of the text we will assume that we are investigating an audio-fragment $a_{i}$. This fragment has a length of $n$ samples and is sampled at a sampling rate $r$. If not specified, $r$ will be considered to be the standard 44100 Hz. Given this notation, the beat graph $g$ visualizes the function

\begin{displaymath}
g(x,y,p)=\vert a_{x.p+y}\vert\end{displaymath}

Horizontally, the measures are enumerated, while vertically the content of one such measure is visualized. The color of a pixel at position $(x,y)$ is given by the absolute strength of the signal at time step $x.p+y$. The value $p$ is the period of the rhythm pattern, which typically contains 4 beats in a 4/4 key.
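
A minimal sketch of how such a beat graph could be filled, assuming the audio is available as a vector of samples (the function and variable names are ours):

#include <cmath>
#include <vector>

// Fill a beat-graph image for period p (in samples): column x holds
// the absolute sample values of the x-th measure, g(x,y,p) = |a[x*p+y]|.
std::vector<std::vector<double>>
beat_graph(const std::vector<double>& a, int p)
{
  int columns = static_cast<int>(a.size() / p);   // number of measures
  std::vector<std::vector<double>> g(columns, std::vector<double>(p));
  for (int x = 0; x < columns; ++x)
    for (int y = 0; y < p; ++y)
      g[x][y] = std::fabs(a[x * p + y]);          // pixel intensity
  return g;
}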

Figure 1 shows two beat graphs of Tandu and Posford [4, 5]. Reading a beat graph is best done from top to bottom and left to right. For instance, in the song Alien Pump [4] we see 4 distinct horizontal lines (numbers 1, 2, 3 and 4). These are the 'beats' of the song. The top line covers all the beats on the first note in every measure, the second horizontal line covers all the beats at the second note within a measure, and so on. The vertical white strokes that break the horizontal lines (C, D and E) are breaks within the music: passages without bass drum or another notable signal. Because the lines are distinctively horizontal we know that the period is correct. If the lines in the beat graph are slanted, such as in Lsd [5], then the period (and thus the tempo) is slightly incorrect. In this case, the line is slanted upwards, which means that the period is too short (the beat on the next vertical line comes before the beat on the current line), and thus that the tempo is a bit too high. By looking at the direction of the lines in a beat graph we can relate the actual tempo to the visualized tempo.

X-Files by Chakra & Edi Mis at 139.933 BPM. Anniversary Waltz by Status Quo at 172.614 BPM.

Figure 2: Beat graphs of two songs.

The beat graph does not necessarily display only straight lines. Two examples of this are given in figure 2. The first example is the song X-Files [6]. In the beginning the tempo line bends upward, indicating that the song is brought down from a higher tempo. After a break the tempo remains constant. Another example is the song Anniversary Waltz [7], in which we clearly see a drummer who drifts around a target tempo.

Our beat graph visualization technique offers some important advantages. First, the visualization function is very easy to implement and can be calculated quickly. This is in stark contrast with voice prints, which require extensive mathematical analysis and offer no immediately useful tempo information. Second, the beat graph contains rhythm information. From the viewpoint of the DJ, all the necessary information is present. The temporal organization can be recognized by looking at the picture. E.g, we can easily see how the song Alien Pump [4] contains beats at notes 1, 2, 3, 4 and 4$\frac{1}{2}$ (position A in the picture). After a while the extra beat at the end of a measure is shifted forward in time by half a measure (position B). Similarly, breaks and tempo changes can be observed (C, D and E). Third, as a DJ-tool it offers a visualization that can be used to align the tempo immediately. E.g, by drawing a line on the beat-graph, as done in Lsd [5], it is possible to recalculate the graph with an adjusted period. Not only will this introduce accurate tempo information, it also introduces the information necessary to find the start of a measure. This makes it easy to accurately and quickly place cue-points.

We will now further investigate how we can fully automatically line up the beat graph. If we are able to do so then we are able to calculate the tempo of a song automatically.

Ray Shooting

Beat graphs give us the possibility to visualize the tempo information within a song. We will now use this to search for the best possible visualization of the beat graph. An optimal visualization will give rise to as much 'horizontality' as possible. 'Horizontality' is measured by comparing every vertical slice (every measure) of the beat graph with the previous slice (the previous measure). Through accumulation of the differences between the consecutive slices we obtain a number that is large when little horizontality is present and small when a lot of horizontality is present. Formally, we measure

\begin{displaymath}
ray(p)=\sum_{x=1}^{\frac{n}{p}}\sum_{y=0}^{p}\vert g(x,y,p)-g(x-1,y,p)\vert
\end{displaymath} (2)

The use of the absolute value in the above equation is necessary to accumulate the errors. If it were not present, a mismatch between two measures at position $x_{1}$ could be compensated for by a good match between two measures at another position $x_{2}$.

Equation 2 can be written more simply by expanding $g(x,y,p)$:


\begin{displaymath}
ray(a,p)=\sum_{i=p}^{n}\left\vert\left\vert a_{i}\right\vert-\left\vert a_{i-p}\right\vert\right\vert
\end{displaymath} (3)
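
A direct sketch of equation 3 in C++ (the function name is ours; BpmDj's actual implementation may differ):

#include <cmath>
#include <vector>

// Equation 3: accumulated difference between each sample and the
// sample one period p earlier, taken on absolute values.
double ray(const std::vector<double>& a, int p)
{
  double error = 0.0;
  for (std::size_t i = p; i < a.size(); ++i)
    error += std::fabs(std::fabs(a[i]) - std::fabs(a[i - p]));
  return error;
}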

The problem we face now is to find the period $p$ for which $ray(p)$ is the smallest. $p$ must be located within a certain range, for instance between the corresponding periods of 80 BPM and 160 BPM. Once this period is found, we can convert it back to its tempo. Converting a period to BPM is done as follows.


\begin{displaymath}
bpm(p):=\frac{60.M.r}{p}\qquad period(f):=\frac{60.M.r}{f}
\end{displaymath} (4)

In the above conversion rules, $M$ denotes the number of beats within the rhythm pattern. This typically depends on the key of the song. For now, we will assume that every measure contains 4 beats; later on we will come back to this and explain how this number can also be measured automatically. Using the above equation we can define the period range in which we are looking for the best match. We will denote this range $[p_{0}:p_{1}]$ with $p_{0}$ the smallest period (thus the highest tempo) and $p_{1}$ the largest period (thus the lowest tempo). Given these two boundaries, we can calculate the period of a song as follows


\begin{displaymath}
period\,\textrm{is song period}\iff ray(period)=min\{ ray(p)\,\vert\, p\in[p_{0}:p_{1}]\}\end{displaymath}
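
As a concrete illustration of equation 4, with $M=4$ beats per measure and $r=44100$ Hz (the values used throughout this paper):

\begin{displaymath}
period(140)=\frac{60.4.44100}{140}=75600\textrm{ samples}\qquad bpm(75600)=\frac{60.4.44100}{75600}=140\textrm{ BPM}
\end{displaymath}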

A possible optimization to this search process is the introduction of clipping values: when, during the calculation of one of the rays, the current ray accumulates an error larger than the smallest ray so far (the clipping value), it can simply stop and return the clipping value:


\begin{displaymath}
ray(a,p,c)=min(c,\sum_{i=p}^{n}\left\vert\left\vert a_{i}\right\vert-\left\vert a_{i-p}\right\vert\right\vert)
\end{displaymath} (5)

Because this ray function is limited to the computational complexity of the last best match, its calculation time can, as the number of scans increases, only decrease. The search algorithm can then be written down as


for( $min:=+\infty,\, i:=0$;  $i<w$;  $i:=i+1$)

     $match:=ray(audio,p_{0}+i,min)$

    if $match<min$

        $min:=match$

         $tempo:=period(p_{0}+i)$
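
A sketch of this brute-force search with clipping, in the same spirit as equation 5 (the names ray_clipped and best_tempo are ours):

#include <cmath>
#include <limits>
#include <vector>

// Equation 5: as soon as the accumulated error exceeds the clipping
// value c, the ray can no longer become the best match and we stop.
double ray_clipped(const std::vector<double>& a, int p, double c)
{
  double error = 0.0;
  for (std::size_t i = p; i < a.size(); ++i) {
    error += std::fabs(std::fabs(a[i]) - std::fabs(a[i - p]));
    if (error >= c) return c;
  }
  return error;
}

// Brute-force search over the period window [p0, p0+w[.
double best_tempo(const std::vector<double>& a, int p0, int w,
                  double M = 4, double r = 44100)
{
  double min = std::numeric_limits<double>::infinity();
  double tempo = 0;
  for (int i = 0; i < w; ++i) {
    double match = ray_clipped(a, p0 + i, min);
    if (match < min) {
      min = match;
      tempo = 60.0 * M * r / (p0 + i);   // equation 4
    }
  }
  return tempo;
}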

Algorithmic Performance

The time needed to calculate the value of $ray(p)$ is $n-p$. Because the period $p$ is often negligibly small in comparison to the entire length of a song, we omit it. Hence, $ray$ takes $O(n)$ time to finish. To find the best match in a certain tempo range by doing a brute force search we need to shoot $p_{1}-p_{0}$ rays. We will denote the size of this window $w$. The resulting time estimate is accordingly $O(n.w)$. This performance estimate is linear, which might look appealing; however, the size of the window is typically a very large constant, so this estimate should be handled with care. Nevertheless, the real strength of our approach comes from its extremely high accuracy and its modular nature.

Theoretical Accuracy

Our algorithm can verify any period above 0 and below $\frac{n}{2}$. This means that it can distinguish which of two neighboring periods $p$ and $p+1$ is the better match. Therefore, we will calculate the accuracy $\epsilon $ by measuring the smallest distinguishable tempo difference. Given a certain tempo, we will convert it to its period and then assume that we have measured the period wrongly with the least possible error of 1 sample. The two resulting periods are converted back to BPM and the difference between these two equals the theoretical accuracy. Suppose the tempo of the song is $f$, then the period of this frequency is $period(f)$. The closest distinguishable next period is $period(f)+1$. After converting these two periods back to their tempos we get $bpm(period(f)+1)$ and thus an accuracy of $f-bpm(period(f)+1)$. Expansion of this expression results in

\begin{displaymath}
\epsilon(f)=f-\frac{60.M.r}{\frac{60.M.r}{f}+1}=\frac{f^{2}}{60.M.r+f}
\end{displaymath}


Table 3: Accuracy of the ray shooting technique. Horizontally the measured tempo is set out, vertically the accuracy expressed in BPM is shown. The full line is the accuracy of the algorithm using measures of 4 beats, the dotted line is the accuracy of the algorithm using measures of 8 beats.

Table 3 presents an overview of the accuracies we might obtain with this technique. As can be seen, even the least accurately measurable tempo (170 BPM) can be distinguished with an accuracy of 0.00273 BPM, which is still 11 times more accurate than the required accuracy of 0.0313 BPM. For lower tempos this even increases to 67.4 times better than the required accuracy! It should be noted that this is a theoretical accuracy. Later on we will experimentally validate the practical accuracy.
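
For instance, at the least accurate end of the range (170 BPM), with $M=4$ and $r=44100$ Hz, the expression above evaluates to

\begin{displaymath}
\epsilon(170)=\frac{170^{2}}{60.4.44100+170}=\frac{28900}{10584170}\approx0.00273\,\textrm{BPM}
\end{displaymath}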

Selectivity

The algorithm as presented allows us to verify the presence of one frequency in $O(n)$. This not only makes it easy to parallelize the algorithm, it also opens the possibility to actively search for matching frequencies while neglecting large parts of the search space. This of course requires a suitable search algorithm. In the following section we shed light on one search approach that has proven to be very fast and still highly accurate.

Searching: Selective Descent

As explained in the previous section, verifying every possible frequency by doing a brute force search would take too much time. Nevertheless, the algorithm as presented here allows the verification of single frequencies very accurately and quickly. This has led to an algorithm that first does a quick scan of the entire frequency spectrum and then selects the most important peaks to investigate in further detail.

To skip large chunks of the search space, we use an incremental approach. First, the algorithm does a coarse grained scan that takes into account 'energy' blocks of information (let's say with a block size of $2^{x}$). From the obtained results, it will decide which areas are the most interesting to investigate in further detail. This is done by calculating the mean error of all measured positions. Everything below the mean error will be investigated in the next step. We repeat this process with a smaller block size of $2^{x-1}$ until the block size becomes 1.

The problem with this approach is that it might very well overlook steep valleys and consequently find and report a completely wrong period/tempo. To avoid this we flatten the $ray(...)$ function by resampling the input data to match the required block size.

Resampling

To avoid a strongly oscillating $ray$ function, which would make a selective descent very difficult, we will, in every iteration step, resample the audio stream to the correct frequency. This resampling however is very delicate. By simply taking the mean of the data within a block, we would throw away valuable information. In fact, we would throw away everything that cannot be represented at the given sampling rate. E.g, if we have a block size of 1024 then the final sampling rate will be 43 Hz. Such low sampling rates can only describe the absolute low frequencies up to 21.5 Hz [8]. Note that such a down sampling not only throws away harmonics (as a down sampling from 44100 Hz to 22050 Hz would do), but actually throws away entire pieces of useful information (like, for instance, hi-hats). Such a sampling rate conversion is clearly not what we are looking for. Therefore we have invented the following resampling algorithm.

With every step, the algorithm will first rescale the audio stream by shrinking every block of $2^{x}$ samples to 1 sample. This is done by taking the energy content of the block:


\begin{displaymath}
b=resample(a,s)\quad\iff\quad b_{i}=\sum_{j=i.s}^{i.s+s-1}\vert a_{j}\vert
\end{displaymath} (6)

As the cautious reader may observe, we do not take the square of the value $a_{j}$ as would be necessary to correctly describe the energy content of the block. The reason why we didn't is twofold. First, because this approach is faster and keeps the values within a reasonable bit range, hence remains accurate. Second, because we are concerned with signals representing musical content and the spectrum of music already follows a power distribution.
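
A straightforward sketch of equation 6 (again with our own naming):

#include <cmath>
#include <vector>

// Equation 6: shrink every block of s samples to one value holding
// the summed absolute amplitude ('energy' content) of the block.
std::vector<double> resample(const std::vector<double>& a, std::size_t s)
{
  std::vector<double> b(a.size() / s, 0.0);
  for (std::size_t i = 0; i < b.size(); ++i)
    for (std::size_t j = i * s; j < i * s + s; ++j)
      b[i] += std::fabs(a[j]);
  return b;
}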

Algorithm

With the knowledge of how to resample the audio fragment and how to search for the most prominent frequency, we can now write down the algorithm in full detail. In the algorithm below, $M$ is the number of beats that are supposed to be in one measure, $p_{0}=period(stopbpm)$ and $p_{1}=period(startbpm)$ (see equation 4). The window size $w$ is given by $p_{1}-p_{0}$. The resample function is described in equation 6 and the $ray$ function is described in equation 5. $m$ is the number of requested iterations; the maximum block size will then be $2^{m}$.


1.  $matches:=[+\infty,\ldots\textrm{w times}\ldots,+\infty]$

2.  $a:=$resample( $\textrm{input audio}$,  $2^{m}$)

3.  for($j:=0$;  $j<\frac{w}{2^{m}}$;  $j:=j+1$)

4.      $matches[j.2^{m}]:=$ray($a$,  $p_{0}+j$,  $+\infty$)

5.  for($i:=m-1$;  $i>0$;  $i:=i-1$)

6.      $a:=$resample( $\textrm{input audio}$,  $2^{i}$)

7.      for($j:=0$;  $j<\frac{w}{2^{i}}$;  $j:=j+1$)

8.          if  $j\textrm{ in between matches below mean}$

9.              $matches[j.2^{i}]:=$ray($a$,  $p_{0}+j$,  $+\infty$)

10. for( $minimum:=+\infty$,  $j:=0$;  $j<w$;  $j:=j+1$)

11.     if  $j\textrm{ in between matches below mean}$

12.         $match:=$ray( $\textrm{input audio}$,  $p_{0}+j$,  $minimum$)

13.         if $match<minimum$

14.             $tempo:=bpm(p_{0}+j)$

15.             $minimum:=match$
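
One possible C++ reading of this selective descent is sketched below. It assumes the ray, ray_clipped and resample helpers sketched earlier; details such as the exact pruning rule ("in between matches below mean") and the scaling of periods on resampled data are our interpretation, not necessarily BpmDj's.

#include <algorithm>
#include <limits>
#include <vector>

// Helpers sketched earlier in this text (equations 3, 5 and 6).
double ray(const std::vector<double>& a, int p);
double ray_clipped(const std::vector<double>& a, int p, double clip);
std::vector<double> resample(const std::vector<double>& a, std::size_t s);

// matches[j] holds the error measured so far for candidate period p0+j
// (infinity means: not measured, either untested or pruned).
double measure_tempo(const std::vector<double>& audio,
                     int p0, int w, int m,
                     double M = 4, double r = 44100)
{
  const double inf = std::numeric_limits<double>::infinity();
  std::vector<double> matches(w, inf);

  // Coarse-to-fine passes with block sizes 2^m .. 2^1.
  for (int i = m; i >= 1; --i) {
    int block = 1 << i;
    std::vector<double> a = resample(audio, block);

    // Mean of the errors measured in the previous, coarser pass.
    double sum = 0; int cnt = 0;
    for (double v : matches) if (v < inf) { sum += v; ++cnt; }
    double mean = cnt ? sum / cnt : inf;

    for (int j = 0; j < w; j += block) {
      if (i < m) {
        // Refine only candidates whose coarser neighbours scored below mean.
        int left  = (j / (2 * block)) * (2 * block);
        int right = std::min(left + 2 * block, w - 1);
        if (!(matches[left] < mean || matches[right] < mean)) continue;
      }
      matches[j] = ray(a, (p0 + j) / block);   // period in resampled units
    }
  }

  // Mean of the finest pass so far, used to prune the final scan.
  double sum = 0; int cnt = 0;
  for (double v : matches) if (v < inf) { sum += v; ++cnt; }
  double mean = cnt ? sum / cnt : inf;

  // Final pass on the full-resolution audio, with clipping (equation 5).
  double minimum = inf, tempo = 0;
  for (int j = 0; j < w; ++j) {
    if (!(matches[(j / 2) * 2] < mean)) continue;   // pruned region
    double match = ray_clipped(audio, p0 + j, minimum);
    if (match < minimum) { minimum = match; tempo = 60.0 * M * r / (p0 + j); }
  }
  return tempo;
}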

Performance

Estimating the performance of this algorithm is easy but requires some explanation. Let us assume that we start with a block size of $2^{m}$. This means that we will iterate $m+1$ steps.

The initial step, with a block size of $2^{m}$ will result in $\frac{w}{2^{m}}$ rays to be shot. The cost to shoot one ray depends on the audio size. After resampling this is $\frac{n}{2^{m}}$.

The following steps, except for the last, are similar. However, here it becomes difficult to estimate how many rays will be shot, because we take the mean error of the previous step and select only the region that lies below the mean error. How large this area is, is unknown. Therefore we will call the number of new rays divided by the number of old rays the rest-factor $q$. After one step, we get $q.\frac{w}{2^{m}}$ rays; after two steps this is $q^{2}\frac{w}{2^{m}}$ and by induction after $i$ steps the number of rays becomes $q^{i}\frac{w}{2^{m}}$. Every ray, after resampling to a block size of $2^{m-i}$, takes $\frac{n}{2^{m-i}}$ time, hence the time estimate for every $i^{th}$ step is $q^{i}\frac{w.n}{2^{m}2^{m-i}}$. We continue this process from $i=0$ until $i=m-1$.

The last step ($i=m$) is similar, except that we can do the ray shooting in half the expected time because we are looking for the minimum and no longer for the mean error. This results in $q^{m}\frac{w}{2^{m}}.\frac{n}{2}$.

The resampling cost at every step is $n$. After adding all steps together we obtain:


\begin{displaymath}
m.n+\sum_{i=0}^{m-1}\frac{w.n.q^{i}}{2^{2m-i}}+\frac{w.n.q^{m}}{2^{m+1}}\end{displaymath}

After some reductions, we obtain:


\begin{displaymath}
m.n+\frac{w.n}{2^{m}}\left(\sum_{i=0}^{m-1}\frac{q^{i}}{2^{m-i}}+\frac{q^{m}}{2}\right)\end{displaymath}

Depending on the values of $m$ and $q$ we obtain different performance estimates. We have empirically measured the value of $q$ under a window of [80 BPM:160 BPM]. We started our algorithm at a block size of $2^{8}$ and observed that the mean value of $q$ is 0.85. This results in an estimate of $142.35\, n$. In practice, of the 66150 possible rays to shoot, we only needed to shoot the equivalent of 142.35 rays to obtain the best match. So, we only searched 0.20313 % of the search space before obtaining the correct answer. This is a speedup of about 246.18 times in comparison with the brute force search algorithm.
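
Filling the measured values ($m=8$, $q=0.85$, $w=66150$) into the reduced expression indeed reproduces this figure:

\begin{displaymath}
8.n+\frac{66150.n}{2^{8}}\left(\sum_{i=0}^{7}\frac{0.85^{i}}{2^{8-i}}+\frac{0.85^{8}}{2}\right)\approx8.n+258.4\times0.52\, n\approx142.4\, n
\end{displaymath}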

Experiments

Setup

We have implemented this algorithm as a BPM counter for the DJ program BpmDj [1]. The program can measure the tempo of .MP3 and .OGG files. It is written in C++ and runs under Linux. The program can be downloaded from http://bpmdj.sourceforge.net/. The parameters the algorithm currently uses are: $M=4$, $p_{0}=period(160)$, $p_{1}=period(80)$. Of course these can be modified when necessary. The algorithm starts iterating at $m=8$ and stops at $i=2$ because a higher accuracy is simply not needed.

Rolling On Chrome by Aphrodelics [9], remixed by Kruder & Dorfmeister. Tainted Love by Soft Cell [10].

Figure 3: Tempo scanning of two songs.

Figure 3 shows the output it produces for two songs. Every color represents the results of one scanning iteration. The first scan is colored green, the last scan is colored red. The horizontal colored lines represent the position of the mean values for that scan. As can be seen, by using the mean value, large parts of the search space are quickly omitted.

Comparing against related work

To compare the accuracy of our technique with other measurement techniques, we also inserted two well-known BPM-measurement techniques into BpmDj. The first technique does a Fourier analysis of the audio envelope in order to find the most prominent frequency. The second technique calculates the autocorrelation of the audio sample, which can be obtained through its power density spectrum. Techniques that are not fully automatic have been omitted. Hence, we excluded techniques that make use of a database describing possible rhythms [11]. Below we briefly describe some other techniques.

Peak Detectors

Peak detectors [12] make use of Fourier analysis or specially tuned digital filters to spike at the occurrence of bass drums (frequencies between 2 and 100 Hz), hi-hats (above 8000 Hz) or other repetitive sounds. By analyzing the distance between the different spikes, they might be able to obtain the tempo. The problem with such techniques is that they are inherently inaccurate, and not only require detailed fine tuning of the filter-bank but afterwards still require a comprehension phase of the involved rhythm. Their inaccuracy comes from the problem that low frequencies (encoding a bass drum for instance) have a large period, which makes it difficult to position the sound accurately. In other words, the Fourier analysis will spike somewhere around the actual position of the beat, thereby making the algorithm inherently inaccurate. The second problem of peak detectors involves the difficulty of designing a set of peak-responses that will work for all songs. E.g, some songs have a hard and short bass-drum with a lot of body while other songs have a muffled, slowly decaying bass-drum. Tuning the algorithm such that it correctly detects the important sounds in almost all songs can be difficult. Finally, the last problem of peak detectors is that they often have no notion of the involved rhythm. E.g, a song with a drum line with beats at positions 1, 2, 3, 4 and $4\frac{1}{2}$ is notoriously difficult to understand because the last spike at position $4\frac{1}{2}$ will seriously lower the mean time between the different beats. This can be remedied through using a knowledge base of standard rhythms.

In comparison with peak detectors our technique is clearly superior because it is a) much easier to implement, b) free of a comprehension phase and c) much more accurate.

Fourier Analysis of the Energy Envelope

This technique obtains the energy envelope of the audio sample and then transforms it to its frequency representation. The most prominent frequency can be immediately related to the tempo of the audio fragment [13, 14]. From a performance point of view these techniques are much better than the algorithm we presented, especially if one makes use of the Goertzel algorithm [15]. The accuracy of these techniques can theoretically be as good as our technique, assuming that one takes a window-size that is large enough. E.g, to have an accuracy of 0.078 BPM at a sampling rate of 44100 Hz one needs at least a window size of $2^{25}$ samples, which is an audio-fragment of 12' 40''. When such a window-size is not available (because the audio-fragment is not that long), the envelope should be zero-padded up to the required length [15].
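
This window-size requirement follows from the frequency resolution of the discrete Fourier transform: with an $N$-point window at sampling rate $r$, neighbouring bins are $\frac{r}{N}$ Hz, i.e. $\frac{60.r}{N}$ BPM, apart. For $N=2^{25}$ and $r=44100$ Hz:

\begin{displaymath}
\frac{60\times44100}{2^{25}}\approx0.0789\,\textrm{BPM}\qquad\frac{2^{25}}{44100}\approx761\,\textrm{s}\approx12'40''
\end{displaymath}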

Autocorrelation

At first sight, our algorithm might seem similar to autocorrelation [15] because we compare the song with itself at different distances. However, this is only a superficial similarity. To understand this, we point out that the autocorrelation of a discrete audio signal $a$ is defined as $R(p)=\sum_{i=p}^{n}a_{i}.a_{i-p}$. As can be seen, the comparison between different slices is done by multiplying the different values. In our algorithm we only measure the distance between the absolute values, so there is no direct correspondence between ray shooting and autocorrelation. Nevertheless, autocorrelation has been applied to obtain the mean tempo of an audio fragment. The computation of $R(p)$ can be done quickly using two Fourier transforms. The process takes 3 steps. First the audio-fragment is transformed to its frequency-domain, then every sample is replaced with its squared norm and finally a backward Fourier transform is done. Formally, $R=F^{-1}(\vert F(h)\vert^{2})$.
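
For comparison, a direct (non-FFT) evaluation of $R(p)$ over a single candidate period could look as follows; this naive sketch (our own) is only meant to show the structural difference with the ray of equation 3.

#include <vector>

// Autocorrelation R(p) = sum_i a[i] * a[i-p], evaluated directly.
// Unlike ray(), a good match *maximizes* R(p), and slices are compared
// by multiplication instead of by the distance of absolute values.
double autocorrelation(const std::vector<double>& a, int p)
{
  double r = 0.0;
  for (std::size_t i = p; i < a.size(); ++i)
    r += a[i] * a[i - p];
  return r;
}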

Experiment

The experiment consisted of a selection of 150 techno songs. The tempo of each song has been manually measured through aligning their beat graphs. All songs had a tempo between 80 BPM and 160 BPM. Of them 7 had a drifting tempo. In these songs we have manually selected the largest straight part (such as done for the song X-Files [6]). Afterwards, we have run the BPM counter on all the songs again. For every song the counter reported the measurements using different techniques and the time necessary to obtain this value. The machine on which we did the tests was a Dell Optiplex GX1, 600 MHz running Debian GNU/Linux. The counter has been compiled with the Gnu C Compiler version 3.3.2 with maximum optimization, but without machine specific instructions.

Results & Discussion

Harmonics

The song Tainted Love [10] (see figure 3) is interesting because it shows 4 distinct spikes in the ray-shoot analysis. The spikes correspond to periods spanning 4, 5, 6 and 7 beats. The $4^{th}$ spike is the actual tempo with a measured period of 4 beats, which is a correlation between beat 1 and beat 5. The $3^{rd}$ spike occurs when the algorithm correlates the $1^{st}$ and the $6^{th}$ beat, hence calculates a tempo with measures of 5 beats, while it expects to find only 4 beats in a measure. This results in a tempo of 114.5 BPM. The $2^{nd}$ spike is similar, but now a correlation between the $1^{st}$ and the $7^{th}$ beat occurs. The first spike is a correlation between the $1^{st}$ beat and the $8^{th}$ beat. We see these distinct spikes because the song is relatively empty and monotone.

All techniques have a problem with estimating how many beats are located within one rhythmic pattern. When the reported tempo is a rational multiple of the actual tempo we call it an harmonic.
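
In general, if the best matching period spans $k$ beats while equation 4 assumes $M$ beats per measure, the reported period and tempo relate to the actual ones as

\begin{displaymath}
p_{reported}=\frac{k}{M}.p_{actual}\qquad f_{reported}=\frac{M}{k}.f_{actual}
\end{displaymath}

For Tainted Love, $k=5$ and $M=4$ give a period harmonic of $\frac{5}{4}$ and a reported tempo of $\frac{4}{5}$ of the actual tempo.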

Results

Table 4: Results. The analyzing speed is measured by dividing the song duration by the analyzing time.

The results of our experiment can be downloaded from http://bpmdj.sourceforge.net/bpmcompare.html or seen in the appendix.

As expected, the results contained a number of wrongly reported tempos due to harmonics. These have been detected and eliminated by simply rescaling the reported period by a factor $\frac{5}{4}$, $\frac{6}{4}$ or $\frac{7}{4}$. The autocorrelation technique reported an harmonic tempo in 26% of the cases, the ray-shooting technique in 17% and the Fourier analysis in 5%. Clearly the Fourier analysis is superior with respect to the problem of harmonics.

However, with respect to the accuracy of the measurement, our ray-shooting technique is best. It measures the mean tempo with a mean error of 0.0413 BPM, from which we conclude that our algorithm has a practical accuracy of 0.0340 BPM, which is close to the required accuracy of 0.0313 BPM. The autocorrelation technique has an accuracy of 0.0759 BPM. The Fourier analysis has an accuracy of 0.0903 BPM. The latter is consistent with reported accuracies. If we look at the speed of our algorithm, however, then it is clearly outperformed: both the autocorrelation and the Fourier analysis are between 2 and 4 times as fast as our ray-shooting technique.

Key + Period = Tempo = Key/Period

Our ray shooting technique finds the period of an unknown repetitive rhythm pattern. As observed it might sometimes report an harmonic of the actual tempo. This happens because we have always assumed that every rhythm pattern contains 4 beats (see equation 4). In practice this is not always the case for two reasons.

First, because not all songs are written in a 4/4 key, we can easily have a different number of beats within the rhythm pattern. E.g, a rhythm pattern with a length of 2 seconds, in a 4/4 key will have 4 beats, hence a tempo of $\frac{4\, beats}{2\, seconds}$, which is a tempo of 120 BPM. On the other hand, if the rhythm pattern is written in a 3/4 key, these 2 seconds will only have 3 beats, which leads to a tempo of 90 BPM.

The second reason why we cannot assume that every rhythm pattern will always contain 4 beats is that our algorithm might sometimes find rhythm patterns that have a period that is a factor different from the actual pattern. This happens mostly when the rhythm pattern itself is monotone and the song shows little rhythmic variation (as we have explained for the song ``Tainted Love'').

As observed from our results, the Fourier analysis has little problem with harmonics, but it is less accurate than our technique. Therefore, a combination of both techniques might yield a high accuracy without too many harmonic problems. Specifically, we could easily modify the algorithm such that it would, in its first iteration, fill the $matches$ array with the spectrum of the envelope. Formally, line 4 of our algorithm becomes $matches[j.2^{m}]:=F_{a}(j)$.

Conclusion

In this paper we have introduced our visualization technique, called beat graphs. A beat graph shows the structure of the music under a certain tempo. It shows rhythmic information, covering where breaks and beats are positioned, as well as tempo information. Beat graphs can be quickly drawn and are a valuable tool for the digital DJ, because they can also be used as a fingerprint of songs.

Based on these beat graphs, we have developed an offline algorithm to determine the tempo of a song fully automatically. The algorithm works by shooting rays through the digital audio and checking the similarities between consecutive slices within the audio. To speed up the process of finding the best matching ray, we presented an optimized search algorithm that must only search 0.2% of the entire search space.

The BPM counter works fully correctly in 82% of the cases. A further 17% are correctly measured but report a tempo that is an harmonic of the actual tempo. The remaining one percent is measured incorrectly. The accuracy of the measurement is 0.0340 BPM, which is enough to keep the drift of two songs below $\frac{1}{32}$ beat after 30'' (or to keep the drift below $\frac{1}{16}$ beat after 60''). Aside from being very accurate, our algorithm is also insensitive to breaks and gaps in the audio stream. The algorithm is 8.33 times faster than real time on a 600 MHz Dell Optiplex, without making use of processor specific features.

Full Result Listing

The table below contains 150 songs, all of which have been measured a) manually using beat-graphs, b) automatically using our ray-shooting technique, c) automatically using autocorrelation and d) automatically using a Fourier analysis of the audio envelope. For every song we have compared the measured tempo against the actual tempo. In a number of cases this tempo is a multiple of the actual tempo because the algorithm measures a period which contains a wrong number of actual beats, e.g. 5 beats, 7 beats, 3 beats and so on. Based on these harmonics we have rescaled the measurement and compared the reported tempo with the measured tempo. This denotes the measurement error. Only for the Fourier analysis of the envelope we have avoided doing so (this is reported in the column 'Before'). We did this because the Fourier analysis measures the most prominent frequency, not the best matching period. As such, the harmonics of this technique are often much more wrong than the harmonics of the two other techniques, and fixing the measurement of the Fourier technique might give a wrong impression. Therefore the 'Before' column only takes into account the songs that were actually measured correctly.


Column layout: song title; measured period in samples (Manual, Rayshoot, Correl, Envelop); reported harmonic factor (Rayshoot, Correl, Envelop); measurement error in BPM after fixing harmonics (Rayshoot, Correl, Envelop); measurement error in BPM before fixing harmonics (Envelop only, reported when no harmonic was present).

Title Manual Ray Cor Env Rayshoot Correl Envelop Rayshoot Correl Envelop Envelop
MonsterHit_2[Hallucinogen] 132300 132296 82675 66182 1 0.63 0.5 0 0.01 0.04
AndTheDayTurnedToNight_7[Hallucinogen] 128288 112268 112259 89777 0.88 0.88 0.75 0.01 0.01 5.92
TheHerbGarden[Hallucinogen] 109120 109120 109128 109120 1 1 1 0 0.01 0 0
God[]2 106264 106264 106259 106184 1 1 1 0 0 0.08 0.08
StillDreaming(AnythingCanHappen)[AstralProjection] 105944 105960 105964 70640 1 1 0.63 0.02 0.02 6.26
After1000Years[InfectedMushroom] 88204 88180 88188 88185 1 1 1 0.03 0.02 0.03 0.03
ElCielo[Spacefish] 86136 85948 86899 86147 1 1 1 0.27 1.08 0.02 0.02
NextToNothing[] 84888 84888 84887 75488 1 1 0.88 0 0 2
Skydiving[ManWithNoName] 82672 82624 82627 66182 1 1 0.75 0.07 0.07 8.08
3MinuteWarning[YumYum]2 79572 79548 79556 79607 1 1 1 0.04 0.03 0.06 0.06
Voyager3{PaulOakenFold_7}[Prana] 79048 118548 118555 79137 1.5 1.5 1 0.03 0.02 0.15 0.15
ElectronicUncertainty[Psychaos] 78900 78896 78891 78951 1 1 1 0.01 0.02 0.09 0.09
FromFullmoonToSunrise[] 78484 78484 78475 78489 1 1 1 0 0.02 0.01 0.01
Foxglove[Shakta] 78476 78480 78475 78306 1 1 1 0.01 0 0.29 0.29
Shakti[Shakta] 78468 78468 78467 78489 1 1 1 0 0 0.04 0.04
DawnChorus[ManWithNoName] 78392 97984 97980 78398 1.25 1.25 1 0.01 0.01 0.01 0.01
CowsBlues[DenshiDanshi] 78368 78368 78367 78398 1 1 1 0 0 0.05 0.05
KumbaMeta[DjCosmix&Etnica] 78064 78064 78063 78033 1 1 1 0 0 0.05 0.05
Teleport{PaulOakenfold_4}[ManWithNoName] 77944 116916 116911 77852 1.5 1.5 1 0 0.01 0.16 0.16
Vavoom[ManWithNoName] 77828 77832 116743 77852 1 1.5 1 0.01 0 0.04 0.04
Possession[TheDominatrix] 77828 77824 77823 77852 1 1 1 0.01 0.01 0.04 0.04
1998{MattDarey}[BinaryFinary] 77824 77804 77815 77672 1 1 1 0.03 0.02 0.27 0.27
AncientForest[Sungod]2 77820 77824 77823 77852 1 1 1 0.01 0.01 0.06 0.06
LunarJuice{Hallucinogen}[SlinkyWizard]2 77816 77816 77815 77852 1 1 1 0 0 0.06 0.06
MorphicResonance[Cosmosis] 77776 77776 77775 77762 1 1 1 0 0 0.02 0.02
TittyTwister[Viper2] 77332 77344 77332 77314 1 1 1 0.02 0 0.03 0.03
Cambodia[Prometheus] 77260 77236 77255 77225 1 1 1 0.04 0.01 0.06 0.06
InvisibleCities{Tristan}[BlackSun] 77256 96572 96571 77403 1.25 1.25 1 0 0 0.26 0.26
OpenTheSky[Transparent] 77256 77252 77252 77225 1 1 1 0.01 0.01 0.05 0.05
Bongraiders[Necton] 77244 77252 77252 77225 1 1 1 0.01 0.01 0.03 0.03
BinaryFinary[BinaryFinary] 76940 76944 96883 76871 1 1.25 1 0.01 1.01 0.12 0.12
AuroraBorealis[AstralProjection] 76776 76812 76004 76783 1 1 1 0.06 1.4 0.01 0.01
LiquidSunrise[AstralProjection]2 76772 76760 76776 76783 1 1 1 0.02 0.01 0.02 0.02
AllBecauseOfYou{Minesweeper}[UniversalStateOfMind] 76760 76760 76764 76783 1 1 1 0 0.01 0.04 0.04
Tranceport[PaulOakenfold] 76708 76704 76687 76783 1 1 1 0.01 0.04 0.13 0.13
LowCommotion[ManWithNoName] 76692 95860 95852 76695 1.25 1.25 1 0.01 0.02 0.01 0.01
VisionsOfNasca[AstralProjection]3 76692 115060 115056 76695 1.5 1.5 1 0.03 0.02 0.01 0.01
VisionsOfNasca[AstralProjection] 76688 115060 115056 76695 1.5 1.5 1 0.03 0.03 0.01 0.01
PsychedelicTrance[GoaRaume] 76688 76680 76683 76695 1 1 1 0.01 0.01 0.01 0.01
Spawn[EatStatic] 76164 95200 95200 75829 1.25 1.25 1 0.01 0.01 0.61 0.61
EyeTripping[ForceOfNature] 76164 76160 76136 75915 1 1 1 0.01 0.05 0.46 0.46
Fly[AstralProjection]3 76156 76168 76172 76173 1 1 1 0.02 0.03 0.03 0.03
TimeBeganWithTheUniverse{TheEndOfTime}[AstralProjection] 75944 75964 75963 75915 1 1 1 0.04 0.03 0.05 0.05
TimeBeganWithTheUniverse[AstralProjection]2 75944 75964 75963 75915 1 1 1 0.04 0.03 0.05 0.05
Joy[Endora] 75812 75820 75819 75829 1 1 1 0.01 0.01 0.03 0.03
Twisted[InfectedMushroom] 75700 75704 75703 75915 1 1 1 0.01 0.01 0.4 0.4
WalkOnTheMoon[TheAntidote] 75684 75684 75696 75658 1 1 1 0 0.02 0.05 0.05
Utopia[AstralProjection] 75680 75700 75699 75658 1 1 1 0.04 0.04 0.04 0.04
TheFeelings[AstralProjection] 75676 75664 75667 75915 1 1 1 0.02 0.02 0.44 0.44
MaianDream[AstralProjection]2 75676 75800 113195 75658 1 1.5 1 0.23 0.39 0.03 0.03
TajMahal[Microworld] 75676 75704 75679 75658 1 1 1 0.05 0.01 0.03 0.03
TranceDance[AstralProjection]3 75676 75680 113511 75658 1 1.5 1 0.01 0 0.03 0.03
DayDreamOne[TheDr] 75676 94592 94587 75658 1.25 1.25 1 0 0.01 0.03 0.03
Unknown1[AstralProjection]2 75668 75668 75671 75658 1 1 1 0 0.01 0.02 0.02
InNovation{originalmix}[AstralProjection]2 75668 75884 75892 75658 1 1 1 0.4 0.41 0.02 0.02
MagneticActivity[MFG] 75664 75656 75664 75658 1 1 1 0.01 0 0.01 0.01
Contact[EatStatic] 75656 75596 113496 75573 1 1.5 1 0.11 0.01 0.15 0.15
XFile[Chakra&EdiMis] 75632 75624 113455 75573 1 1.5 1 0.01 0.01 0.11 0.11
Kaguya[SunProject] 75608 75592 75595 75658 1 1 1 0.03 0.02 0.09 0.09
GetInTrance[DjMaryJane] 75608 75636 75615 75573 1 1 1 0.05 0.01 0.06 0.06
Soothsayer{LysurgeonWarningMix}[Hallucinogen] 75604 75596 75595 75573 1 1 1 0.01 0.02 0.06 0.06
SeratoninSunrise{Mvo}[ManWithNoName] 75604 75592 75607 75573 1 1 1 0.02 0.01 0.06 0.06
Upgrade[AphidMoon] 75600 75600 113400 75573 1 1.5 1 0 0 0.05 0.05
Cyanid[Colorbox]2 75596 75596 75595 75573 1 1 1 0 0 0.04 0.04
GodIsADj{AstralProjection}[Faithless]3 75592 75832 75824 75573 1 1 1 0.44 0.43 0.04 0.04
Amen[AstralProjection] 75584 76200 75559 75573 1 1 1 1.13 0.05 0.02 0.02
PsychedelicIndioCrystal[MultiFreeman] 75584 75576 75575 100764 1 1 1.38 0.01 0.02 4.4
SmallMoves[InfectedMushroom] 75584 75580 75583 75573 1 1 1 0.01 0 0.02 0.02
TheUltimateSin[Cosmosis] 75560 75556 75559 75573 1 1 1 0.01 0 0.02 0.02
Diablo[Luminus] 75536 75536 75535 75573 1 1 1 0 0 0.07 0.07
Cor[GreenNunsOfTheRevolution] 75508 75500 75503 75488 1 1 1 0.01 0.01 0.04 0.04
Laet[Fol] 75472 75472 75471 75488 1 1 1 0 0 0.03 0.03
MindEruption[SubSequence] 75144 75140 75140 75149 1 1 1 0.01 0.01 0.01 0.01
OverTheMoon[Phreaky] 75136 75140 75140 75149 1 1 1 0.01 0.01 0.02 0.02
Ionised{PaulOakenfold}[AstralProjection]2 75116 75112 75115 75065 1 1 1 0.01 0 0.1 0.1
Spacewalk[Shivasidpao] 75064 75044 75060 75065 1 1 1 0.04 0.01 0 0
DanceOfWitches{Video}[SunProject] 74860 74868 74859 74898 1 1 1 0.02 0 0.07 0.07
Track05[] 74628 74632 74631 74400 1 1 1 0.01 0.01 0.43 0.43
FlyingIntoAStar[AstralProjection] 74608 74660 74647 74565 1 1 1 0.1 0.07 0.08 0.08
KeyboardWidow[Psychaos] 74608 93268 93247 74648 1.25 1.25 1 0.01 0.02 0.08 0.08
PeopleCanFly{AlienProject}[AstralProjection] 74540 74540 74540 74482 1 1 1 0 0 0.11 0.11
DroppedOut[Zirkin&Bonky] 74536 130436 75123 74482 1.75 1 1 0 1.11 0.1 0.1
HFEnergy{Heaven}[AminateFx] 74532 74532 74532 74565 1 1 1 0 0 0.06 0.06
TimeBeganWithTheUniverse[AstralProjection]5 74532 74492 74500 74565 1 1 1 0.08 0.06 0.06 0.06
FreeReturn[Alienated] 74156 74324 130051 74400 1 1.75 1 0.32 0.31 0.47 0.47
InnerReflexion[AviAlgranatiAndBansi] 74096 74096 74096 74071 1 1 1 0 0 0.05 0.05
CosmicAscension[AstralProjection] 74084 73992 74007 74071 1 1 1 0.18 0.15 0.03 0.03
FullMoonDancer[Dreamscape]2 74024 74024 74024 73989 1 1 1 0 0 0.07 0.07
InvisibleContact[PigsInSpace] 74004 111016 111023 73989 1.5 1.5 1 0.01 0.02 0.03 0.03
GiftOfTheGods[Cosmosis]2 73992 73996 73996 73989 1 1 1 0.01 0.01 0.01 0.01
FluoroNeuroSponge_7[Hallucinogen]3 73764 129084 129080 73746 1.75 1.75 1 0 0.01 0.04 0.04
LetThereBeLight[AstralProjection] 73572 73592 73583 73584 1 1 1 0.04 0.02 0.02 0.02
ObsceneNutshell[LostChamber] 73568 73572 73572 73584 1 1 1 0.01 0.01 0.03 0.03
ElectricBlue[AstralProjection] 73500 73500 110252 73503 1 1.5 1 0 0 0.01 0.01
PsychedeliaJunglistia[CypherO] 73496 110220 73456 73343 1.5 1 1 0.03 0.08 0.3 0.3
LifeOnMars[AstralProjection]2 73496 73252 74215 73503 1 1 1 0.48 1.4 0.01 0.01
Nilaya[AstralProjection]3 73488 73484 73480 72865 1 1 1 0.01 0.02 1.23 1.23
OmegaSunset[Mantra]2 73464 73444 110223 73423 1 1.5 1 0.04 0.04 0.08 0.08
Eternal[Epik] 73220 73220 73224 73262 1 1 1 0 0.01 0.08 0.08
Paradise[AstralProjection]2 73176 73176 73175 73103 1 1 1 0 0 0.14 0.14
MadLane[Psychaos] 73076 73076 73080 73103 1 1 1 0 0.01 0.05 0.05
Tatra[Antartica] 73068 109592 109587 120266 1.5 1.5 1.63 0.01 0.02 1.84
OnlineInformation[ElectricUniverse] 73064 91320 73088 73023 1.25 1 1 0.02 0.05 0.08 0.08
AbsoluteZero_6[TotalEclipse]2 73064 73064 73067 73023 1 1 1 0 0.01 0.08 0.08
Ganesh[Sheyba] 73052 91312 91311 73023 1.25 1.25 1 0 0.01 0.06 0.06
SpiritualAntiseptic[Hallucinogen] 73004 73004 72999 73023 1 1 1 0 0.01 0.04 0.04
Freakuences[ElectricUniverse] 73004 109520 109500 73023 1.5 1.5 1 0.02 0.01 0.04 0.04
ShivaDevotional[Shivasidpao] 73004 73004 72999 73023 1 1 1 0 0.01 0.04 0.04
TechnoPrisoners[Phreaky]2 73004 127796 72528 73023 1.75 1 1 0.04 0.95 0.04 0.04
Deliverance[Butler&Wistler] 72996 109492 109491 73023 1.5 1.5 1 0 0 0.05 0.05
AstralPancake[Hallucinogen] 72996 72996 72995 73023 1 1 1 0 0 0.05 0.05
Viper[Viper] 72992 72952 109467 73023 1 1.5 1 0.08 0.03 0.06 0.06
TransparentFuture[AstralProjection]2 72988 72988 72999 73023 1 1 1 0 0.02 0.07 0.07
IWannaExpand[Cwhite] 72976 72976 127671 72944 1 1.75 1 0 0.04 0.06 0.06
GetOut[DarkEntity] 72972 72732 72752 72944 1 1 1 0.48 0.44 0.06 0.06
LSDance[Psysex] 72964 72916 72948 72944 1 1 1 0.1 0.03 0.04 0.04
OuterBreath[Aztec] 72924 127608 127603 72944 1.75 1.75 1 0.01 0.02 0.04 0.04
Shamanix_5[Hallucinogen]2 72608 72604 72603 72628 1 1 1 0.01 0.01 0.04 0.04
Shamanix[Hallucinogen] 72608 72604 72603 72628 1 1 1 0.01 0.01 0.04 0.04
Inspiration[Mfg] 72564 72564 72564 72550 1 1 1 0 0 0.03 0.03
Androids[TechnoSommy] 72564 72568 72568 72550 1 1 1 0.01 0.01 0.03 0.03
HypnoTwist[Tripster] 72556 72588 108840 72550 1 1.5 1 0.06 0.01 0.01 0.01
Schizophonic[XIS] 72512 72500 108752 72471 1 1.5 1 0.02 0.02 0.08 0.08
TheWorldOfTheAcidDealer[TheInfinityProject] 72496 72492 72312 72471 1 1 1 0.01 0.37 0.05 0.05
Vertige2[GoaGil] 72480 90584 90603 72628 1.25 1.25 1 0.03 0 0.3 0.3
Zwork[SunProject] 72460 72472 126736 72471 1 1.75 1 0.02 0.08 0.02 0.02
Exposed[CatOnMushrooms] 72084 72088 72088 72082 1 1 1 0.01 0.01 0 0
TouchTheSon[Sundog]2 72004 72000 71996 72005 1 1 1 0.01 0.02 0 0
SonicChemist[Psychaos] 71996 71988 71992 72005 1 1 1 0.02 0.01 0.02 0.02
AcidKiller{Hallucinogen}[InfectedMushroomXerox] 71968 71964 71963 72005 1 1 1 0.01 0.01 0.08 0.08
TurnOnTuneIn[Cosmosis] 71960 71964 71960 71928 1 1 1 0.01 0 0.07 0.07
SnarlingBlackMabel_6[Hallucinogen] 71904 71904 71904 71928 1 1 1 0 0 0.05 0.05
Lust{ArtOfTranceRemix}[] 71580 71600 71592 71620 1 1 1 0.04 0.02 0.08 0.08
MidnightZombie[SunProject] 71508 71508 71508 71468 1 1 1 0 0 0.08 0.08
ThreeBorgfour[Shivasidpao] 71504 89364 89367 71468 1.25 1.25 1 0.03 0.02 0.07 0.07
TheFirstTouer[CosmicNavigators] 71476 71496 71484 71468 1 1 1 0.04 0.02 0.02 0.02
KitchenSync[Synchro] 71212 106824 71211 126620 1.5 1 1.75 0.01 0 2.35
FastForwardTheFuture{HallucinogenMix}[ZodiacYouth]3 70800 70800 70800 70789 1 1 1 0 0 0.02 0.02
VirtualTerminalEnergised_1[TotalEclipse]2 70632 70628 70627 70640 1 1 1 0.01 0.01 0.02 0.02
TheNeuroDancer[Shakta] 70620 70616 105920 70640 1 1.5 1 0.01 0.01 0.04 0.04
Micronesia[ShaktaMoonweed] 70616 70616 88275 70640 1 1.25 1 0 0.01 0.05 0.05
Spawn[Beast] 70572 123496 123496 70566 1.75 1.75 1 0.01 0.01 0.01 0.01
Chapachoolka[SunProject] 70560 70548 70552 70492 1 1 1 0.03 0.02 0.14 0.14
380Volt[SunProject] 70524 70516 70515 70492 1 1 1 0.02 0.02 0.07 0.07
SpiritualTranceTrack6[GoaGil] 69236 69236 69235 69255 1 1 1 0 0 0.04 0.04
Diffusion[Shidapu] 69104 69104 69103 69255 1 1 1 0 0 0.33 0.33
PsychicPhenomenon[Ultrabeat]2 68724 68720 68715 68688 1 1 1 0.01 0.02 0.08 0.08
Hybrid{TheInfinityProject}[EatStatic] 68308 68312 68311 68200 1 1 1 0.01 0.01 0.25 0.25
Part1[Psilocybin] 67916 67928 67940 67923 1 1 1 0.03 0.06 0.02 0.02
BeneathAnIndianSky[RoyAquarius]2 67480 67480 67479 67243 1 1 1 0 0 0.55 0.55
Songs measured without a harmonic: Rayshoot 124, Correl 111, Envelop 142.
Mean measurement error (BPM): Rayshoot 0.0413, Correl 0.0759, Envelop after fixing harmonics 0.2914, Envelop before 0.0903.
Variance: Rayshoot 0.0148, Correl 0.0562, Envelop after 1.0952, Envelop before 0.0215.

Bibliography

1.BpmDj: Free DJ Tools for Linux Werner Van Belle 1999-2010 http://bpmdj.yellowcouch.org/
2.Beatforce Computer DJ-ing System John Beuving 25 September 2001 http://beatforce.berlios.de/
3.Hang the DJ: Automatic sequencing and seamless mixing of dance-music tracks Dave Cliff Technical report, Digital Media Systems Department, Hewlett-Packard Laboratories, Bristol BS34 8QZ England, June 2000
4.Alien Pump Tandu Distance To Goa 6, Label: Distance (Substance) Records, 1997
5.LSD Simon Posford (Hallucinogen) Dragonfly Records, 1995
6.X-Files Chakra & Edimis Retro 10 Years of Israeli Trance, Label: Phonokol, 2001
7.Anniversary Waltz Status Quo Label: Castle Music, 1990
8.Certain Topics in Telegraph Transmission Theory Harry Nyquist AIEE Trans., 47:617-644, 1928
9.Rolling On Chrome Aphrodelics remixed by Kruder & Dorfmeister Studio K7, 1990
10.Tainted Love Soft Cell Warner, May 1996
11.Music understanding at the beat level: Real-time beat tracking for audio signals M. Goto, Y. Muraoka Readings in Computational Auditory Scene Analysis, 1997
12.Apparatus for detecting the number of beats Yamada, Kimura, Tomohiko, Funada, Takeaki, Inoshita, and Gen US Patent Nr 5,614,687, December 1995
13.Estimation of tempo, micro time and time signature from percussive music Christian Uhle, Jürgen Herre Proceedings of the 6th International Conference on Digital Audio Effects (DAFX-03), London, UK, September 2003
14.Tempo and beat analysis of acoustic musical signals Scheirer, Eric D Journal of the Acoustical Society of America, 103-1:588, January 1998
15.Discrete-Time Signal Processing Alan V. Oppenheim, Ronald W. Schafer, John R. Buck Signal Processing Series. Prentice Hall, 1989

Footnotes

...fixed1
Admittedly, not all songs fall in this category, nevertheless we can safely make this assumption because we are only interested in the audio fragments that are to be mixed. Also, very few DJ's will mix two songs when one of the tempos changes too much.
... frequency2
BPM and Hz are both units of frequency. To convert BPM to Hz we simply multiply or divide by 60: $1\,\textrm{BPM}=\frac{1}{60}\,\textrm{Hz}$ and $1\,\textrm{Hz}=60\,\textrm{BPM}$. If we have a frequency specified in BPM then we can obtain the time between two beats in msecs with $t=\frac{60000}{f}$.
