没有合适的资源?快使用搜索试试~ 我知道了~
温馨提示
This paper presents a novel approach to visualiz- ing the time structure of musical waveforms. The acoustic similarity between any two instants of an audio recording is displayed in a static 2D repre- sentation, which makes structural and rhythmic characteristics visible. Unlike practically all prior work, this method characterizes self-similarity rather than specific audio attributes such as pitch or spectral features. Examples are presented for classical and popular music.
资源推荐
资源详情
资源评论
Abstract
This paper presents a novel approach to visualiz-
ing the time structure of musical waveforms. The
acoustic similarity between any two instants of an
audio recording is displayed in a static 2D repre-
sentation, which makes structural and rhythmic
characteristics visible. Unlike practically all prior
work, this method characterizes self-similarity
rather than specific audio attributes such as pitch
or spectral features. Examples are presented for
classical and popular music.
1. Introduction
There has been considerable interest in making music
visible. Efforts include artistic attempts to realize images
elicited by sound, of which the Walt Disney film Fantasia is
perhaps the canonical example. Another approach is to
quantitatively render the time and/or frequency content of
the audio signal, using methods such as the oscillograph and
sound spectrograph [1,2]. These are intended primarily for
scientific or quantitative analysis, though artists like Mary
Ellen Bute have used quantitative methods such as the
cathode ray oscilloscope towards artistic ends [3]. Other
visualizations are derived from note-based or score-like
representation of music, typically MIDI note events [4,5].
Music is generally self-similar. With the possible
exception of a few avant-garde compositions, structure and
repetition is a general feature of nearly all music. That is, the
coda often resembles the introduction and the second chorus
sounds like the first. On a shorter time scale, successive bars
are often repetitive, especially in popular music. This paper
presents methods of visualizing music by its acoustic self-
similarity across time, rather than by absolute acoustic
characteristics. Self-similarity is visualized in a two-
dimensional time representation such as Figure 1
. This
representation presented here is very flexible and can be
used with practically any parameterization of audio. Besides
audio, similar representations have been used to analyze text
[7], video [8], hypertext links [9], and dynamical systems
[10].
2. Similarity Analysis
An audio file is visualized as a square. Time runs from
left to right as well as from bottom to top. In the square, the
brightness of a point is proportional to the audio
similarity at instants i and j. Similar regions are bright while
dissimilar regions are dark. Thus there is always a bright
diagonal line running from bottom left to top right, because
audio is always the most similar to itself at any particular
time. Repetitive similarities, such as repeating notes or
motifs, show up as a checkerboard patterns: a note repeated
twice will give four bright areas at the corner of a square.
The two regions at the off-diagonal corners are the “cross-
terms” resulting from the first note’s similarity to the second.
Repeated themes are visible as diagonal lines parallel to, and
separated from, the main diagonal by the time difference
ij(,)
Figure 1. Self-similarity of Bach’s Prelude No. 1
i
start
end
j
i
j
similarity
matrix
S
start
end
waveform
D(i,j)
Figure 2. Distance matrix calculation
Visualizing Musical Structure and Rhythm via Self-Similarity
Jonathan Foote and Matthew Cooper
FX Palo Alto Laboratory, Inc.
3400 Hillview Ave., Building 4
Palo Alto, CA 94304 USA
{foote, cooper}@pal.xerox.com
资源评论
rogermansuy
- 粉丝: 0
- 资源: 2
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功