Return to search

The Augmented Geometrically Spaced Transform : applications of the single channel frequency estimator / Applications of the single channel frequency estimator

Thesis: S.M., Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, February, 2021 / Cataloged from the official PDF version of thesis. / Includes bibliographical references (pages 99-103). / The Augmented Geometrically Spaced Transform (AGST) is an auditory model that is based on an inversion of the acoustic piano, where the piano produces music and the transform analyses it. In contrast with the standard spectrogram, which is a complex frequency vector versus time, the AGST is based around a matrix of frequencies, known as the AGST Frequency Matrix, where for every frequency in the matrix, a spectral envelope is computed using a Single Channel Frequency Estimator (SCFE). The core invention of the thesis is the algorithm for the SCFE, which computes spectral envelopes with maximally high definition in a computationally efficient manner. A bank of SCFEs is assembled into a constant Q transform, known as a Geometrically Spaced Transform (GST). The GST can be used to visualize harmonics inside of musical notes, or audio in general, in a constant Q fashion. It is then shown that the AGST is a good front-end model for computational pitch perception. For example, it can be used to solve an important problem in auditory perception, the case of the missing fundamental. The entire thesis is framed in the context of building artificially intelligent music systems, including synthetic listeners (machines that listen in the way that people do), and synthetic performers (machines that allow for interactive music performance). / by Jonathan Michael Feldman. / S.M. / S.M. Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences

Identiferoai:union.ndltd.org:MIT/oai:dspace.mit.edu:1721.1/131006
Date January 2021
CreatorsFeldman, Jonathan Michael,S.M.Massachusetts Institute of Technology.
ContributorsJoseph A. Paradiso., Program in Media Arts and Sciences (Massachusetts Institute of Technology), Program in Media Arts and Sciences (Massachusetts Institute of Technology)
PublisherMassachusetts Institute of Technology
Source SetsM.I.T. Theses and Dissertation
LanguageEnglish
Detected LanguageEnglish
TypeThesis
Format103 pages, application/pdf
RightsMIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided., http://dspace.mit.edu/handle/1721.1/7582

Page generated in 0.0017 seconds