• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 1
  • 1
  • Tagged with
  • 2
  • 2
  • 2
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

Multi-objective optimization for model selection in music classification / Flermålsoptimering för modellval i musikklassificering

Ujihara, Rintaro January 2021 (has links)
With the breakthrough of machine learning techniques, the research concerning music emotion classification has been getting notable progress combining various audio features and state-of-the-art machine learning models. Still, it is known that the way to preprocess music samples and to choose which machine classification algorithm to use depends on data sets and the objective of each project work. The collaborating company of this thesis, Ichigoichie AB, is currently developing a system to categorize music data into positive/negative classes. To enhance the accuracy of the existing system, this project aims to figure out the best model through experiments with six audio features (Mel spectrogram, MFCC, HPSS, Onset, CENS, Tonnetz) and several machine learning models including deep neural network models for the classification task. For each model, hyperparameter tuning is performed and the model evaluation is carried out according to pareto optimality with regard to accuracy and execution time. The results show that the most promising model accomplished 95% correct classification with an execution time of less than 15 seconds. / I och med genombrottet av maskininlärningstekniker har forskning kring känsloklassificering i musik sett betydande framsteg genom att kombinera olikamusikanalysverktyg med nya maskinlärningsmodeller. Trots detta är hur man förbehandlar ljuddatat och valet av vilken maskinklassificeringsalgoritm som ska tillämpas beroende på vilken typ av data man arbetar med samt målet med projektet. Denna uppsats samarbetspartner, Ichigoichie AB, utvecklar för närvarande ett system för att kategorisera musikdata enligt positiva och negativa känslor. För att höja systemets noggrannhet är målet med denna uppsats att experimentellt hitta bästa modellen baserat på sex musik-egenskaper (Mel-spektrogram, MFCC, HPSS, Onset, CENS samt Tonnetz) och ett antal olika maskininlärningsmodeller, inklusive Deep Learning-modeller. Varje modell hyperparameteroptimeras och utvärderas enligt paretooptimalitet med hänsyn till noggrannhet och beräkningstid. Resultaten visar att den mest lovande modellen uppnådde 95% korrekt klassificering med en beräkningstid på mindre än 15 sekunder.
2

A Hierarchical Approach To Music Analysis And Source Separation

Thoshkahna, Balaji 11 1900 (has links) (PDF)
Music analysis and source separation have become important and allied areas of research over the last decade. Towards this, analyzing a music signal for important events such as onsets, offsets and transients are important problems. These tasks help in music source separation and transcription. Approaches in source separation too have been making great strides, but most of these techniques are aimed at Western music and fail to perform well for Indian music. The fluid style of instrumentation in Indian music requires a slightly modified approach to analysis and source separation. We propose an onset detection algorithm that is motivated by the human auditory system. This algorithm has the advantage of having a unified framework for the detection of both onsets and offsets in music signals. This onset detection algorithm is further extended to detect percussive transients. Percussive transients have sharp onsets followed closely by sharp offsets. This characteristic is exploited in the percussive transients detection algorithm. This detection does not lend itself well to the extraction of transients and hence we propose an iterative algorithm to extract all types of transients from a polyphonic music signal. The proposed iterative algorithm is both fast and accurate to extract transients of various strengths. This problem of transient extraction can be extended to the problem of harmonic/percussion sound separation(HPSS), where a music signal is separated into two streams consisting of components mainly from percussion and harmonic instruments. Many algorithms that have been proposed till date deal with HPSS for Western music. But with Indian classical/film music, a different style of instrumentation or singing is seen, including high degree of vibratos or glissando content. This requires new approaches to HPSS. We propose extensions to two existing HPSS techniques, adapting them for Indian music. In both the extensions, we retain the original framework of the algorithm, showing that it is easy to incorporate the changes needed to handle Indian music. We also propose a new HPSS algorithm that is inspired by our transient extraction technique. This algorithm can be considered a generalized extension to our transient extraction algorithm and showcases our view that HPSS can be considered as an extension to transient analysis. Even the best HPSS techniques have leakages of harmonic components into percussion and this can lead to poor performances in tasks like rhythm analysis. In order to reduce this leakage, we propose a post processing technique on the percussion stream of the HPSS algorithm. The proposed method utilizes signal stitching by exploiting a commonly used model for percussive envelopes. We also developed a vocals extraction algorithm from the harmonic stream of the HPSS algorithm. The vocals extraction follows the popular paradigm of extracting the predominant pitch followed by generation of the vocals signal corresponding to the pitch. We show that HPSS as a pre-processing technique gives an advantage in reducing the interference from percussive sources in the extraction stage. It is also shown that the performance of vocal extraction algorithms improve with the knowledge about locations of the vocal segments. This is shown with the help of an oracle to locate the vocal segments. The use of the oracle greatly reduces the interferences from other dominating sources in the extracted vocals signal.

Page generated in 0.0185 seconds