Global ETD Search

1	An Assurance Metric and Robustness Evaluation of a Low-cost Acoustic Beamformer for Source Localization Coleman, Thomas Christopher 26 July 2018 (has links) A rise in interest for service robotic rovers produces a need for a low-cost method for source localization in order for a prospective robotic unit to engage with a human operator. This study examines the use of the LMS algorithm for constructing a beamformer using an optimized Weiner filter solution for this source localization application and evaluates the robustness of a developed characterization method for assuring that a proper approximation for the desired signal is achieved. The method presented in this paper encompasses using a filter and sum method in which the sums are generated for a selected set of filter angles, and this set of sums are compared and characterized to produce a selection for an approximate arrival angle from the sound source to the microphone array. These filters are adaptively trained offline using a generated desired signal chirp to represent the average human whistle and a training data set for each of the four possible room configurations. This method was tested to determine if a selected filter configuration could still produce viable outputs for scenarios in which the testing room had been changed, whether noise was injected into the testing environment, if two or three microphones were used in testing process, and whether the filter angles are aligned with the arrival angles of the signal. Results on the robustness of the adaptive LMS beamformer are presented. Limitations of the system performance are discussed and possible solutions for results that have undesired performance are given in future work. / Master of Science / A rise in interest for service robotic rovers produces a need for a low-cost method for locating a sound source so that a potential service robot can interact with a human operator. In this study, a beamformer is implemented to approximate a direction for the sound source. This beamformer is comprised of a set of trained filters for the designed microphone array. These filters were trained based on three training conditions of training room, the number of microphones used, and whether additive or ambient noise is used during training. The training signal for the filters consisted of a chirp from 1 to 2.5 kHz to mimic a portion of the human whistling spectrum. Once trained, these beamformers were then given data from separate tests to determine if a distinct and correct approximation could be determined. This paper suggests a method to use the correlation of each beamformer to the training signal to determine both the maximum correlated beamformer and whether correlation is distinct from greater than the other beamformers examined. These results are finally examined under an ANOVA and percent difference process to determine if the three training conditions improve the average prediction of the angle of arrival of the source signal for the generated beamformers. Metric Evaluation Acoustic Beamforming Source Localization
2	Computational Acoustic Beamforming of Noise Source on Wind Turbine Airfoil Li, Chi Shing January 2014 (has links) A new method, Computational Acoustic Beamforming, is proposed in this thesis. This novel numerical sound source localization methodology combines the advantages of the Computational Fluid Dynamics (CFD) simulation and experimental acoustic beamforming, which enable this method to take directivity of sound source emission into account while maintaining a relatively low cost. This method can also aid the optimization of beamforming algorithm and microphone array design. In addition, it makes sound source prediction of large structures in the low frequency range possible. Three modules, CFD, Computational Aeroacoustics (CAA) and acoustic beamforming, are incorporated in this proposed method. This thesis adopts an open source commercial software OpenFOAM for the flow field simulation with the Improved Delayed Detached Eddy Simulation (IDDES) turbulence model. The CAA calculation is conducted by an in-house code using impermeable Ffowcs-Williams and Hawkings (FW-H) equation for static sound source. The acoustic beamforming is performed by an in-house Delay and Sum (DAS) beamformer code with several different microphone array designs. Each module has been validated with currently available experimental data and numerical results. A flow over NACA 0012 airfoil case was chosen as a demonstration case for the new method. The aerodynamics and aeroacoustics results are shown and compared with the experimental measurements. A relatively good agreement has been achieved which gives the confidence of using this newly proposed method in sound source localization applications.
3	ROBUST SPEAKER DIARIZATION FOR MEETINGS Anguera Miró, Xavier 21 December 2006 (has links) Aquesta tesi doctoral mostra la recerca feta en l'àrea de la diarització de locutor per a sales de reunions. En la present s'estudien els algorismes i la implementació d'un sistema en diferit de segmentació i aglomerat de locutor per a grabacions de reunions a on normalment es té accés a més d'un micròfon per al processat. El bloc més important de recerca s'ha fet durant una estada al International Computer Science Institute (ICSI, Berkeley, Caligornia) per un període de dos anys.La diarització de locutor s'ha estudiat força per al domini de grabacions de ràdio i televisió. La majoria dels sistemes proposats utilitzen algun tipus d'aglomerat jeràrquic de les dades en grups acústics a on de bon principi no se sap el número de locutors òptim ni tampoc la seva identitat. Un mètode molt comunment utilitzat s'anomena "bottom-up clustering" (aglomerat de baix-a-dalt), amb el qual inicialment es defineixen molts grups acústics de dades que es van ajuntant de manera iterativa fins a obtenir el nombre òptim de grups tot i acomplint un criteri de parada. Tots aquests sistemes es basen en l'anàlisi d'un canal d'entrada individual, el qual no permet la seva aplicació directa per a reunions. A més a més, molts d'aquests algorisms necessiten entrenar models o afinar els parameters del sistema usant dades externes, el qual dificulta l'aplicabilitat d'aquests sistemes per a dades diferents de les usades per a l'adaptació.La implementació proposada en aquesta tesi es dirigeix a solventar els problemes mencionats anteriorment. Aquesta pren com a punt de partida el sistema existent al ICSI de diarització de locutor basat en l'aglomerat de "baix-a-dalt". Primer es processen els canals de grabació disponibles per a obtindre un sol canal d'audio de qualitat major, a més dínformació sobre la posició dels locutors existents. Aleshores s'implementa un sistema de detecció de veu/silenci que no requereix de cap entrenament previ, i processa els segments de veu resultant amb una versió millorada del sistema mono-canal de diarització de locutor. Aquest sistema ha estat modificat per a l'ús de l'informació de posició dels locutors (quan es tingui) i s'han adaptat i creat nous algorismes per a que el sistema obtingui tanta informació com sigui possible directament del senyal acustic, fent-lo menys depenent de les dades de desenvolupament. El sistema resultant és flexible i es pot usar en qualsevol tipus de sala de reunions pel que fa al nombre de micròfons o la seva posició. El sistema, a més, no requereix en absolute dades d´entrenament, sent més senzill adaptar-lo a diferents tipus de dades o dominis d'aplicació. Finalment, fa un pas endavant en l'ús de parametres que siguin mes robusts als canvis en les dades acústiques. Dos versions del sistema es van presentar amb resultats excel.lents a les evaluacions de RT05s i RT06s del NIST en transcripció rica per a reunions, a on aquests es van avaluar amb dades de dos subdominis diferents (conferencies i reunions). A més a més, es fan experiments utilitzant totes les dades disponibles de les evaluacions RT per a demostrar la viabilitat dels algorisms proposats en aquesta tasca. / This thesis shows research performed into the topic of speaker diarization for meeting rooms. It looks into the algorithms and the implementation of an offline speaker segmentation and clustering system for a meeting recording where usually more than one microphone is available. The main research and system implementation has been done while visiting the International Computes Science Institute (ICSI, Berkeley, California) for a period of two years. Speaker diarization is a well studied topic on the domain of broadcast news recordings. Most of the proposed systems involve some sort of hierarchical clustering of the data into clusters, where the optimum number of speakers of their identities are unknown a priory. A very commonly used method is called bottom-up clustering, where multiple initial clusters are iteratively merged until the optimum number of clusters is reached, according to some stopping criterion. Such systems are based on a single channel input, not allowing a direct application for the meetings domain. Although some efforts have been done to adapt such systems to multichannel data, at the start of this thesis no effective implementation had been proposed. Furthermore, many of these speaker diarization algorithms involve some sort of models training or parameter tuning using external data, which impedes its usability with data different from what they have been adapted to.The implementation proposed in this thesis works towards solving the aforementioned problems. Taking the existing hierarchical bottom-up mono-channel speaker diarization system from ICSI, it first uses a flexible acoustic beamforming to extract speaker location information and obtain a single enhanced signal from all available microphones. It then implements a train-free speech/non-speech detection on such signal and processes the resulting speech segments with an improved version of the mono-channel speaker diarization system. Such system has been modified to use speaker location information (then available) and several algorithms have been adapted or created new to adapt the system behavior to each particular recording by obtaining information directly from the acoustics, making it less dependent on the development data.The resulting system is flexible to any meetings room layout regarding the number of microphones and their placement. It is train-free making it easy to adapt to different sorts of data and domains of application. Finally, it takes a step forward into the use of parameters that are more robust to changes in the acoustic data. Two versions of the system were submitted with excellent results in RT05s and RT06s NIST Rich Transcription evaluations for meetings, where data from two different subdomains (lectures and conferences) was evaluated. Also, experiments using the RT datasets from all meetings evaluations were used to test the different proposed algorithms proving their suitability to the task. acoustic beamforming speaker diarization speaker clustering speaker change detection speaker segmentation signal enhancement 004
4	Baseline-free Damage Identification for Plate-like Structures using a Delay and Sum Beamforming Algorithm Thakur, Ashwani January 2021 (has links) No description available. Mechanical Engineering Damage detection plate structures active sound source microphone array acoustic beamforming algorithm
5	Acoustic Beamforming : Design and Development of Steered Response Power With Phase Transformation (SRP-PHAT). / Acoustic Beamforming : Design and Development of Steered Response Power With Phase Transformation (SRP-PHAT). Dey, Ajoy Kumar, Saha, Susmita January 2011 (has links) Acoustic Sound Source localization using signal processing is required in order to estimate the direction from where a particular acoustic source signal is coming and it is also important in order to find a soluation for hands free communication. Video conferencing, hand free communications are different applications requiring acoustic sound source localization. This applications need a robust algorithm which can reliably localize and position the acoustic sound sources. The Steered Response Power Phase Transform (SRP-PHAT) is an important and roubst algorithm to localilze acoustic sound sources. However, the algorithm has a high computational complexity thus making the algorithm unsuitable for real time applications. This thesis focuses on describe the implementation of the SRP-PHAT algorithm as a function of source type, reverberation levels and ambient noise. The main objective of this thesis is to present different approaches of the SRP-PHAT to verify the algorithm in terms of acoustic enviroment, microphone array configuration, acoustic source position and levels of reverberation and noise. Acoustic Beamforming SRP-PHAT Sound Source Localization Source Position Microphone Array algorithm Signal Processing Signalbehandling Computer Sciences Datavetenskap (datalogi) Telecommunications Telekommunikation

1

Page generated in 0.082 seconds