Global ETD Search

21	Wireless Network SNR Enhancement Using Mobile Relay Stations Ohannessian, Rostom 13 January 2011 (has links) With the proliferation of wireless technologies, wireless Internet access in public places will become a necessity in the near future. In outdoor areas, where the base stations are sparsely distributed, mobile users at the edge of the network communicate with the base station at a very low rate and thus waste network resources. To solve this problem, one of the previously taken approaches was the use of relay stations to improve the throughput of the network. In this work, we take this approach to the next level by updating the positions of the relays according to the particular distribution of the users at certain time instants. By comparing the proposed scheme to fixed relay placement strategies, we show that the former has 15-60% performance improvement over the latter, in terms of the average SNR of the network. Mobile Relays SNR Enhancement k-means Clustering 0544
22	Non-Line of Sight Identification with Particle Filter Optimization Algprithm in Wireless Location Chen, Tai-Yuan 29 July 2008 (has links) In wireless location systems, received signals may be influenced by non-line of sight (NLOS) propagation errors, which yield severe degradation of location accuracy.Therefore, to distinguish how many measurement signals are line-of-sight (LOS) and to identify them simultaneously will contribute to the increase of location accuracy.We propose a method based on recursive hypothesis testing algorithm, and use residual information to determine whether the NLOS errors are present in measurements. Since the probability distribution of measurements with NLOS errors is different from that of measurements without NLOS errors, a likelihood ratio test can be used in determining the LOS/NLOS status of the measurements. To search for an optimal threshold for the hypothesis testing, particle filtering optimization(PFO) is adopted. The PFO algorithm uses particle filtering to find the best threshold for determining the status of signals measured at all base stations (BSs). In the PFO algorithm, the clustering property of K-means is also used in separating particles, thereby the search of optimal threshold may be implemented in parallel.In this thesis, we focus on the hybrid TOA/AOA (time of arrical/angle of arrival) location method, in which localization only uses the LOS location measurements to calculate the location of a mobile station. Simulation results show that the proposed algorithm performs better than other algorithms which suffer from different degrees of NLOS errors. The proposed scheme also obtains higher identification rate of LOS-BSs in different situations by using the optimal thresholds for status detection. Wireless Location TOA K-means Particle Filter AOA NLOS
23	Automatic Attribute Clustering and Feature Selection Based on Genetic Algorithms Wang, Po-Cheng 21 August 2009 (has links) Feature selection is an important pre-processing step in mining and learning. A good set of features can not only improve the accuracy of classification, but also reduce the time to derive rules. It is executed especially when the amount of attributes in a given training data is very large. This thesis thus proposes three GA-based clustering methods for attribute clustering and feature selection. In the first method, each feasible clustering result is encoded into a chromosome with positive integers and a gene in the chromosome is for an attribute. The value of a gene represents the cluster to which the attribute belongs. The fitness of each individual is evaluated using both the average accuracy of attribute substitutions in clusters and the cluster balance. The second method further extends the first method to improve the time performance. A new fitness function based on both the accuracy and the attribute dependency is proposed. It can reduce the time of scanning the data base. The third approach uses another encoding method for representing chromosomes. It can achieve a faster convergence and a better result than the second one. At last, the experimental comparison with the k-means clustering approach and with all combinations of attributes also shows the proposed approach can get a good trade-off between accuracy and time complexity. Besides, after feature selection, the rules derived from only the selected features may usually be hard to use if some values of the selected features cannot be obtained in current environments. This problem can be easily solved in our proposed approaches. The attributes with missing values can be replaced by other attributes in the same clusters. The proposed approaches thus provide flexible alternatives for feature selection. k-means reduct genetic algorithms feature clustering feature selection
24	Bagged clustering Leisch, Friedrich January 1999 (has links) (PDF) A new ensemble method for cluster analysis is introduced, which can be interpreted in two different ways: As complexity-reducing preprocessing stage for hierarchical clustering and as combination procedure for several partitioning results. The basic idea is to locate and combine structurally stable cluster centers and/or prototypes. Random effects of the training set are reduced by repeatedly training on resampled sets (bootstrap samples). We discuss the algorithm both from a more theoretical and an applied point of view and demonstrate it on several data sets. (author's abstract) / Series: Working Papers SFB "Adaptive Information Systems and Modelling in Economics and Management Science"
25	A K-MEANS BASED WATERSHED IMAGING SEGMENTATION ALGORITHM FOR BANANA CLUSTER QUALITY INSPECTION Castillo, Gregorio Alfonso 01 December 2016 (has links) Banana has become the most commonly consumed fresh fruit among US population. It is a challenge to use computer vision to divide touching bananas, for this purpose a novel image segmentation algorithm is proposed, combining k-means and the watershed transformation. The first part is to extract the background, achieved using a K-means based in the HS space, the second part is individual banana segmentation where a smarter selection of the initial markers from where the watershed transformation grows is attained fusing two morphological filters with different structural elements. The validation of the proposed algorithm has been conducted using 124 experimentally capture banana pictures manually segmented. For background extraction K-means in HS space produced the best performance over the other two tested (Otsu, K-means(Lab), getting average a F1 Score average of 96.99%, Otsu and K-means(Lab) scored 82.58% and 88.06% respectively. The result of the watershed segmentation was also compared with the manual segmentation; The overall performance using the F1 Score in average is 92.28%. The performance would improve with modifications to the system, including a more homogenous illumination, only allowing certain positions to be possible for the bananas cluster, and a more adequate background selection. Color Image Segmentation HSI color space K-means Watershed
26	Diseño de procesos para la segmentación de clientes según su comportamiento de compra y hábito de consumo en una empresa de consumo masivo Rojas Araya, Javier Orlando January 2017 (has links) Magíster en Ingeniería de Negocios con Tecnologías de Información / La industria de alimentos de consumo masivo ha ido evolucionando en el tiempo. Los primeros canales de venta para llegar a los clientes finales fueron los almacenes de barrio los que se vieron fuertemente amenazados con la proliferación de grandes cadenas de supermercados. La aparición de internet también creó un nuevo canal que permite a los clientes finales hacer pedidos de productos y pagarlos a través de aplicaciones móviles para finalmente recibirlos en su domicilio. A pesar de esta evolución en los canales, los almacenes de barrio se niegan a desaparecer. Son muchos los clientes que siguen prefiriendo la atención amable y personalizada de los almacenes junto con un abanico amplio de productos y precios atractivos. La empresa no está ajena a esta realidad y también comercializa sus productos a clientes finales por los canales supermercado y almacenes. Respecto a los almacenes se atiende mensualmente una cantidad aproximada de 25.000 clientes a nivel nacional donde existe una mayor concentración en la zona centro del país. Segmentar a estos clientes para conocer su comportamiento de compra y hábito de consumo se ha convertido en el eje central de la estrategia de este canal. Ya no basta con analizar los reportes de ventas para aumentar el rendimiento del Área Comercial. Este proyecto tiene por objetivo agrupar los clientes del canal Almacenes de la empresa bajo los conceptos de comportamiento de compra y hábito de consumo y lograr caracterizarlos. Para alcanzar esta meta se utiliza la metodología de Ingeniería de Negocios que parte desde la definición del posicionamiento estratégico, el modelo de negocio, la arquitectura de procesos, el diseño detallado de los procesos, el diseño del apoyo tecnológico que soportará a los procesos y finalmente la construcción y puesta en marcha de la solución. Además se utilizarán algoritmos propios para este tipo de tareas como son DBSCAN y K-Means. Los resultados obtenidos permiten segmentar a los clientes en siete grupos para el comportamiento de compra y siete para el hábito consumo. Con esto se puede responder las preguntas de cuándo, cuánto y qué compran los clientes del canal. El beneficio del proyecto se traduce en un aumento de las ventas por acciones que permiten recuperar a clientes que están en proceso de fugarse y por aumento del ticket promedio de aquellos clientes que realizan compras frecuentes pero de muy bajo monto de facturación. / 07/04/2022 Segmentación del mercado Gestión de negocios - Chile K-means DBSCAN
27	Optimalizace rozvržení provozu ve firmě Vodárenská akciová společnost a.s. Urbanová, Zuzana January 2014 (has links) Dimploma thesis deals with the optimization of operation distribution in company Vodárenská akciová společnost, a.s. Aim of the thesis is to determine achievement standards of companies branches and to state the capacity reserves for every one of them. Next, using methods of cluster analysis and graph theory, proposing recommendation leading to effective utilization of operation capacity. This is done by optimizing total number of business branches and subsequent creation of new regions. Thesis consists of theoretical and practical part. In this papers theoretical part, hierarchical and nonhierarchical clustering algorithms, minimal spanning tree and water management sector are described. Practical part addresses optimization of layout of company operation distribution. Based on comparison of outputs of chosen methods, recommendations for company, will be proposed.
28	Clustering Methods and Their Applications to Adolescent Healthcare Data Mayer-Jochimsen, Morgan 01 January 2013 (has links) Clustering is a mathematical method of data analysis which identifies trends in data by efficiently separating data into a specified number of clusters so is incredibly useful and widely applicable for questions of interrelatedness of data. Two methods of clustering are considered here. K-means clustering defines clusters in relation to the centroid, or center, of a cluster. Spectral clustering establishes connections between all of the data points to be clustered, then eliminates those connections that link dissimilar points. This is represented as an eigenvector problem where the solution is given by the eigenvectors of the Normalized Graph Laplacian. Spectral clustering establishes groups so that the similarity between points of the same cluster is stronger than similarity between different clusters. K-means and spectral clustering are used to analyze adolescent data from the 2009 California Health Interview Survey. Differences were observed between the results of the clustering methods on 3294 individuals and 22 health-related attributes. K-means clustered the adolescents by exercise, poverty, and variables related to psychological health while spectral clustering groups were informed by smoking, alcohol use, low exercise, psychological distress, low parental involvement, and poverty. We posit some guesses as to this difference, observe characteristics of the clustering methods, and comment on the viability of spectral clustering on healthcare data. K-means spectral clustering clustering healthcare Other Applied Mathematics
29	An Empirical Analysis of Network Traffic: Device Profiling and Classification Anbazhagan, Mythili Vishalini 02 July 2019 (has links) Time and again we have seen the Internet grow and evolve at an unprecedented scale. The number of online users in 1995 was 40 million but in 2020, number of online devices are predicted to reach 50 billion, which would be 7 times the human population on earth. Up until now, the revolution was in the digital world. But now, the revolution is happening in the physical world that we live in; IoT devices are employed in all sorts of environments like domestic houses, hospitals, industrial spaces, nuclear plants etc., Since they are employed in a lot of mission-critical or even life-critical environments, their security and reliability are of paramount importance because compromising them can lead to grave consequences. IoT devices are, by nature, different from conventional Internet connected devices like laptops, smart phones etc., They have small memory, limited storage, low processing power etc., They also operate with little to no human intervention. Hence it becomes very important to understand IoT devices better. How do they behave in a network? How different are they from traditional Internet connected devices? Can they be identified from their network traffic? Is it possible for anyone to identify them just by looking at the network data that leaks outside the network, without even joining the network? That is the aim of this thesis. To the best of our knowledge, no study has collected data from outside the network, without joining the network, with the intention of finding out if IoT devices can be identified from this data. We also identify parameters that classify IoT and non-IoT devices. Then we do manual grouping of similar devices and then do the grouping automatically, using clustering algorithms. This will help in grouping devices of similar nature and create a profile for each kind of device. IoT Network Data Analytics Machine Learning Clustering K-Means
30	Klusteranalys av cykelflödesdata för identifiering av viktiga faktorer och avvikande datapunkter Hojeij, Mohamed, Tram, Alex January 2019 (has links) Studien har för avsikt att förbättra kunskapen om vilka faktorer som påverkar cykelflödeten viss dag i Malmö. Vi har huvudsakligen undersökt frågor om, hur många grupperandekluster är optimalt för att kunna identifiera avvikande dagar och vilka är dess faktoreri en tidsserie cykelvolymdata? Vår arbetsmetod var att använda ett matchande tillvägagångssätt baserat på ett experiment tillsammans med en utvärderingsmetod. Arbetsmetoden skedde i en iterativ process där experimentet var att hitta rätt antal kluster ochdär utvärderingen var analysen av resultaten som producerades av experimentet. Datanerhållen från en cykelräknare belägen på Kaptensgatan i Malmö fick databearbetas medhjälp av normalisering då volymen av cyklister inte ska ha någon påverkan i studien. Syftetmed vårt arbete är att kunna identifiera avvikande datapunkter och dess faktorer med storinverkan på cykelflöden med hjälp av klusteranalys då detta kan leda till mer välinformerade beslut vid stads- och transportplanering. Om det gick att analysera cyklister där dessafaktorer elimineras så skulle detta leda till vidare utveckling och forskning av stor betydelseför Malmö stad. Genom att använda oss av klusteranalysen K-means och Euklidisk distanssom används som beräkning av distanser inom liknande områden kunde vi finna relevantakluster med avvikande datapunkter och faktorer med stor inverkan på cykelflödet. Vårtresultat visar att 7 kluster varav 2 av de delades upp till 6 mindre kluster, var det mest optimala för studien och faktorerna med en stor inverkan på de antal registrerade cyklisternaunder vissa dagar kunde då identifieras bäst. Faktorerna som identifierades var evenemang,festivaler, fotbollsmatcher, konserter, lovdagar, nederbörd och röda dagar. / This study aims to provide a deeper understanding of the different factors and their impacton the bicycle flow in Malmö during a certain day. We mainly examined the questions,what is the most optimal number of clusters needed in order to identify discrepancies, andwhich key factors have huge impact in a dataset? The choice of the method used in thisstudy is a matching approach based on experiment together with an evaluation method.The work method occurred in an iterative process, where the experiment was conductedto find the right number of clusters and the evaluation was the analysing of the resultsthat were produced by the experiment. The collected data from a bicycle counter, locatedin Kaptensgatan in Malmö, had to be processed with normalization to ensure that thevolume of the bicycles does not affect the study. The purpose of our study is to identifydiscrepancies and key factors that have huge implications on the bicycle flow with thehelp of cluster analysis that might lead to more well-informed decision in urban planningand transportation planning. If it were possible to analyze cyclists where these factorsare eliminated, this would lead to further development and research of great importancefor Malmö City. By using the cluster algorithm K-means, and Euclidean distance, whichis used as calculation of distances in similar areas, we could then find relevant clusterswith deviating data points and key factors with great impact on the bicycle flow. Ourresults shows that 7 clusters, 2 of which were divided up to 6 smaller clusters, were themost optimal for the study and the factors with a large impact on the number-registeredcyclists during certain days could then be best identified. The factors identified wereevents, festivals, football matches, concerts, rainfalls and holidays. Klusteranalys K-Means Cykeldata Maskininlärning Engineering and Technology Teknik och teknologier

Search results