11 |
Human skin segmentation using correlation rules on dynamic color clustering / Segmentação de pele humana usando regras de correlação baseadas em agrupamento dinâmico de coresRodrigo Augusto Dias Faria 31 August 2018 (has links)
Human skin is made of a stack of different layers, each of which reflects a portion of impinging light, after absorbing a certain amount of it by the pigments which lie in the layer. The main pigments responsible for skin color origins are melanin and hemoglobin. Skin segmentation plays an important role in a wide range of image processing and computer vision applications. In short, there are three major approaches for skin segmentation: rule-based, machine learning and hybrid. They differ in terms of accuracy and computational efficiency. Generally, machine learning and hybrid approaches outperform the rule-based methods but require a large and representative training dataset and, sometimes, costly classification time as well, which can be a deal breaker for real-time applications. In this work, we propose an improvement, in three distinct versions, of a novel method for rule-based skin segmentation that works in the YCbCr color space. Our motivation is based on the hypotheses that: (1) the original rule can be complemented and, (2) human skin pixels do not appear isolated, i.e. neighborhood operations are taken into consideration. The method is a combination of some correlation rules based on these hypotheses. Such rules evaluate the combinations of chrominance Cb, Cr values to identify the skin pixels depending on the shape and size of dynamically generated skin color clusters. The method is very efficient in terms of computational effort as well as robust in very complex images. / A pele humana é constituída de uma série de camadas distintas, cada uma das quais reflete uma porção de luz incidente, depois de absorver uma certa quantidade dela pelos pigmentos que se encontram na camada. Os principais pigmentos responsáveis pela origem da cor da pele são a melanina e a hemoglobina. A segmentação de pele desempenha um papel importante em uma ampla gama de aplicações em processamento de imagens e visão computacional. Em suma, existem três abordagens principais para segmentação de pele: baseadas em regras, aprendizado de máquina e híbridos. Elas diferem em termos de precisão e eficiência computacional. Geralmente, as abordagens com aprendizado de máquina e as híbridas superam os métodos baseados em regras, mas exigem um conjunto de dados de treinamento grande e representativo e, por vezes, também um tempo de classificação custoso, que pode ser um fator decisivo para aplicações em tempo real. Neste trabalho, propomos uma melhoria, em três versões distintas, de um novo método de segmentação de pele baseado em regras que funciona no espaço de cores YCbCr. Nossa motivação baseia-se nas hipóteses de que: (1) a regra original pode ser complementada e, (2) pixels de pele humana não aparecem isolados, ou seja, as operações de vizinhança são levadas em consideração. O método é uma combinação de algumas regras de correlação baseadas nessas hipóteses. Essas regras avaliam as combinações de valores de crominância Cb, Cr para identificar os pixels de pele, dependendo da forma e tamanho dos agrupamentos de cores de pele gerados dinamicamente. O método é muito eficiente em termos de esforço computacional, bem como robusto em imagens muito complexas.
|
12 |
Interaktivní vyhodnocení aktuálního herního stavu biliardu / Interactive evaluation of current biliard situation during gameSedlařík, Jan January 2011 (has links)
The aim of this master’s thesis is evaluation of images by computer vision methods. The elaborated scene is billiard game scanned by camera. Using pre-processing, image segmentation and object classification we will try to understand the gameplay situation and uvaluate it in accordance with applicable rules.
|
13 |
Metody ditheringu obrazu / Methods of image ditheringPelc, Lukáš January 2014 (has links)
Master’s thesis discusses methods for dithering image. The basis is the explanation of the theory of digital images, color models, color depth and color range. Followed by the dismantling of the basic dithering methods which are a thresholding method, a random and matrix diffusion. Discussed are advanced methods of dithering with error distribution, bee with best known method Floyd-Steinberg. Included is a comparison of different methods including subjective comparison using a questionnaire. Program part is JAVA applet that shows the possibility of generating images using various dithering methods.
|
14 |
Vyhledání podobných obrázků pomocí popisu barevným histogramem / Image Retrieval Based on Color HistogramsSailer, Zbyněk January 2012 (has links)
This thesis deals with description of existing methods of image retrieval. It contains set of methods for image description, coding of global and local descriptor (SIFT, etc.) and describes method of effective searching in multidimensional space (LSH). It continues with proposal and testing of three global descriptors using color histograms, histogram of gradients and the combination of both. The last part deals with similar image retrieval using proposed descriptors and the indexing method LSH and compares the results with the existing method. Product of this work is an experimental application which demonstrates the proposed solution.
|
15 |
Improving information perception from digital images for users with dichromatic color visionShayeghpour, Omid January 2013 (has links)
Color vision deficiency (CVD) is the inability or limited ability to recognize colors and discriminate between them. A person with this condition perceives a narrower range of colors compared to a person with a normal color vision. A growing number of researchers are striving to improve the quality of life for CVD patients. Finding cure, making rectification equipment, providing simulation tools and applying color transformation methods are among the efforts being made by researchers in this field. In this study we concentrate on recoloring digital images in such a way that users with CVD, especially dichromats, perceive more details from the recolored images compared to the original image. The main focus is to give the CVD user a chance to find information within the picture which they could not perceive before. However, this transformed image might look strange or unnatural to users with normal color vision. During this color transformation process, the goal is to keep the overall contrast of the image constant while adjusting the colors that might cause confusion for the CVD user. First, each pixel in the RGB-image is converted to HSV color space in order to be able to control hue, saturation and intensity for each pixel and then safe and problematic hue ranges need to be found. The method for recognizing these ranges was inspired by a condition called “unilateral dichromacy” in which the patient has normal color vision in one eye and dichromacy in another. A special grid-like color card is designed, having constant saturation and intensity over the entire image, while the hue smoothly changes from one block to another to cover the entire hue range. The next step is to simulate the way this color card is perceived by a dichromatic user and finally to find the colors that are perceived identically from two images and the ones that differ too much. This part makes our method highly customizable and we can apply it to other types of CVD, even personalize it for the color vision of a specific observer. The resulting problematic colors need to be dealt with by shifting the hue or saturation based on some pre-defined rules. The results for the method have been evaluated both objectively and subjectively. First, we simulated a set of images as they would be perceived by a dichromat and compared them with simulated view of our transformed images. The results clearly show that our recolored images can eliminate a lot of confusion from user and convey more details. Moreover, an online questionnaire was created and 39 users with CVD confirmed that the transformed images allow them to perceive more information compared to the original images.
|
16 |
Vizuální detekce herního stavu biliardu / Detection of billiard's situation by means of cameraZítka, Michal January 2010 (has links)
This thesis deals with processing pictures from game on billiard, their pre-processing, segmentation and classification of objects. Depending on objects classification is determinated state of game and it´s next step. Aim of this project is development of system for gameplay assistance.
|
17 |
Rozpoznávání pozic a gest / Recognition of Poses and GesturesJiřík, Leoš January 2008 (has links)
This thesis inquires the existing methods on the field of image recognition with regards to gesture recognition. Some methods have been chosen for deeper study and these are to be discussed later on. The second part goes in for the concenpt of an algorithm that would be able of robust gesture recognition based on data acquired within the AMI and M4 projects. A new ways to achieve precise information on participants position are suggested along with dynamic data processing approaches toward recognition. As an alternative, recognition using Gaussian Mixture Models and periodicity analysis are brought in. The gesture class in focus are speech supporting gestures. The last part demonstrates the results and discusses future work.
|
18 |
Segmenta??o Fuzzy de Texturas e V?deosSantos, Tiago Souza dos 17 August 2012 (has links)
Made available in DSpace on 2014-12-17T15:48:04Z (GMT). No. of bitstreams: 1
TiagoSS_DISSERT.pdf: 2900373 bytes, checksum: ea7bd73351348f5c75a5bf4f337c599f (MD5)
Previous issue date: 2012-08-17 / Conselho Nacional de Desenvolvimento Cient?fico e Tecnol?gico / The segmentation of an image aims to subdivide it into constituent regions or objects
that have some relevant semantic content. This subdivision can also be applied to videos.
However, in these cases, the objects appear in various frames that compose the videos.
The task of segmenting an image becomes more complex when they are composed of
objects that are defined by textural features, where the color information alone is not
a good descriptor of the image. Fuzzy Segmentation is a region-growing segmentation
algorithm that uses affinity functions in order to assign to each element in an image a
grade of membership for each object (between 0 and 1). This work presents a modification
of the Fuzzy Segmentation algorithm, for the purpose of improving the temporal and
spatial complexity. The algorithm was adapted to segmenting color videos, treating them
as 3D volume. In order to perform segmentation in videos, conventional color model
or a hybrid model obtained by a method for choosing the best channels were used. The
Fuzzy Segmentation algorithm was also applied to texture segmentation by using adaptive
affinity functions defined for each object texture. Two types of affinity functions were
used, one defined using the normal (or Gaussian) probability distribution and the other
using the Skew Divergence. This latter, a Kullback-Leibler Divergence variation, is a
measure of the difference between two probability distributions. Finally, the algorithm
was tested in somes videos and also in texture mosaic images composed by images of the
Brodatz album / A segmenta??o de uma imagem tem como objetivo subdividi-la em partes ou objetos
constituintes que tenham algum conte?do sem?ntico relevante. Esta subdivis?o pode
tamb?m ser aplicada a um v?deo, por?m, neste, os objetos est?o presentes nos diversos
quadros que comp?em o v?deo. A tarefa de segmentar uma imagem torna-se mais complexa
quando estas s?o compostas por objetos que contenham caracter?sticas texturais,
com pouca ou nenhuma informa??o de cor. A segmenta??o difusa, do Ingl?s fuzzy, ? uma
t?cnica de segmenta??o por crescimento de regi?es que determina para cada elemento
da imagem um grau de pertin?ncia (entre zero e um) indicando a confian?a de que esse
elemento perten?a a um determinado objeto ou regi?o existente na imagem, fazendo-se
uso de fun??es de afinidade para obter esses valores de pertin?ncia. Neste trabalho ?
apresentada uma modifica??o do algoritmo de segmenta??o fuzzy proposto por Carvalho
[Carvalho et al. 2005], a fim de se obter melhorias na complexidade temporal e espacial.
O algoritmo foi adaptado para segmentar v?deos coloridos tratando-os como volumes 3D.
Para segmentar os v?deos, foram utilizadas informa??es provenientes de um modelo de
cor convencional ou de um modelo h?brido obtido atrav?s de uma metodologia para a
escolha dos melhores canais para realizar a segmenta??o. O algoritmo de segmenta??o
fuzzy foi aplicado tamb?m na segmenta??o de texturas, fazendo-se uso de fun??es de afinidades
adaptativas ?s texturas de cada objeto. Dois tipos de fun??es de afinidades foram
utilizadas, uma utilizando a distribui??o normal de probabilidade, ou Gaussiana, e outra
utilizando a diverg?ncia Skew. Esta ?ltima, uma varia??o da diverg?ncia de Kullback-
Leibler, ? uma medida da diverg?ncia entre duas distribui??es de probabilidades. Por
fim, o algoritmo foi testado com alguns v?deos e tamb?m com imagens de mosaicos de
texturas criadas a partir do ?lbum de Brodatz e outros
|
19 |
Algoritmy přepočtů gamutů ve správě barev / Gamut mapping algorithms in color managementSvoboda, Jan January 2014 (has links)
The thesis deals with colors - their representation in digital devices and how to provide the best color preservation accross different devices. In the first part of the work, the knowledge of colors and human vision is briefly summarized. Then color models and color spaces are elaborated, mainly those device independent. Spectrum of colors viewable or printable on a device - the gamut - is different for every device and there's a need of precise reproduction or record of color. That's why the system of color management is described further and especially the gamut mapping approaches and algorithms are mentioned. In the second part of the work, the implementation of how two algorithms of color gamut mapping (HPMINDE, SCLIP) can be implemented in MATLAB is described. In the third and last part of the work, the results of implemented algorithms are presented and discussed. These results are compared to results of commonly used color gamut mapping technique (Adobe Photoshop).
|
20 |
Detekce obličejů ve videu / Face Detection in VideoKolman, Aleš January 2012 (has links)
The project is focused on face detection in video. Firstly, it contains a summary of basic color models. Secondly, you can find the description and comparison of the basic methods for detection of human skin with a practical example of implementation of parametric detector. Thirdly, a theoretical basis for face detection and face tracking in a video containing a list of basic concepts and methods of this issue follows. Greater emphasis is placed on the description of machine learning algorithm AdaBoost and description of the possible application of the Kalman filter for the purpose of face tracking. Design, implementation and testing of library accomplished within the master thesis are listed in the final part of this thesis.
|
Page generated in 0.0611 seconds