Global ETD Search

1	Semi-supervised co-selection : instances and features : application to diagnosis of dry port by rail / Co-selection instances-variables en mode semi-supervisé : application au diagnostic de transport ferroviaire. Makkhongkaew, Raywat 15 December 2016 (has links) Depuis la prolifération des bases de données partiellement étiquetées, l'apprentissage automatique a connu un développement important dans le mode semi-supervisé. Cette tendance est due à la difficulté de l'étiquetage des données d'une part et au coût induit de cet étiquetage quand il est possible, d'autre part.L'apprentissage semi-supervisé consiste en général à modéliser une fonction statistique à partir de base de données regroupant à la fois des exemples étiquetés et d'autres non-étiquetés. Pour aborder une telle problématique, deux familles d'approches existent : celles basées sur la propagation de la supervision en vue de la classification supervisée et celles basées sur les contraintes en vue du clustering (non-supervisé). Nous nous intéressons ici à la deuxième famille avec une difficulté particulière. Il s'agit d'apprendre à partir de données avec une partie étiquetée relativement très réduite par rapport à la partie non-étiquetée.Dans cette thèse, nous nous intéressons à l'optimisation des bases de données statistiques en vue de l'amélioration des modèles d'apprentissage. Cette optimisation peut être horizontale et/ou verticale. La première définit la sélection d'instances et la deuxième définit la tâche de la sélection de variables.Les deux taches sont habituellement étudiées de manière indépendante avec une série de travaux considérable dans la littérature. Nous proposons ici de les étudier dans un cadre simultané, ce qui définit la thématique de la co-sélection. Pour ce faire, nous proposons deux cadres unifiés considérant à la fois la partie étiquetée des données et leur partie non-étiquetée. Le premier cadre est basé sur un clustering pondéré sous contraintes et le deuxième sur la préservation de similarités entre les données. Les deux approches consistent à qualifier les instances et les variables pour en sélectionner les plus pertinentes de manière simultanée.Enfin, nous présentons une série d'études empiriques sur des données publiques connues de la littérature pour valider les approches proposées et les comparer avec d'autres approches connues dans la littérature. De plus, une validation expérimentale est fournie sur un problème réel, concernant le diagnostic de transport ferroviaire de l'état de la Thaïlande / We are drowning in massive data but starved for knowledge retrieval. It is well known through the dimensionality tradeoff that more data increase informative but pay a price in computational complexity, which has to be made up in some way. When the labeled sample size is too little to bring sufficient information about the target concept, supervised learning fail with this serious challenge. Unsupervised learning can be an alternative in this problem. However, as these algorithms ignore label information, important hints from labeled data are left out and this will generally downgrades the performance of unsupervised learning algorithms. Using both labeled and unlabeled data is expected to better procedure in semi-supervised learning, which is more adapted for large domain applications when labels are hardly and costly to obtain. In addition, when data are large, feature selection and instance selection are two important dual operations for removing irrelevant information. Both of tasks with semisupervised learning are different challenges for machine learning and data mining communities for data dimensionality reduction and knowledge retrieval. In this thesis, we focus on co-selection of instances and features in the context of semi-supervised learning. In this context, co-selection becomes a more challenging problem as the data contains labeled and unlabeled examples sampled from the same population. To do such semi-supervised coselection, we propose two unified frameworks, which efficiently integrate labeled and unlabeled parts into the co-selection process. The first framework is based on weighting constrained clustering and the second one is based on similarity preserving selection. Both approaches evaluate the usefulness of features and instances in order to select the most relevant ones, simultaneously. Finally, we present a variety of empirical studies over high-dimensional data sets, which are well-known in the literature. The results are promising and prove the efficiency and effectiveness of the proposed approaches. In addition, the developed methods are validated on a real world application, over data provided by the State Railway of Thailand (SRT). The purpose is to propose the application models from our methodological contributions to diagnose the performance of rail dry port systems. First, we present the results of some ensemble methods applied on a first data set, which is fully labeled. Second, we show how can our co-selection approaches improve the performance of learning algorithms over partially labeled data provided by SRT Sélection d'instances Sélection de variables Co-selection Apprentissage semi-supervisé Classification sous contraintes Instance selection Feature selecion Co-selection Semi-supervised learning Constrained clustering 006.3
2	On-chip Tracing for Bit-Flip Detection during Post-silicon Validation Vali, Amin January 2018 (has links) Post-silicon validation is an important step during the implementation flow of digital integrated circuits and systems. Most of the validation strategies are based on ad-hoc solutions, such as guidelines from best practices, decided on a case-by-case basis for a specific design and/or application domain. Developing systematic approaches for post-silicon validation can mitigate the productivity bottlenecks that have emerged due to both design diversification and shrinking implementation cycles. Ever since integrating on-chip memory blocks became affordable, embedded logic analysis has been used extensively for post-silicon validation. Deciding at design time which signals to be traceable at the post-silicon phase, has been posed as an algorithmic problem a decade ago. Most of the proposed solutions focus on how to restore as much data as possible within a software simulator in order to facilitate the analysis of functional bugs, assuming that there are no electrically-induced design errors, e.g., bit- flips. In this thesis, first it is shown that analyzing the logic inconsistencies from the post-silicon traces can aid with the detection of bit-flips and their root-cause analysis. Furthermore, when a bit-flip is detected, a list of suspect nets can be automatically generated. Since the rate of bit-flip detection as well the size of the list of suspects depends on the debug data that was acquired, it is necessary to select the trace signals consciously. Subsequently, new methods are presented to improve the bit-flip detectability through an algorithmic approach to selecting the on-chip trace signals. Hardware assertion checkers can also be integrated on-chip in order to detect events of interest, as defined by the user. For example, they can detect a violation of a design property that captures a relationship between internal signals that is supposed to hold indefinitely, so long as no bit-flips occur in the physical prototype. Consequently, information collected from hardware assertion checkers can also provide useful debug information during post-silicon validation. Based on this observation, the last contribution from this thesis presents a novel method to concurrently select a set of trace signals and a set of assertions to be integrated on-chip. / Thesis / Doctor of Philosophy (PhD) Post Silicon Validation Trace Signal Selection Silicon Debug Hardware Assertion Checker Co-selection of trace and assertion
3	Les métaux lourds dans les écosystèmes anthropisés : une pression favorisant la sélection de pathogènes opportunistes résistants à des antibiotiques ? / Heavy metals in impacted ecosystem : a pressure favoring the selection of antibiotic resistant opportunistic pathogens ? Deredjian, Amélie 17 December 2010 (has links) Pseudomonas aeruginosa et Stenotrophomonas maltophilia, pathogènes opportunistes majeurs, pourraient acquérir leur résistance aux antibiotiques dans l’environnement, sous la pression exercée par les métaux lourds par co-sélection de résistance. Nous avons tout d’abord évalué la distribution et l’abondance de ces espèces dans un large panel de sols d’origine géographique différente (France et Afrique) et évalué l’influence d’activités anthropiques susceptibles d’exposer les sols en éléments métalliques sur cette distribution. Alors que la présence de P. aeruginosa est sporadique et plutôt liée à un apport exogène, S. maltophilia est présente dans tous les sols étudiés, suggérant son endémicité. L’évaluation des résistances des souches isolées de ces sols a également montré des différences entre les deux espèces. Les souches environnementales de P. aeruginosa sont pour la plupart caractérisées par un phénotype sauvage alors que celles de S. maltophilia présentent une grande diversité de phénotypes en fonction des sites, parfois similaires à ceux de souches cliniques. Cette diversité peut être attribuée à l’adaptation aux conditions environnementales très différentes rencontrées mais il est difficile d‘attribuer précisément aux métaux un rôle dans la co-sélection de ces résistances. L’étude menée sur la communauté bactérienne d’un sol contaminé a également permis de mettre en évidence une forte proportion de bactéries résistantes à différents antibiotiques représentée par des espèces qualifiées de pathogènes opportunistes ainsi que la présence du gène blaIMP, permettant la résistance à l’imipénème, utilisé en milieu clinique pour le traitement de clones multi-résistants. / Pseudomonas aeruginosa and Stenotrophomonas maltophilia, two major opportunistic pathogens, could acquire antibiotic resistance in the environment under heavy metal pressure that co-selects both resistances. We first investigated the distribution and abundance of these species in a wide range of soils of different geographical origin (France and Africa) and evaluated the influence of human activities that may expose soils to metallic elements on this distribution. While the presence of P. aeruginosa is rather sporadic and could be linked to exogenous intake, S. maltophilia is present in all studied soils, that suggests its endemicity. Evaluating resistance capacities of strains isolated from these soils also showed differences between the two species. Environmental strains of P. aeruginosa are mostly characterized by a wild type phenotype, whereas those of S. maltophilia present a wide diversity of phenotypes depending on the site, sometimes similar to those of clinical strains. This diversity could be attributed to a deep adaptation to the very different environmental conditions encountered in the original niche but it is difficult to attribute specifically to metals a role in coselection of resistance. The study conducted on the bacterial community present in a contaminated soil has also highlighted a high proportion of bacteria resistant to different antibiotics represented by species qualified as opportunistic pathogens and the presence of the gene blaIMP, enabling resistance to imipenem, used in the hospital to treat infections due to multidrug-resistant clones. Pseudomonas aeruginosa Stenotrophomonas maltophilia Résistance aux antibiotiques Résistance aux métaux Co-sélection de résistance Environnements anthropisés Pseudomonas aeruginosa Stenotrophomonas maltophilia Antibiotic resistance Metal resistance Resistance co-selection Impacted environments

1

Page generated in 0.0854 seconds