• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 339
  • 26
  • 21
  • 13
  • 8
  • 5
  • 5
  • 5
  • 4
  • 3
  • 2
  • 1
  • 1
  • 1
  • 1
  • Tagged with
  • 507
  • 507
  • 272
  • 270
  • 147
  • 135
  • 129
  • 128
  • 113
  • 92
  • 88
  • 77
  • 76
  • 74
  • 59
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
291

Analysis of Vegetation Vulnerability Dynamics and Driving Forces to Multiple Drought Stresses in a Changing Environment

Wei, Xiaoting, Huang, Shengzhi, Huang, Qiang, Liu, Dong, Leng, Guoyong, Yang, Haibo, Duan, Weili, Li, Jianfeng, Bai, Qingjun, Peng, Jian 15 January 2024 (has links)
Quantifying changes in the vulnerability of vegetation to various drought stresses in different seasons is important for rational and effective ecological conservation and restoration. However, the vulnerability of vegetation and its dynamics in a changing environment are still unknown, and quantitative attribution analysis of vulnerability changes has been rarely studied. To this end, this study explored the changes of vegetation vulnerability characteristics under various drought stresses in Xinjiang and conducted quantitative attribution analysis using the random forest method. In addition, the effects of ecological water transport and increased irrigation areas on vegetation vulnerability dynamics were examined. The standardized precipitation index (SPI), standardized precipitation-evapotranspiration index (SPEI), and standardized soil moisture index (SSMI) represent atmospheric water supply stress, water and heat supply stress, and soil water supply stress, respectively. The results showed that: (1) different vegetation types responded differently to water stress, with grasslands being more sensitive than forests and croplands in summer; (2) increased vegetation vulnerability under drought stresses dominated in Xinjiang after 2003, with vegetation growth and near-surface temperature being the main drivers, while increased soil moisture in the root zone was the main driver of decreased vegetation vulnerability; (3) vulnerability of cropland to SPI/SPEI/SSMI-related water stress increased due to the rapid expansion of irrigation areas, which led to increasing water demand in autumn that was difficult to meet; and (4) after ecological water transport of the Tarim River Basin, the vulnerability of its downstream vegetation to drought was reduced.
292

Machine Learning Methods for Predicting Trading Behaviour of an Actively Managed Mutual Fund

Forslund, Herman, Johnson, Marcus January 2021 (has links)
This paper aims to reverse engineer the tradingstrategy of an actively managed mutual fund by identifyingtechnical patterns in their trading. Investment strategies formany institutional investors consists of both fundamental andtechnical analysis. The purpose of the paper is to explore towhich extent the latter can be used to predict the trading actionsby taking some commonly used technical indicators as input invarious machine learning algorithms to assess patterns betweenthem and the trading of the fund. Furthermore, the technicalindicators’ ability to predict future prices is analysed using thesame methods. The results are not sufficiently clear to suggestthat the fund uses technical indicators to begin with, let alonewhich ones. As for the prediction of future prices, the technicalindicators appear to have some predictive ability. / Syftet med denna rapport är att prediktera handeln i en aktivt förvaltad aktiefond med hjälp av fyra maskininlärningsmetoder. Investeringsstrategier kombinerar i regel två analysmetoder, fundamental respektive teknisk analys. Avsikten med rapporten är att utforska huruvida det sistnämnda kan användas för att förutspå fondens handel genom att använda ett antal vanligt förekommande tekniska indikatorer och medelst maskininlärningsmetoder söka efter mönster mellan dessa och handeln. Vidare innefattar även studien en analys över hur väl tekniska indikatorer predikterar upprespektive nedgångar på aktiepriser. Vad gäller investeringsstrategierna återfanns inga tydliga samband mellan de utvalda indikatorerna och transaktionerna. Resultaten för andra delen av studien tyder på viss prediktiv förmåga för tekniska indikatorer på marknadsrörelser. / Kandidatexjobb i elektroteknik 2021, KTH, Stockholm
293

Machine Learning-based Biometric Identification

Israelsson, Hanna, Wrife, Andreas January 2021 (has links)
With the rapid development of computers andmodels for machine learning, image recognition has, in recentyears, become widespread in various areas. In this report, imagerecognition is discussed in relation to biometric identificationusing fingerprint images. The aim is to investigate how well abiometric identification model can be trained with an extendeddataset, which resulted from rotating and shifting the images inthe original dataset consisting of very few images. Furthermore,it is investigated how the accuracy of this single-stage modeldiffers from the accuracy of a model with two-stage identification.We chose Random Forest (RF) as the machine learning modeland Scikit default values for the hyperparameters. We furtherincluded five-fold cross-validation in the training process. Theperformance of the trained machine learning model is evaluatedwith testing accuracy and confusion matrices. It was shown thatthe method for extending the dataset was successful. A greaternumber of images gave a greater accuracy in the predictions.Two-stage identification gave approximately the same accuracyas the single-stage method, but both methods would need tobe tested on datasets with images from a greater number ofindividuals before any final conclusions can be drawn. / Tack vare den snabba utvecklingen av datoreroch modeller för maskininlärning har bildigenkänning desenaste åren fått stor spridning i samhället. I denna rapportbehandlas bildigenkänning i relation till biometrisk identifieringi form av fingeravtrycksavläsning. Målet är att undersöka hurväl en modell för biometrisk identifiering kan tränas och testaspå ett dataset med ursprungligen mycket få bilder, om datasettetförst expanderas genom att flertalet kopior av originalbildernaskapas och sedan roteras och förskjuts i olika riktningar.Vidare undersöks hur noggrannheten för denna enstegsmodellskiljer sig jämfört med identifiering i två steg. Vi valdeRandom Forest (RF) som maskininlärningsmodell och Scikitsstandardinställningar för hyperparametrarna. Vidare inkluderadesfemfaldig korsvalidering i träningsprocessen. Prestandanhos den tränade maskininlärningsmodellen bedömdes med hjälpav testnoggrannhet och confusion matriser. Det visades sig attmetoden för att expandera datasettet var framgångsrik. Ettstörre antal bilder gav större noggrannhet i förutsägelserna.Tvåstegsidentifiering gav ungefärligen samma noggrannhet somenstegsidentifiering, men metoderna skulle behöva testas på datamängder med bilder från ett större antal individer innannågra slutgiltiga slutsatser kan dras. / Kandidatexjobb i elektroteknik 2021, KTH, Stockholm
294

Application of data-driven models in exploring cyanobacterial bloom risks in Lake Mälaren / Tillämpning av datadrivna modeller för att utforska cyanobakterieblomningsrisker i Mälaren

Herrera, Abigail Huertas January 2021 (has links)
Cyanobacteria are a unique organism, a bacterium that develop photosynthesis, thus it contains chlorophyll, a pigment commonly associated to algae. For this reason, cyanobacteria are also known as blue-green algae. One of the secondary metabolites of cyanobacteria is cyanotoxin, a substance which is hepatoxic, neurotoxic, and dermatoxic. The frequency and intensity of cyanobacterial blooms have been of increasing concern in the last decades for drinking water supply. There is a need to improve monitoring of cyanobacteria content at source water for drinking water supply and its indicators and correlation with other chemical, physical and biological parameters. This study aims to identify the potential cyanobacterial bloom risk in Lake Mälaren by determining the influential chemical and physical parameters using Random Forest in classification mode. The classification was done using the WHO Alert Level Frameworks and study cases for lakes in Sweden. The data used to model was downloaded from the website of the Swedish University of Agricultural Science. It comprises 33 monitoring stations from 1964 to 2020, 21 chemical parameters, including cyanobacteria biovolume and chlorophyll content. Given the heterogeneity of data, the monitoring stations were grouped into Clusters. Using the data, statistical, correlation, time series, and principal component analysis were performed. Through these methods, spatial, distribution and temporal analysis were obtained. Afterwards, several models were determined using Random Forest. However, the mean values of cyanobacteria distributed over time indicated a medium risk, the maximum values suggested high risk in several areas of the Lake. Maximum concentrations were present at the west and northeast of the Lake, where the major inflows from the Watershed are discharged. As the water flows through the basin, the concentration of cyanobacteria reduces by half, which suggested that the large and deep bays act as sedimentation ponds. A very high correlation was found between the Cluster 5 and 6, east and middle northeast of the Lake, respectively. Finally, the contributing factors identified after modelling cyanobacteria as target factor were chlorophyll, month, water temperature, oxygen content, transparency, NO2NO3N, TN/TP, Ca, Mg and Cl. / Cyanobakterier är unika organismer, bakterier som utvecklar fotosyntes, så de innehåller klorofyll, ett pigment som vanligtvis förknippas med alger. Av denna anledning är cyanobakterier också kända som blågröna alger. En av de sekundära metaboliterna av cyanobakterier är cyanotoxin, ett ämne som är hepatoxiskt, neurotoxiskt och dermatoxiskt. Frekvensen och intensiteten av cyanobakterieblomningar har varit ett ökande problem under de senaste decennierna för dricksvattenförsörjningen. Många vattenreningsverk mäter inte innehållet av cyanobakterier i vatten; medan andra kemiska, fysikaliska och biologiska parametrar mäts. Denna studie syftar till att identifiera den potentiella risken för cyanobakteriell blomning i Mälaren genom att bestämma de mest inflytelserika kemiska och fysikaliska parametrarna med hjälp av metoden Random Forest i klassificeringsläge. Klassificeringen gjordes med hjälp av WHO Alert Level Frameworks och olika studier av sjöar i Sverige. Data som användes för att modellera laddades ner från Sveriges Lantbruksuniversitets webbplats. Den omfattar 33 övervakningsstationer från 1964 till 2020, med 21 kemiska parametrar, inklusive cyanobakteriers biovolym och klorofyllhalt. Med tanke på heterogeniteten i data grupperades övervakningsstationerna i kluster. Med hjälp av data utfördes statistisk analys, korrelation, tidsserier och huvudkomponentanalys. Genom dessa metoder erhölls rumslig, distribution och tidsanalys. Efteråt bestämdes flera modeller med hjälp av Random Forest. Medelvärdena för cyanobakterier fördelade över tiden indikerade en medelrisk, medan maximivärdena antydde något annat. Maximala koncentrationer fanns i väster och nordost om Mälaren, där de stora inflödena från vattendelaren släpps ut. När vattnet rinner genom bassängen minskar koncentrationen av cyanobakterier till hälften, vilket tyder på att de stora och djupa vikarna fungerar som sedimentationsdammar. En mycket hög korrelation hittades mellan kluster 5 och 6, öster respektive mellan nordost om sjön. Slutligen var de viktigaste faktorerna som identifierades efter modellering av cyanobakterier som målfaktor klorofyll, månad, vattentemperatur, syrehalt, transparens, NO2NO3N, TN/TP, Ca, Mg och Cl.
295

Using random forest and decision tree models for a new vehicle prediction approach in computational toxicology

Mistry, Pritesh, Neagu, Daniel, Trundle, Paul R., Vessey, J.D. 22 October 2015 (has links)
Yes / Drug vehicles are chemical carriers that provide beneficial aid to the drugs they bear. Taking advantage of their favourable properties can potentially allow the safer use of drugs that are considered highly toxic. A means for vehicle selection without experimental trial would therefore be of benefit in saving time and money for the industry. Although machine learning is increasingly used in predictive toxicology, to our knowledge there is no reported work in using machine learning techniques to model drug-vehicle relationships for vehicle selection to minimise toxicity. In this paper we demonstrate the use of data mining and machine learning techniques to process, extract and build models based on classifiers (decision trees and random forests) that allow us to predict which vehicle would be most suited to reduce a drug’s toxicity. Using data acquired from the National Institute of Health’s (NIH) Developmental Therapeutics Program (DTP) we propose a methodology using an area under a curve (AUC) approach that allows us to distinguish which vehicle provides the best toxicity profile for a drug and build classification models based on this knowledge. Our results show that we can achieve prediction accuracies of 80 % using random forest models whilst the decision tree models produce accuracies in the 70 % region. We consider our methodology widely applicable within the scientific domain and beyond for comprehensively building classification models for the comparison of functional relationships between two variables.
296

Machine learning assisted convective wall heat transfer models for fire modeling along vertical walls, ceilings and floors

Jie Tao (18859882) 24 June 2024 (has links)
<p dir="ltr">Fires cause significant casualties and property damage. As critical component of indoor and building fires, fires along a surface (vertical or horizontal) contribute significantly to fire spreading and resulted damage. Accurately predicting the interactions between a wall surface and fire is crucial to minimizing losses. Computational methods, such as large-eddy simulations (LES), can result in errors in fire modeling along a surface due to various model and numerical errors among which the error in the convective wall heat transfer models is an important source. The convective heat transfer model error grows when the grid resolution near a thermal boundary layer along a wall surface decreases. Traditional wall-function based heat transfer models, mostly developed for forced convection heat transfer problems, tend to fail in the buoyancy-driven fire wall heat transfer. It is imperative to develop accurate and efficient convective wall heat transfer models for fire modeling.</p><p dir="ltr">In this study, machine learning is employed as an alternative to traditional physics-based modeling approach for wall heat transfer in fire modeling. A significant advantage of machine learning over physics-based modeling is that machine learning does not require thorough knowledge of fire wall heat transfer which is generally hard to acquire due to the complexity of the problem. A machine-learning assisted convective wall heat transfer model, aiming to enhance wall fire predictions, is developed in this work. The objective is to improve predictions of convective heat flux to a wall in under-resolved LES of wall fires. An amplification factor ($\beta$) is introduced to compensate the under-prediction of temperature gradients normal to a wall surface in coarse grid simulations. Machine learning is then employed to assist the construction of models for $\beta$ with the training data obtained directly from fine-resolution LES. Extensive studies are conducted to identify suitable machine learning architecture, input features, training data generation strategies, training procedure, and testing and validation approaches.</p><p dir="ltr">A vertical wall fire test case is considered first to develop a baseline machine learning model. The focus is on identifying suitable input features and training strategies for machine learning of convective wall fire heat transfer. A four-parameter (input) machine learning model for $\beta$ is constructed. Both \textit{a priori} and \textit{a posteriori} testing are developed in the vertical wall fire case to provide preliminary model performance assessment. The fully tested model is also examined in an intermediate-scale parallel-wall fire spreading case that was not seen in the model training to assess the applicability of the developed machine learning model. In general, excellent model performance is observed in the vertical wall fire case.</p><p dir="ltr">The established machine learning approach for the vertical wall case is then extended to horizontal surfaces like floor fires and ceiling fires to expand the training scope of the machine learning model. The unique challenges in these new fire scenarios are investigated separately to identify the need of additional input features and training strategies. It is found that a fifth input parameter, in addition to the four parameters identified in the vertical wall, is generally needed in order to correctly identify different fire scenarios. Data augmentation techniques are also found to be a useful technique to handle data sparsity during model training. Different machine learning architectures like random forest and deep neural network are also compared.</p><p dir="ltr">The above studies are finally integrated into a unified machine learning model suitable for both vertical and horizontal surfaces. Extensive testing shows that the unified model reproduces the model performance of the separately trained models. The work is significant in demonstrating the feasibility of using machine learning approaches to enhance fire simulations. The developed machine learning modeling techniques improve predictions in various fire scenarios by using relatively coarse grid to maintain low computational cost, a critical consideration when simulation approaches are employed in real fire simulations.</p>
297

The Foundation of Pattern Structures and their Applications

Lumpe, Lars 06 October 2021 (has links)
This thesis is divided into a theoretical part, aimed at developing statements around the newly introduced concept of pattern morphisms, and a practical part, where we present use cases of pattern structures. A first insight of our work clarifies the facts on projections of pattern structures. We discovered that a projection of a pattern structure does not always lead again to a pattern structure. A solution to this problem, and one of the most important points of this thesis, is the introduction of pattern morphisms in Chapter4. Pattern morphisms make it possible to describe relationships between pattern structures, and thus enable a deeper understanding of pattern structures in general. They also provide the means to describe projections of pattern structures that lead to pattern structures again. In Chapter5 and Chapter6, we looked at the impact of morphisms between pattern structures on concept lattices and on their representations and thus clarified the theoretical background of existing research in this field. The application part reveals that random forests can be described through pattern structures, which constitutes another central achievement of our work. In order to demonstrate the practical relevance of our findings, we included a use case where this finding is used to build an algorithm that solves a real world classification problem of red wines. The prediction accuracy of the random forest is better, but the high interpretability makes our algorithm valuable. Another approach to the red wine classification problem is presented in Chapter 8, where, starting from an elementary pattern structure, we built a classification model that yielded good results.
298

Order in the random forest

Karlsson, Isak January 2017 (has links)
In many domains, repeated measurements are systematically collected to obtain the characteristics of objects or situations that evolve over time or other logical orderings. Although the classification of such data series shares many similarities with traditional multidimensional classification, inducing accurate machine learning models using traditional algorithms are typically infeasible since the order of the values must be considered. In this thesis, the challenges related to inducing predictive models from data series using a class of algorithms known as random forests are studied for the purpose of efficiently and effectively classifying (i) univariate, (ii) multivariate and (iii) heterogeneous data series either directly in their sequential form or indirectly as transformed to sparse and high-dimensional representations. In the thesis, methods are developed to address the challenges of (a) handling sparse and high-dimensional data, (b) data series classification and (c) early time series classification using random forests. The proposed algorithms are empirically evaluated in large-scale experiments and practically evaluated in the context of detecting adverse drug events. In the first part of the thesis, it is demonstrated that minor modifications to the random forest algorithm and the use of a random projection technique can improve the effectiveness of random forests when faced with discrete data series projected to sparse and high-dimensional representations. In the second part of the thesis, an algorithm for inducing random forests directly from univariate, multivariate and heterogeneous data series using phase-independent patterns is introduced and shown to be highly effective in terms of both computational and predictive performance. Then, leveraging the notion of phase-independent patterns, the random forest is extended to allow for early classification of time series and is shown to perform favorably when compared to alternatives. The conclusions of the thesis not only reaffirm the empirical effectiveness of random forests for traditional multidimensional data but also indicate that the random forest framework can, with success, be extended to sequential data representations.
299

Three-dimensional geometric image analysis for interventional electrophysiology

McManigle, John E. January 2014 (has links)
Improving imaging hardware, computational power, and algorithmic design are driving advances in interventional medical imaging. We lay the groundwork here for more effective use of machine learning and image registration in clinical electrophysiology. To achieve identification of atrial fibrosis using image data, we registered the electroanatomic map (EAM) data of atrial fibrillation (AF) patients undergoing pulmonary vein isolation (PVI) with MR (n = 16) or CT (n = 18) images. The relationship between image features and bipolar voltage was evaluated using single-parameter regression and random forest models. Random forest performed significantly better than regression, identifying fibrosis with area under the receiver operating characteristic curve (AUC) 0.746 (MR) and 0.977 (CT). This is the first evaluation of voltage prediction using image data. Next, we compared the character of native atrial fibrosis with ablation scar in MR images. Fourteen AF patients undergoing repeat PVI were recruited. EAM data from their first PVI was registered to the MR images acquired before the first PVI (‘pre-operative’) and before the second PVI ('post-operative' with respect to the first PVI). Non-ablation map points had similar characteristics in the two images, while ablation points exhibited higher intensity and more heterogeneity in post-operative images. Ablation scar is more strongly enhancing and more heterogeneous than native fibrosis. Finally, we addressed myocardial measurement in 3-D echocardiograms. The circular Hough transform was modified with a feature asymmetry filter, epicardial edges, and a search constraint. Manual and Hough measurements were compared in 5641 slices from 3-D images. The enhanced Hough algorithm was more accurate than the unmodified version (Dice coefficient 0.77 vs. 0.58). This method promises utility in segmentation-assisted cross-modality registration. By improving the information that can be extracted from medical images and the ease with which that information can be accessed, this progress will contribute to the advancing integration of imaging in electrophysiology.
300

Uplift Modeling : Identifying Optimal Treatment Group Allocation and Whom to Contact to Maximize Return on Investment

Karlsson, Henrik January 2019 (has links)
This report investigates the possibilities to model the causal effect of treatment within the insurance domain to increase return on investment of sales through telemarketing. In order to capture the causal effect, two or more subgroups are required where one group receives control treatment. Two different uplift models model the causal effect of treatment, Class Transformation Method, and Modeling Uplift Directly with Random Forests. Both methods are evaluated by the Qini curve and the Qini coefficient. To model the causal effect of treatment, the comparison with a control group is a necessity. The report attempts to find the optimal treatment group allocation in order to maximize the precision in the difference between the treatment group and the control group. Further, the report provides a rule of thumb that ensure that the control group is of sufficient size to be able to model the causal effect. If has provided the data material used to model uplift and it consists of approximately 630000 customer interactions and 60 features. The total uplift in the data set, the difference in purchase rate between the treatment group and control group, is approximately 3%. Uplift by random forest with a Euclidean distance splitting criterion that tries to maximize the distributional divergence between treatment group and control group performs best, which captures 15% of the theoretical best model. The same model manages to capture 77% of the total amount of purchases in the treatment group by only giving treatment to half of the treatment group. With the purchase rates in the data set, the optimal treatment group allocation is approximately 58%-70%, but the study could be performed with as much as approximately 97%treatment group allocation.

Page generated in 0.11 seconds