Spelling suggestions: "subject:"data deriven algorithm"" "subject:"data deriven allgorithm""
1 |
A Data-Driven Algorithm for Parameter Estimation in the Parametric Survival Mixture ModelZhang, Jin 12 1900 (has links)
<p> We propose a data-driven estimation algorithm in survival mixture model. The objective of
this study is to provide an alternative fitting procedure to the conventional EM algorithm.
The EM algorithm is the classical ML fitting of the parametric mixture model. If the initial
values for the EM algorithm are not properly chosen, the maximizers might be local or
divergent. Traditionally, initial values are given manually according to experience or a gridpoint
search. This is a heavy burden for a high-dimensional data sets. Also, specifying the
ranges of parameters for a grid-point search is difficult. To avoid the specification of initial
values, we employ the random partition. Then, improvement of fitting is adjusted according
to model specification. This process is repeated a large number of times, so it is computer intensive.
The large repetitions makes the solution more likely to be the global maximizer,
and it is driven purely by the data. We conduct a simulation study for three cases of
two-component Log-Normal, two-component Weibull, and two-component Log-Normal and
Wei bull, in order to illustrate the effectiveness of the proposed algorithm. Finally, we apply
our algorithm to a breast cancer study data which follows a cure model. The program is
written in R. It calls existing R functions, so it is flexible to use in regression situations where
model formula must be specified. </p> / Thesis / Master of Science (MSc)
|
2 |
Processing Turkish Radiology ReportsHadimli, Kerem 01 May 2011 (has links) (PDF)
Radiology departments utilize various visualization techniques of patients&rsquo / bodies, and narrative free text reports describing the findings in these visualizations are written by medical doctors. The information within these narrative reports is required to be extracted for medical information systems. Turkish is an highly agglutinative language and this poses problems in information retrieval and extraction from Turkish free texts.
In this thesis one rule-based and one data-driven alternate methods for information retrieval and structured information extraction from Turkish radiology reports are presented. Contrary to previous studies in medical NLP systems, both of these methods do not utilize any medical lexicon or ontology.
Information extraction is performed on the level of extracting medically related phrases from the sentence. The aim is to measure baseline performance Turkish language can provide for medical information extraction and retrieval, in isolation of other factors.
|
3 |
A Multi-Sensor Data Fusion Approach for Real-Time Lane-Based Traffic EstimationJanuary 2015 (has links)
abstract: Modern intelligent transportation systems (ITS) make driving more efficient, easier, and safer. Knowledge of real-time traffic conditions is a critical input for operating ITS. Real-time freeway traffic state estimation approaches have been used to quantify traffic conditions given limited amount of data collected by traffic sensors. Currently, almost all real-time estimation methods have been developed for estimating laterally aggregated traffic conditions in a roadway segment using link-based models which assume homogeneous conditions across multiple lanes. However, with new advances and applications of ITS, knowledge of lane-based traffic conditions is becoming important, where the traffic condition differences among lanes are recognized. In addition, most of the current real-time freeway traffic estimators consider only data from loop detectors. This dissertation develops a bi-level data fusion approach using heterogeneous multi-sensor measurements to estimate real-time lane-based freeway traffic conditions, which integrates a link-level model-based estimator and a lane-level data-driven estimator.
Macroscopic traffic flow models describe the evolution of aggregated traffic characteristics over time and space, which are required by model-based traffic estimation approaches. Since current first-order Lagrangian macroscopic traffic flow model has some unrealistic implicit assumptions (e.g., infinite acceleration), a second-order Lagrangian macroscopic traffic flow model has been developed by incorporating drivers’ anticipation and reaction delay. A multi-sensor extended Kalman filter (MEKF) algorithm has been developed to combine heterogeneous measurements from multiple sources. A MEKF-based traffic estimator, explicitly using the developed second-order traffic flow model and measurements from loop detectors as well as GPS trajectories for given fractions of vehicles, has been proposed which gives real-time link-level traffic estimates in the bi-level estimation system.
The lane-level estimation in the bi-level data fusion system uses the link-level estimates as priors and adopts a data-driven approach to obtain lane-based estimates, where now heterogeneous multi-sensor measurements are combined using parallel spatial-temporal filters.
Experimental analysis shows that the second-order model can more realistically reproduce real world traffic flow patterns (e.g., stop-and-go waves). The MEKF-based link-level estimator exhibits more accurate results than the estimator that uses only a single data source. Evaluation of the lane-level estimator demonstrates that the proposed new bi-level multi-sensor data fusion system can provide very good estimates of real-time lane-based traffic conditions. / Dissertation/Thesis / Doctoral Dissertation Industrial Engineering 2015
|
Page generated in 0.0751 seconds