Global ETD Search

191	False Discovery Rates, Higher Criticism and Related Methods in High-Dimensional Multiple Testing Klaus, Bernd 09 January 2013 (has links) The technical advancements in genomics, functional magnetic-resonance and other areas of scientific research seen in the last two decades have led to a burst of interest in multiple testing procedures. A driving factor for innovations in the field of multiple testing has been the problem of large scale simultaneous testing. There, the goal is to uncover lower--dimensional signals from high--dimensional data. Mathematically speaking, this means that the dimension d is usually in the thousands while the sample size n is relatively small (max. 100 in general, often due to cost constraints) --- a characteristic commonly abbreviated as d >> n. In my thesis I look at several multiple testing problems and corresponding procedures from a false discovery rate (FDR) perspective, a methodology originally introduced in a seminal paper by Benjamini and Hochberg (2005). FDR analysis starts by fitting a two--component mixture model to the observed test statistics. This mixture consists of a null model density and an alternative component density from which the interesting cases are assumed to be drawn. In the thesis I proposed a new approach called log--FDR to the estimation of false discovery rates. Specifically, my new approach to truncated maximum likelihood estimation yields accurate null model estimates. This is complemented by constrained maximum likelihood estimation for the alternative density using log--concave density estimation. A recent competitor to the FDR is the method of \"Higher Criticism\". It has been strongly advocated in the context of variable selection in classification which is deeply linked to multiple comparisons. Hence, I also looked at variable selection in class prediction which can be viewed as a special signal identification problem. Both FDR methods and Higher Criticism can be highly useful for signal identification. This is discussed in the context of variable selection in linear discriminant analysis (LDA), a popular classification method. FDR methods are not only useful for multiple testing situations in the strict sense, they are also applicable to related problems. I looked at several kinds of applications of FDR in linear classification. I present and extend statistical techniques related to effect size estimation using false discovery rates and showed how to use these for variable selection. The resulting fdr--effect method proposed for effect size estimation is shown to work as well as competing approaches while being conceptually simple and computationally inexpensive. Additionally, I applied the fdr--effect method to variable selection by minimizing the misclassification rate and showed that it works very well and leads to compact and interpretable feature sets. info:eu-repo/classification/ddc/500 ddc:500
192	Meine Meldedaten gehören mir - Widerspruch jetzt! 15 May 2019 (has links) Vielen Bürgerinnen und Bürgern ist diese Praxis der Meldebehörden nicht bewusst. Sie können aber der Auskunftserteilung via Internet, der Weitergabe zu Werbezwecken sowie an Parteien und Adressbuchverlage beim Meldeamt widersprechen. Wir empfehlen dafür das Widerspruchsformular des Sächsischen Datenschutzbeauftragten. info:eu-repo/classification/ddc/340 ddc:340
193	Analyse des Routenwahlverhaltens von Radfahrenden auf Grundlage GPS basierter Daten zum real beobachteten Verkehrsverhalten Huber, Stefan 03 May 2022 (has links) Eine an den Bedürfnissen von Radfahrerinnen orientierte Planung ist für die Förde-rung des Radverkehrs unumgänglich. Um den Ausbau der Radverkehrsinfrastruktur ent-sprechend zu planen, sind jedoch Informationen zum Routenwahlverhalten der Radfah-renden und dessen Einflussfaktoren notwendig. Mit diesen Informationen können z.B. Wirkungen von Maßnahmen oder die zukünftige Nutzung von geplanter Radverkehrsinf-rastruktur abgeschätzt werden. Aus diesem Grund wurden vor allem in der letzten Dekade zahlreiche internationale Arbeiten vorangetrieben, die die Routenwahl auf Basis von GPS-basierten Daten zum real beobachteten Routenwahlverhalten untersuchten. Da die Studien meist für einzelne und nichtdeutsche Städte durchgeführt wurden, lassen sich die Ergebnisse jedoch nicht direkt auf den deutschen Raum übertragen. Um die Einflussfaktoren der Routenwahl für den deutschen Raum zu untersuchen, wird in dieser Arbeit die Routenwahl von Radfahrenden im Raum Dresden analysiert. Ba-sis der Untersuchung stellt ein während der Aktion STADTRADELN erhobener GPS-Daten-satz dar, der 18.459 Radfahrten von 1.361 Radfahrerinnen enthält und Auskunft über deren real beobachtetes Routenwahlverhalten gibt. Die erhobenen Routendaten wurden mit unterschiedlichen Verfahren aufbereitet und über ein Verkehrsnetz mit Sekundärda-ten angereichert, sodass die Eigenschaften der gewählten Routen berechnet und den je-weils erzeugten und nicht gewählten Routenalternativen gegenübergestellt werden konn-ten. Der Einfluss der unterschiedlichen Faktoren wurde anschließend mittels logistischer Regressionsanalyse untersucht. Das Ergebnis der Untersuchung zeigt auf, dass sich die folgenden Einflussfaktoren positiv auf die Routenwahl auswirken: + Existenz von Radverkehrsinfrastruktur + Geringe Längsneigungen + Gute Oberflächenbeschaffenheit (z.B. Asphalt) + Vorhandensein anderer Radfahrender entlang einer Route + Geringe zulässige Maximalgeschwindigkeit des motorisierten Verkehrs + Geringe Fahrstreifenanzahl des Kfz-Verkehrs + Durch Lichtsignalanlagen geregelte Knotenpunkte + Grün- und Wohngebietsflächen Demgegenüber üben die folgenden Einflussfaktoren einen negativen Einfluss auf die Wahl einer Route aus: – Zunehmende Distanz – Hohe Streckenanteile mit großer Längsneigung – Eine hohe maximale Längsneigung der Route – Geringe Breite der Radverkehrsführung – Höhere Unfallhäufigkeit entlang der Route Für einige Einflussfaktoren konnte kein (signifikanter) Einfluss auf die Routenwahl nachgewiesen werden. Dazu gehören bspw. die durchschnittlich tägliche Verkehrsstärke des Kfz-Verkehrs, der ruhende Verkehr oder die Häufigkeit von „rechts-vor-links“ geregel-ten Knotenpunkten. Die vorliegende Arbeit konnte dazu beitragen, die Wirkung der Einflussfaktoren auf die Routenwahl im Radverkehr einer deutschen Stadt zu quantifizieren. Eine Stärke der Untersuchung liegt in der Erhebung und Nutzung des umfangreichen GPS-Datensatzes zum real beobachteten Routenwahlverhalten der Radfahrerinnen. Das resultierende Routenwahlmodell kann für die Abschätzung von Maßnahmenwirkungen genutzt werden (z.B. im Rahmen der Verkehrsnachfragemodellierung). Eine kritische Reflexion der Ergebnisse und Methoden zeigt, dass die Qualität der Ana-lyse und ihrer Ergebnisse durch die Nutzung weiterer Sekundär- und Primärdaten sowie anderer Methoden weiter verbessert werden kann. Die Ergebnisse der Untersuchung geben einen Einblick in die Routenwahl der Radfah-renden des Untersuchungsgebiets. Im Rahmen der Arbeit konnte nicht geprüft werden, ob die Routenwahl der Radfahrerinnen in Dresden vergleichbar mit der Routenwahl in anderen deutschen Städten ist. Das entwickelte Verfahren kann jedoch auch für Analysen in anderen Städten angewendet werden, sodass verlässliche Aussagen zur Routenwahl deutscher Radfahrender in anderen deutschen Kommunen erarbeitet werden können. Es lassen sich insgesamt folgende Schlussfolgerungen ziehen: Um den Radfahrenden eine möglichst attraktive Radverkehrsinfrastruktur zu bieten und damit die Nutzung des Fahrrads zu fördern, sollte der Radverkehr auf eigener Radverkehrsinfrastruktur geführt werden, die möglichst breit ist und eine gute Oberflächenqualität besitzt. Entlang der Rad-routen sollten die Geschwindigkeiten des fließenden motorisierten Verkehrs möglichst reduziert werden. Die Bündelung von Radverkehrsströmen sowie die Führung durch Grünflächen kann zudem Fahrtkomfort und Sicherheitsempfinden erhöhen. / User-oriented bicycle planning is essential for the promotion of cycling in cities. In order to adequately plan and build cycling infrastructure, information on route choice be-haviour of cyclists and its influencing factors is essential. The knowledge about influencing factors of bicycle route choice can help to assess the effect of measures as well as future or potential utilisation of planned cycling infrastructure. Several international studies – particularly carried out in the last decade – examined route choice based on GPS-based revealed preference data. As the studies were mostly conducted for few and non-German cities the results, however, cannot be directly trans-ferred to the German area. In order to investigate bicycle route choice for the German area, the route choice of cyclists in the city of Dresden is analysed in this study. The basis of the investigation is a GPS data set collected during the CITY CYCLING campaign. It contains 18,459 bicycle trips by 1,361 cyclists and provides detailed information about the observed route choice be-haviour of cyclists. The collected route data were processed by using different methods. Linking the routes to a traffic supply network allowed enriching them with secondary data so that characteristics of the selected routes and the non-selected generated alternatives were determined. The influence of the different factors was investigated using logistic re-gression analysis. The results of the study reveal that the following influencing factors positively affect bicycle route choice: + Existence of cycling infrastructure + Low slopes + Good surface conditions (e.g. asphalt) + Presence of other cyclists along a route + Low maximum speed of motorised traffic + Few number of lanes for motorised traffic + Intersections controlled by traffic signals + Green and residential areas In contrast, the following factors show a negative influence on bicycle route choice: – Increasing distance – High proportion of routes with steep longitudinal gradients – A high maximum gradient along the route – Narrow width of the cycle lane – Higher accident frequency along the route For some influencing factors, no (significant) influence on route choice was found (e.g. for the average daily traffic volume of motorized traffic, stationary traffic or the frequency of right-of-way regulated intersections). The present study contributes to the quantification of influencing factors or rather their effect on bicycle route choice in a German city. Furthermore, the resulting route choice model can be used to estimate the effects of different measures (e.g. in the context of traffic demand modelling). A strength of the study is the extensive data set on revealed preferences of cyclists that has been used to analyse route choice behaviour. A critical reflection of the results and methods shows that the quality of the analysis and its results could further be improved by using more secondary and primary data as well as other models for analysis. The results of the study provide an insight into route choices of cyclists in the study area. However, it could not be determined whether route choice of cyclists in Dresden is similar to route choice of cyclists in other German cities. Nevertheless, the developed method could also be used to analyse route choice in other cities, so that reliable state-ments on route choice of German cyclists can be compiled. Overall, the following conclusions can be drawn: in order to provide cyclists the most attractive cycling infrastructure possible and, thus, promote the use of cycling in cities, bicycle traffic should be guided on dedicated and wide cycling infrastructure with good surface quality. Furthermore, the speed of moving motorised traffic should be reduced along the route segments if possible. Bundling of cycling traffic flows and routing cyclists through green spaces can furthermore increase riding comfort and the perception of safety. info:eu-repo/classification/ddc/380 ddc:380
194	Big Data im Radverkehr: Ergebnisbericht 'Mit Smartphones generierte Verhaltensdaten im Radverkehr' Francke, Angela, Becker, Thilo, Lißner, Sven 19 October 2018 (has links) Zusammenfassend lässt sich sagen, dass unter Beachtung der im folgenden Bericht beschriebenen Faktoren GPS-Daten, im vorliegenden Fall exemplarisch untersucht anhand von Daten der Firma Strava Inc., mit einigen Einschränkungen für die Radverkehrsplanung nutzbar sind. Bereits heute sind damit Auswertungen möglich, die zeigen, wo, wann und wie sich Radfahrende im gesamten Netz bewegen. Die mittels Smartphone-App generierten Daten können sehr sinnvoll als Ergänzung zu bestehenden Dauerzählstellen von Kommunen genutzt werden. Berücksichtigt werden sollten bei der Auswertung und Interpretation der Daten jedoch einige Aspekte, wie der eher sportlich orientierte Kontext der erfassten Routen in den untersuchten Beispielen. Des Weiteren werden aktuell die Daten zum Teil noch als Datenbank- oder GIS-Dateien zur Verfügung gestellt bzw. befinden sich Online- Masken zur einfacheren Nutzung im Aufbau oder einem ersten Nutzungsstadium. Die Auswertung und Interpretation erfordert also weiterhin Fachkompetenz und auch personelle Ressourcen. Der Einsatz dieser wird jedoch voraussichtlich durch die Weiterentwicklung von Web-Oberflächen und unterstützenden Auswertemasken abnehmen. Hier gilt es zukünftig, in Zusammenarbeit mit den Kommunen, die benötigten Parameter sowie die geeignetsten Aufbereitungsformen zu erarbeiten. Im Forschungsprojekt erfolgte ein Ansatz der Hochrechnung von Radverkehrsstärken aus Stichproben von GPS-Daten auf das gesamte Netz. Dieser konnte auch erfolgreich in einer weiteren Kommune verifiziert werden. Jedoch ist auch hier in Zukunft noch Forschungsbedarf vorhanden bzw. die Anpassung auf lokale Gegebenheiten notwendig. Radverkehr, GPS-Daten, Verkehrsverhalten cycling, GPS-tracks, traffic info:eu-repo/classification/ddc/380 ddc:380 Dresden Radfahrer Radfahrerverkehr Verkehrsverhalten Datensammlung Smarthphone Orientierung
195	Sachsenforst ...: Jahresbericht 28 December 2023 (has links) No description available. Sachsen, Forstverwaltung, Daten info:eu-repo/classification/ddc/330 ddc:330 info:eu-repo/classification/ddc/630 ddc:630 Sachsen; Forstverwaltung
196	Benchmarking AutoML for regression tasks on small tabular data in materials design Conrad, Felix, Mälzer, Mauritz, Schwarzenberger, Michael, Wiemer, Hajo, Ihlenfeldt, Steffen 05 March 2024 (has links) Machine Learning has become more important for materials engineering in the last decade. Globally, automated machine learning (AutoML) is growing in popularity with the increasing demand for data analysis solutions. Yet, it is not frequently used for small tabular data. Comparisons and benchmarks already exist to assess the qualities of AutoML tools in general, but none of them elaborates on the surrounding conditions of materials engineers working with experimental data: small datasets with less than 1000 samples. This benchmark addresses these conditions and draws special attention to the overall competitiveness with manual data analysis. Four representative AutoML frameworks are used to evaluate twelve domain-specific datasets to provide orientation on the promises of AutoML in the field of materials engineering. Performance, robustness and usability are discussed in particular. The results lead to two main conclusions: First, AutoML is highly competitive with manual model optimization, even with little training time. Second, the data sampling for train and test data is of crucial importance for reliable results. info:eu-repo/classification/ddc/370 ddc:370
197	A Flexible Graph-Based Data Model Supporting Incremental Schema Design and Evolution Braunschweig, Katrin, Thiele, Maik, Lehner, Wolfgang 26 January 2023 (has links) Web data is characterized by a great structural diversity as well as frequent changes, which poses a great challenge for web applications based on that data. We want to address this problem by developing a schema-optional and flexible data model that supports the integration of heterogenous and volatile web data. Therefore, we want to rely on graph-based models that allow to incrementally extend the schema by various information and constraints. Inspired by the on-going web 2.0 trend, we want users to participate in the design and management of the schema. By incrementally adding structural information, users can enhance the schema to meet their very specific requirements. info:eu-repo/classification/ddc/004 ddc:004
198	Geschäftsbericht / Staatsbetrieb Sachsenforst 26 January 2023 (has links) No description available. Sachsen, Forstverwaltung, Daten info:eu-repo/classification/ddc/330 ddc:330 info:eu-repo/classification/ddc/630 ddc:630 Sachsen; Forstverwaltung
199	Sachsenforst ...: Jahresbericht 26 January 2023 (has links) No description available. Sachsen, Forstverwaltung, Daten info:eu-repo/classification/ddc/330 ddc:330 info:eu-repo/classification/ddc/630 ddc:630 Sachsen; Forstverwaltung
200	OPEN—Enabling Non-expert Users to Extract, Integrate, and Analyze Open Data Braunschweig, Katrin, Eberius, Julian, Thiele, Maik, Lehner, Wolfgang 27 January 2023 (has links) Government initiatives for more transparency and participation have lead to an increasing amount of structured data on the web in recent years. Many of these datasets have great potential. For example, a situational analysis and meaningful visualization of the data can assist in pointing out social or economic issues and raising people’s awareness. Unfortunately, the ad-hoc analysis of this so-called Open Data can prove very complex and time-consuming, partly due to a lack of efficient system support.On the one hand, search functionality is required to identify relevant datasets. Common document retrieval techniques used in web search, however, are not optimized for Open Data and do not address the semantic ambiguity inherent in it. On the other hand, semantic integration is necessary to perform analysis tasks across multiple datasets. To do so in an ad-hoc fashion, however, requires more flexibility and easier integration than most data integration systems provide. It is apparent that an optimal management system for Open Data must combine aspects from both classic approaches. In this article, we propose OPEN, a novel concept for the management and situational analysis of Open Data within a single system. In our approach, we extend a classic database management system, adding support for the identification and dynamic integration of public datasets. As most web users lack the experience and training required to formulate structured queries in a DBMS, we add support for non-expert users to our system, for example though keyword queries. Furthermore, we address the challenge of indexing Open Data. info:eu-repo/classification/ddc/004 ddc:004

Search results