311

Hardware Support for FPGA Resource Elasticity

Aliyeva, Fidan January 2022
FPGAs are commonly used in cloud computing because they can be programmed as processors that serve a specific purpose, achieving high performance at low power. At the same time, FPGAs offer abundant resources, which are wasted if a device hosts a single application or serves a single user's request. Partial reconfiguration technology lets an FPGA divide its resources into regions and dynamically reprogram those regions with different applications at runtime, and is therefore considered a good solution to the resource underutilization problem. Nevertheless, the sizes of these regions are static; they cannot be increased or decreased once defined, which again leads to underutilization of reconfigurable-region resources. This thesis addresses that problem: how to dynamically grow and shrink partially reconfigurable FPGA resources to match an application's needs. Our solution enables expanding and contracting the FPGA resources allocated to an application by 1) expressing an application's acceleration requirements as multiple smaller modules, which are configured into multiple reconfigurable regions dynamically assigned to the application, and 2) providing a low-area-overhead, configurable, and isolated communication mechanism among those regions by adapting a crossbar interconnect and the WISHBONE interface.
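To make the elasticity mechanism concrete, here is a minimal Python sketch of a region manager that grows and shrinks the set of reconfigurable regions assigned to an application. It is an illustration only: the names (ElasticRegionManager, expand, contract, the .bit files) are invented, and a real implementation loads partial bitstreams through the FPGA's configuration port and wires the regions through the crossbar/WISHBONE interconnect rather than a Python list.

```python
# Hypothetical sketch of elastic region allocation; not the thesis's code.
# An accelerator is split into modules; each module occupies one partially
# reconfigurable region, and regions are granted or reclaimed at runtime.

class ElasticRegionManager:
    def __init__(self, num_regions):
        self.regions = [None] * num_regions   # None marks a free region

    def expand(self, app, module_bitstreams):
        """Grant one free region per module; return the regions claimed."""
        claimed = []
        for bitstream in module_bitstreams:
            try:
                idx = self.regions.index(None)
            except ValueError:
                break   # no free region: the app keeps its current allocation
            self.regions[idx] = (app, bitstream)   # models partial reconfiguration
            claimed.append(idx)
        return claimed

    def contract(self, app, count):
        """Release up to `count` regions held by `app`."""
        for idx, slot in enumerate(self.regions):
            if count == 0:
                break
            if slot is not None and slot[0] == app:
                self.regions[idx] = None
                count -= 1

mgr = ElasticRegionManager(num_regions=4)
print(mgr.expand("matmul", ["pe0.bit", "pe1.bit"]))   # grow to two regions
mgr.contract("matmul", 1)                             # shrink back to one
```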
312

Introduction de mécanismes de tolérance aux pannes franches dans les architectures de processeur « many-core » à mémoire partagée cohérente / Introduction of Fault-Tolerance Mechanisms for Permanent Failures in Coherent Shared-Memory Many-Core Architectures

Fuguet Tortolero, César 25 November 2015
The ever-increasing performance demands of applications such as cryptography, scientific simulation, network packet dispatching, signal processing, and general-purpose computing have made many-core architectures a necessary trend in processor design. These architectures can contain hundreds or even thousands of processor cores, providing high computational throughput with reasonable power consumption. However, their high transistor density makes many-core architectures more prone to hardware failures: growing fabrication process variability and transistor stress factors degrade both manufacturing yield and lifetime. We therefore propose a complete in-the-field, reconfiguration-based recovery mechanism for permanent failures, allowing coherent shared-memory many-core processors to operate in a degraded mode. The mechanism relies on firmware stored in distributed on-chip memories and executed by the cores at each hardware reset, without any external intervention. It consists of distributed software procedures that locate the faulty components (cores, memory banks, and network-on-chip routers), reconfigure the hardware architecture, and provide a description of the functional hardware infrastructure to the operating system. The mechanism handles manufacturing defects as well as aging-related failures occurring after the chip is deployed in equipment. Our proposal is evaluated using a cycle-accurate SystemC virtual prototype of an existing many-core architecture; we evaluate both its latency and its silicon cost.
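As a rough, non-authoritative illustration of what the boot-time firmware produces, the Python sketch below models its end result: every component is self-tested and a map of the functional hardware is assembled for the operating system. The function names and the fault set are hypothetical; the real firmware is distributed across on-chip memories, runs on the cores themselves, and also reconfigures the network-on-chip routing.

```python
# Toy model of degraded-mode boot: self-test each component, then hand
# the operating system a map containing only the functional hardware.

def self_test(component_id, faulty_set):
    # stand-in for the real built-in self-test of a core, bank, or router
    return component_id not in faulty_set

def build_functional_map(cores, memory_banks, routers, faulty_set):
    return {
        "cores":        [c for c in cores if self_test(c, faulty_set)],
        "memory_banks": [m for m in memory_banks if self_test(m, faulty_set)],
        "routers":      [r for r in routers if self_test(r, faulty_set)],
    }

# A small many-core with one failed core and one failed router:
fmap = build_functional_map(
    cores=["core0", "core1", "core2", "core3"],
    memory_banks=["bank0", "bank1"],
    routers=["r0", "r1", "r2", "r3"],
    faulty_set={"core2", "r3"},
)
print(fmap)   # degraded-mode description handed to the OS at reset
```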
313

Metodika návrhu systémů odolných proti poruchám do omezeného implementačního prostoru na bázi FPGA / Methodology for Fault Tolerant Systems Design into Limited Implementation Area in FPGA

Mičulka, Lukáš January 2017
This thesis describes a proposed methodology for designing fault-tolerant systems in FPGAs that protects a system against the effects of both transient and permanent faults. A transient fault is repaired by partial dynamic reconfiguration. Recovery from a limited number of permanent faults relies on fault-tolerant architectures that use fewer resources than the previously deployed architecture, so the faulty part of the FPGA is no longer used. The technique is based on precompiled configurations stored in an external memory; to reduce the memory needed to store the configuration bitstreams, a relocation technique is applied.
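A minimal sketch of the precompiled-configuration idea, with invented names and data: the stored configurations are ordered from largest to smallest resource footprint, and after a permanent fault the first one whose regions avoid the faulty area is loaded from external memory. Bitstream relocation, not modeled here, further cuts storage by letting one stored bitstream be moved between compatible regions.

```python
# Hedged sketch: selecting a precompiled configuration that avoids the
# faulty FPGA regions. Configuration names and region sets are made up.

PRECOMPILED = [
    {"name": "tmr_full",    "regions": {0, 1, 2, 3}},  # most resources
    {"name": "tmr_reduced", "regions": {0, 1, 2}},     # smaller variant
    {"name": "duplex",      "regions": {0, 1}},        # minimal architecture
]

def select_configuration(faulty_regions):
    for cfg in PRECOMPILED:                 # largest footprint first
        if not (cfg["regions"] & faulty_regions):
            return cfg["name"]
    raise RuntimeError("no stored configuration avoids the faulty regions")

print(select_configuration(set()))   # tmr_full
print(select_configuration({3}))     # tmr_reduced: region 3 is never used again
```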
314

Wissenschaftliche Schriftenreihe EINGEBETTETE, SELBSTORGANISIERENDE SYSTEME / Scientific Book Series EMBEDDED, SELF-ORGANISING SYSTEMS

Hardt, Wolfram 23 April 2013
This newly launched book series addresses a highly topical subject in computer engineering: embedded, self-organising systems (ESS). For years, embedded systems have been permeating our everyday lives in almost all areas. From automated door-opening systems, through service machines with complex control such as washing machines, to mobile, personally assigned devices such as mobile phones and handheld computers, embedded systems have become a matter of course. New demands from customers, who expect innovations at ever shorter intervals, and rising fixed costs for setting up a product line have introduced a new aspect into the design and operation of embedded systems: self-organisation. Individual aspects of self-organisation include self-diagnosis, self-test, self-healing, and the static as well as dynamic reconfiguration of systems. Here the aspects of functionality and of communication must be distinguished; both have a great influence on the performance and stability of an embedded system. In the area of communication, the interfaces that connect the components of an embedded system are of particular interest.
315

Untersuchungen zur Kostenoptimierung für Hardware-Emulatoren durch Anwendung von Methoden der partiellen Laufzeitrekonfiguration / Investigations into cost optimization for hardware emulators by applying methods of partial runtime reconfiguration

Beckert, René 13 June 2013
This volume of the scientific book series Embedded Self-Organising Systems is devoted to optimizing hardware emulators by applying methods of partial runtime reconfiguration. Current circuit and system designs face increasingly divergent requirements: a very short development time for rapid market introduction stands against the need for design test coverage that is as comprehensive as possible, in order to avoid expensive and time-consuming re-designs. To reduce test time, FPGA-based hardware emulators are predominantly used. However, the rising complexity of current designs significantly reduces emulator performance, while the FPGAs used in emulators are increasingly partially reconfigurable at runtime. The approach implemented in this work applies runtime reconfiguration methods to hardware emulation. This first requires partitioning the design under test into system parts that are as functionally independent as possible. For optimized and resource-efficient placement of the individual hardware modules during emulation, a communication network, likewise placed on the FPGA, is implemented; a fully automated design flow covers the partitioning of designs, the modifications needed for dynamic reconfiguration, and its scheduling. The presented approach is illustrated with several examples, letting the reader appreciate the power of the developed methodology and encouraging its transfer to further use cases.
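The scheduling aspect can be pictured with a small Python sketch, assuming a greedy first-fit packing of design partitions into reconfiguration time slots; the automated flow described in this volume works on real FPGA resources and timing, not the abstract area units used here.

```python
# Toy scheduler: time-multiplex design partitions onto an emulator FPGA
# that is too small to hold the full design at once. Each slot corresponds
# to one partial-reconfiguration step.

def schedule_partitions(partitions, fpga_capacity):
    """Greedy first-fit-decreasing packing of partitions into slots."""
    slots = []   # each slot: partitions resident on the FPGA together
    for name, size in sorted(partitions.items(), key=lambda p: -p[1]):
        for slot in slots:
            if sum(partitions[n] for n in slot) + size <= fpga_capacity:
                slot.append(name)
                break
        else:
            slots.append([name])   # open a new reconfiguration slot
    return slots

design = {"cpu_core": 60, "dsp": 30, "uart": 10, "dma": 25, "crypto": 40}
for i, slot in enumerate(schedule_partitions(design, fpga_capacity=100)):
    print(f"slot {i}: {slot}")
```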
316

Adapt or grow obsolete: A study on developing dynamic capabilities in a post-IPO setting

Sedman Jaensson, Daniel, Littzell, Tom January 2022
As technology accelerates the dynamism of global markets, companies are forced to adapt to the changing landscape, and a company's success in navigating the external environment depends on its dynamic capabilities. One of the most critical moments for many companies is when they conduct an IPO. Research into how an IPO affects a firm is necessary, since newly listed firms have been shown to underperform compared to private firms. Dynamic capabilities have been observed to be critically important for IPO firms, and previous research suggests that an IPO changes processes that affect a firm's ability to develop dynamic capabilities. This study explores how the development of dynamic capabilities is affected by changes to microfoundations following an IPO. It gains insight into the subject through interviews with executives from Swedish firms that underwent the IPO process during 2018-2020. The empirical findings indicate that seven microfoundations are affected by going public. The study contributes to the prevailing literature by laying the groundwork for future research on how a firm's ability to develop dynamic capabilities is affected by going public, and it has the potential to serve as a practical tool for strategic decisions during and after an IPO process.
317

FPGA Floor-Planning Impact on Implementation Results

Lamprecht, Jaren Tyler 14 November 2012
The field-programmable gate array (FPGA) is an attractive computational platform for many applications because of its customizable nature and modest development cost, in terms of both time and money. As FPGAs scale to larger logical capacities, designers gain flexibility, but the FPGA placement problem becomes harder. Increasingly, designers are encouraged to structure designs hierarchically and to floor-plan: a manual process that maps specified design submodules to selected physical regions of the FPGA device fabric. This thesis explores several of the effects that floor-planning has on submodules and on the designs they make up. A method is developed to study floor-planning's impact on submodules independently of a full design. Six different submodules are each subjected to varying timing constraints and to area constraints of varying aspect ratios and area allocations. The resulting submodule minimum clock periods, routing overflows, and relocatabilities are assembled from millions of submodule implementations. The aggregate results suggest that EDA placement and routing tools can meet design constraints even with extreme combinations of submodule aspect ratio and area allocation; however, the probability of an implementation meeting constraints may be low at those extremes. Separate sets of submodule floor-planning guidelines are developed for meeting minimum clock period constraints, minimizing routing overflow, and maximizing relocatability. The guidelines for meeting minimum clock period are verified in full design implementations.
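The structure of such an experiment sweep can be sketched as below, where run_place_and_route is a placeholder for real EDA tool runs and its penalty model is purely invented; the thesis's actual results come from millions of genuine implementations.

```python
# Toy sweep harness: estimate the probability of meeting timing as a
# function of region aspect ratio and area slack. The "tool" is faked.

import random

def run_place_and_route(aspect_ratio, area_slack, seed):
    """Stand-in for an EDA run; invented model in which square, roomy
    regions close timing more often than extreme, tight ones."""
    random.seed(seed)
    shape_penalty = abs(aspect_ratio - 1.0) * 0.15
    area_penalty = max(0.0, 0.3 - area_slack)
    return random.random() > (shape_penalty + area_penalty)

for ar in (0.25, 0.5, 1.0, 2.0, 4.0):
    for slack in (0.1, 0.3, 0.5):      # extra area beyond the minimum
        runs = [run_place_and_route(ar, slack, s) for s in range(100)]
        print(f"aspect={ar:<4} slack={slack:.1f} "
              f"P(meets timing) ~ {sum(runs)/len(runs):.2f}")
```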
318

Electric Distribution Reliability Analysis Considering Time-varying Load, Weather Conditions and Reconfiguration with Distributed Generation

Zhu, Dan 12 April 2007
This dissertation is a systematic study of electric power distribution system reliability evaluation and improvement. Reliability evaluation of electric power systems has traditionally been an integral part of planning and operation, but changes in the electric utility industry, coupled with aging electric apparatus, create a need for more realistic power system reliability modeling techniques. This work presents a reliability evaluation technique that combines set theory and Graph Trace Analysis (GTA). Unlike the traditional Markov approach, this technique provides a fast solution for large-system reliability evaluation by managing computer memory efficiently with iterators, assuming a single failure at a time. A reconfiguration-for-restoration algorithm is also created to enhance the accuracy of the reliability evaluation when multiple concurrent failures are considered. As opposed to most restoration simulation methods used in reliability analysis, which convert restoration problems into mathematical models and can only solve radial systems, this new algorithm derives the reconfiguration solution from the topological characteristics of the network itself; as a result, it can handle systems with loops. In analyzing system reliability, this research takes time-varying load patterns into account and seeks approaches that are financially justified. An exhaustive search scheme is used to calculate optimal locations for Distributed Generators (DG) from the reliability point of view, and a Discrete Ascent Optimal Programming (DAOP) load-shifting approach is proposed to provide low-cost reliability improvement solutions. Because weather conditions have an important effect on distribution component failure rates, the influence of different types of storms is incorporated into the study: storm outage models are created from ten years of weather and power outage data, an observer is designed to predict the number of outages for an approaching or ongoing storm, and a circuit corridor model is applied to investigate the relationship between power outages and lightning activity. / Ph. D.
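To illustrate the single-failure-at-a-time style of evaluation, the following sketch traces which customers each component failure de-energizes on a toy radial feeder and computes an approximate SAIFI. The feeder topology, failure rates, and customer counts are made up, and the dissertation's GTA-based method covers far richer models, including restoration by reconfiguration and time-varying load.

```python
# Toy reliability evaluation: one failure at a time, trace downstream
# customers, weight by annual failure rate, and form a SAIFI estimate.

FEEDER = {                       # component -> downstream components
    "breaker": ["sec1"],
    "sec1": ["sec2", "lateralA"],
    "sec2": ["lateralB"],
    "lateralA": [], "lateralB": [],
}
CUSTOMERS = {"lateralA": 120, "lateralB": 80}
FAILURE_RATE = {"breaker": 0.01, "sec1": 0.05, "sec2": 0.05,
                "lateralA": 0.10, "lateralB": 0.10}   # failures/year

def downstream(component):
    """Everything de-energized when `component` fails (no restoration)."""
    out, stack = set(), [component]
    while stack:
        c = stack.pop()
        out.add(c)
        stack.extend(FEEDER.get(c, []))
    return out

total = sum(CUSTOMERS.values())
interruptions = sum(
    FAILURE_RATE[c] * sum(CUSTOMERS.get(d, 0) for d in downstream(c))
    for c in FEEDER
)
print(f"SAIFI ~ {interruptions / total:.3f} interruptions/customer-year")
```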
319

Reconfigurable Technologies for Next Generation Internet and Cluster Computing

Unnikrishnan, Deepak C. 01 September 2013
Modern web applications are marked by distinct networking and computing characteristics. As applications evolve, they continue to operate over a large monolithic framework of networking and computing equipment built from general-purpose microprocessors and Application-Specific Integrated Circuits (ASICs), which offers few architectural choices. This dissertation presents techniques to diversify the next-generation Internet infrastructure by integrating Field-Programmable Gate Arrays (FPGAs), a class of reconfigurable integrated circuits, with general-purpose microprocessor-based techniques. Our solutions are demonstrated in the context of two applications: network virtualization and distributed cluster computing. Network virtualization enables a physical network infrastructure to be shared among several logical networks running diverse protocols and differentiated services. Designing a good network virtualization platform is challenging because the physical networking substrate must scale to support several isolated virtual networks at high packet forwarding rates while offering sufficient flexibility to customize networking features. The first major contribution of this dissertation is a novel high-performance heterogeneous network virtualization system that integrates FPGAs and general-purpose CPUs. Salient features of this architecture include the ability to scale the number of virtual networks in an FPGA using existing software-based network virtualization techniques, to map virtual networks to a combination of hardware and software resources on demand, and to use off-chip memory to scale virtual router features. Partial reconfiguration is exploited to dynamically customize virtual networking parameters, and an open software framework describes virtual networking features in a hardware-agnostic language. Evaluation of our system using a NetFPGA card demonstrates one to two orders of magnitude higher throughput than state-of-the-art network virtualization techniques. The demand for greater computing capacity grows as web applications scale. In state-of-the-art systems, an application is scaled by parallelizing the computation on a pool of commodity machines using distributed computing frameworks. Although useful, this technique is inefficient because the sequential nature of execution in general-purpose processors does not suit all workloads equally well. Iterative algorithms form a pervasive class of web and data mining algorithms that are poorly executed on general-purpose processors due to the strict synchronization barriers in distributed cluster frameworks. This dissertation presents Maestro, a heterogeneous distributed computing framework that demonstrates how FPGAs can break down such synchronization barriers using asynchronous accumulative updates, which accumulate intermediate results for numerous data points without iteration-based barriers. The benefits of a heterogeneous cluster are illustrated by executing a general class of iterative algorithms on a cluster of commodity CPUs and FPGAs, with computation dynamically prioritized to accelerate algorithm convergence. We implement three such iterative algorithms on a cluster of four FPGAs and achieve a 7× speedup over an implementation of asynchronous accumulative updates on a general-purpose CPU, and a 154× speedup versus a standard Hadoop-based CPU-workstation cluster.
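The asynchronous accumulative update idea can be sketched in a few lines of Python, using incremental PageRank as the iterative algorithm: each vertex accumulates pending deltas and folds them in whenever it is selected, with no per-iteration barrier. The largest-delta-first rule mirrors the dissertation's dynamic prioritization of computation; the graph and constants are illustrative only.

```python
# Sketch of asynchronous accumulative updates on incremental PageRank:
# no synchronization barrier, updates applied in priority order.

DAMPING = 0.85
graph = {0: [1, 2], 1: [2], 2: [0]}       # vertex -> out-neighbors
rank = {v: 0.0 for v in graph}
delta = {v: 1 - DAMPING for v in graph}   # pending accumulated change

for _ in range(500):
    v = max(delta, key=lambda u: abs(delta[u]))   # prioritize big deltas
    if abs(delta[v]) < 1e-9:
        break                                     # converged
    rank[v] += delta[v]                           # fold the update in
    share = DAMPING * delta[v] / len(graph[v])
    delta[v] = 0.0
    for w in graph[v]:                            # push deltas onward
        delta[w] += share

print(rank)   # approximates PageRank without iteration barriers
```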
320

Online Management of Resilient and Power Efficient Multicore Processors

Rodrigues, Rance 01 September 2013
The semiconductor industry has been driven by Moore's law for almost half a century. Miniaturization has allowed more transistors to be packed into a smaller area, while improved transistor performance has yielded significant frequency increases. Unfortunately, the increased device density and rising frequency led to a power density problem that became an obstacle to further integration. The processor industry responded by lowering frequency and integrating multiple cores on a die, focusing on Thread-Level Parallelism (TLP) for performance instead of traditional Instruction-Level Parallelism (ILP). While continued device scaling has provided unprecedented integration, it has also led to a few serious problems. The first is the increasing rate of system failures due to soft errors and aging defects. Soft errors are caused by ionizing radiation originating from radioactive contaminants or from charged particles released by cosmic neutrons; such radiation may charge or discharge a storage node, causing bit flips that can result in system failure. In this dissertation, we propose solutions for online detection of such errors in microprocessors. A small, functionally limited core called the Sentry Core (SC) is added to the multicore; it monitors the functional cores and, whenever deemed necessary, opportunistically initiates Dual Modular Redundancy (DMR) to test their operation. This scheme allows detection of potential core failure at small hardware overhead, also detects errors introduced by device aging, and is further extended to verify cache coherence transactions. The second problem we address relates to power. While the multicore solution addresses the power density problem, overall power dissipation is still limited by packaging and cooling technologies, which limits the number of cores that can be integrated for a given package specification. One way to improve performance within this constraint is to reduce the power dissipation of individual cores without sacrificing system performance. Prior solutions, such as Dynamic Voltage and Frequency Scaling (DVFS) and sleep states, exploit coarse-grained variation in computational demand. In this dissertation, we propose techniques to maximize the performance-per-watt of multicores at a fine-grained time scale, and we explore multiple alternative architectures to attain this goal. One such architecture is the Asymmetric Multicore Processor (AMP). AMPs have been shown to outperform symmetric multicores in both performance and performance-per-watt for a fixed resource and power budget, but their effectiveness depends on accurate thread-to-core scheduling; we therefore propose online thread scheduling solutions that respond to the threads' changing computational requirements. Another solution targets Symmetric Multicore Processors (SMPs), sharing large, underutilized resources between pairs of cores. While such architectures have been explored in the past, the evaluations were incomplete: due to sharing, the shared resource sometimes becomes a bottleneck, causing significant performance loss. To mitigate such loss, we propose Dynamic Voltage and Frequency Boosting (DVFB) of the shared resources, which is found to significantly mitigate performance loss under contention. We also explore improving the performance-per-watt of individual cores through dynamic reconfiguration that runs cores alternately in out-of-order (OOO) and in-order (InO) modes, adapting to workload characteristics; this significantly improves power efficiency without compromising overall performance. Thus, this dissertation proposes solutions to several important problems in continued processor scaling, addressing the reliability of computation and offering low-power design solutions for power-constrained systems.
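As a toy model of the AMP scheduling problem, the sketch below ranks threads by the speedup they would gain on the big core, using hypothetical online IPC samples; the dissertation's actual scheduler and its inputs are more involved.

```python
# Toy thread-to-core mapper for an Asymmetric Multicore Processor:
# the threads that benefit most from the big core are placed there.

BIG_CORES = 1   # assumed machine shape: one big core, the rest small

def remap(samples):
    """samples: thread -> (ipc_on_big, ipc_on_small), gathered online."""
    ranked = sorted(samples, key=lambda t: samples[t][0] / samples[t][1],
                    reverse=True)    # biggest big-core speedup first
    return {t: ("big" if i < BIG_CORES else "small")
            for i, t in enumerate(ranked)}

samples = {
    "t0": (2.4, 0.8),   # compute-bound: 3.0x from the big core
    "t1": (1.1, 0.9),   # memory-stalled: 1.2x
    "t2": (1.6, 1.0),   # moderate benefit: 1.6x
    "t3": (0.9, 0.8),   # little benefit: 1.1x
}
print(remap(samples))   # t0 gets the big core; the rest run small
```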
