• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 460
  • 85
  • 80
  • 20
  • 10
  • 7
  • 3
  • 3
  • 3
  • 3
  • 3
  • 3
  • 3
  • 3
  • 2
  • Tagged with
  • 813
  • 813
  • 177
  • 167
  • 156
  • 119
  • 117
  • 114
  • 109
  • 98
  • 94
  • 90
  • 83
  • 82
  • 77
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
21

Explicit data graph compilation

Smith, Aaron Lee, 1977- 19 August 2010 (has links)
Technology trends such as growing wire delays, power consumption limits, and diminishing clock rate improvements, present conventional instruction set architectures such as RISC, CISC, and VLIW with difficult challenges. To show continued performance growth, future microprocessors must exploit concurrency power efficiently. An important question for any future system is the division of responsibilities between programmer, compiler, and hardware to discover and exploit concurrency. In this research we develop the first compiler for an Explicit Data Graph Execution (EDGE) architecture and show how to solve the new challenge of compiling to a block-based architecture. In EDGE architectures, the compiler is responsible for partitioning the program into a sequence of structured blocks that logically execute atomically. The EDGE ISA defines the structure of, and the restrictions on, these blocks. The TRIPS prototype processor is an EDGE architecture that employs four restrictions on blocks intended to strike a balance between software and hardware complexity. They are: (1) fixed block sizes (maximum of 128 instructions), (2) restricted number of loads and stores (no more than 32 may issue per block), (3) restricted register accesses (no more than eight reads and eight writes to each of four banks per block), and (4) constant number of block outputs (each block must always generate a constant number of register writes and stores, plus exactly one branch). The challenges addressed in this thesis are twofold. First, we develop the algorithms and internal representations necessary to support the new structural constraints imposed by the block-based EDGE execution model. This first step provides correct execution and demonstrates the feasibility of EDGE compilers. Next, we show how to optimize blocks using a dataflow predication model and provide results showing how the compiler is meeting this challenge on the SPEC2000 benchmarks. Using basic blocks as the baseline performance, we show that optimizations utilizing the dataflow predication model achieve up to 64% speedup on SPEC2000 with an average speedup of 31%. / text
22

A methodology for automated design of computer instruction sets

Bennett, J. P. January 1987 (has links)
No description available.
23

Systems with predictable caching

Irwin, James Patrick John January 2002 (has links)
No description available.
24

AN OPTIMIZATION STAGE FOR AHPL COMPILER (LAYOUT).

MAITAN, JACEK. January 1984 (has links)
The dissertation is a description of an analysis and a case study of an Optimization Stage for a Standard Cell oriented silicon compiler. Using the AHPL hardware description language, a complete representation hierarchy (functional, logic, and layout) is proposed for circuits defined at a Register Transfer level. The design of a new class of methods for layout analysis and optimization is based on this hierarchy. A layout evaluation method is based on the analysis of an activity graph derived from a circuit layout. The cost measure for such a graph is defined and used in evaluation of the necessary and sufficient conditions for design optimality (NSCDO). Iterations within the optimization process are controlled using a synthetic measure derived from these optimality conditions. A proposed layout optimization heuristic, derived from NSCDO, allows for better routing channel area utilization without compromising a circuit's timing performance. It is based on an analysis of the timing behavior modifications introduced by the various materials used as interconnectors resulting in an improvement of the load dependency of the output driving capabilities of cells. The dissertation contains an example of a quantitative analysis of a CMOS digital circuit. A system implementing some of the algorithms described above has been written in FORTRAN77.
25

AN EXPERIMENTAL FRAME FOR A MODULAR HIERARCHICALLY COORDINATED ADAPTIVE COMPUTER ARCHITECTURE

Liaw, Yih-Shyan, 1955- January 1986 (has links)
No description available.
26

Toplogies and routing strategies for transputer networks

Pritchard, David John January 1993 (has links)
No description available.
27

Reconfigurable systems in space instrumentation

Bezerra, Eduardo Augusto January 2002 (has links)
No description available.
28

High performance architecture in the MU6 network

Aniteye, L. B. January 1984 (has links)
No description available.
29

Experiments with a vitual tree architecture

McBurney, D. L. January 1993 (has links)
No description available.
30

Model distribution in decentralized multisensor data fusion

Berg, Timothy Martin January 1993 (has links)
No description available.

Page generated in 0.0991 seconds