• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 82
  • 40
  • 13
  • 8
  • 5
  • 2
  • 2
  • 2
  • 1
  • 1
  • Tagged with
  • 189
  • 189
  • 50
  • 35
  • 34
  • 31
  • 28
  • 26
  • 25
  • 22
  • 21
  • 21
  • 21
  • 19
  • 18
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
21

Algorithm Architecture Co-design for Dense and Sparse Matrix Computations

January 2018 (has links)
abstract: With the end of Dennard scaling and Moore's law, architects have moved towards heterogeneous designs consisting of specialized cores to achieve higher performance and energy efficiency for a target application domain. Applications of linear algebra are ubiquitous in the field of scientific computing, machine learning, statistics, etc. with matrix computations being fundamental to these linear algebra based solutions. Design of multiple dense (or sparse) matrix computation routines on the same platform is quite challenging. Added to the complexity is the fact that dense and sparse matrix computations have large differences in their storage and access patterns and are difficult to optimize on the same architecture. This thesis addresses this challenge and introduces a reconfigurable accelerator that supports both dense and sparse matrix computations efficiently. The reconfigurable architecture has been optimized to execute the following linear algebra routines: GEMV (Dense General Matrix Vector Multiplication), GEMM (Dense General Matrix Matrix Multiplication), TRSM (Triangular Matrix Solver), LU Decomposition, Matrix Inverse, SpMV (Sparse Matrix Vector Multiplication), SpMM (Sparse Matrix Matrix Multiplication). It is a multicore architecture where each core consists of a 2D array of processing elements (PE). The 2D array of PEs is of size 4x4 and is scheduled to perform 4x4 sized matrix updates efficiently. A sequence of such updates is used to solve a larger problem inside a core. A novel partitioned block compressed sparse data structure (PBCSC/PBCSR) is used to perform sparse kernel updates. Scalable partitioning and mapping schemes are presented that map input matrices of any given size to the multicore architecture. Design trade-offs related to the PE array dimension, size of local memory inside a core and the bandwidth between on-chip memories and the cores have been presented. An optimal core configuration is developed from this analysis. Synthesis results using a 7nm PDK show that the proposed accelerator can achieve a performance of upto 32 GOPS using a single core. / Dissertation/Thesis / Masters Thesis Computer Engineering 2018
22

A Bound on the Number of Spanning Trees in Bipartite Graphs

Koo, Cheng Wai 01 January 2016 (has links)
Richard Ehrenborg conjectured that in a bipartite graph G with parts X and Y, the number of spanning trees is at most the product of the vertex degrees divided by |X|⋅|Y|. We make two main contributions. First, using techniques from spectral graph theory, we show that the conjecture holds for sufficiently dense graphs containing a cut vertex of degree 2. Second, using electrical network analysis, we show that the conjecture holds under the operation of removing an edge whose endpoints have sufficiently large degrees. Our other results are combinatorial proofs that the conjecture holds for graphs having |X| ≤ 2, for even cycles, and under the operation of connecting two graphs by a new edge. We also make two new conjectures based on empirical data, each of which is stronger than Ehrenborg's conjecture.
23

Rotating Supporting Hyperplanes and Snug Circumscribing Simplexes

Salmani Jajaei, Ghasemali 01 January 2018 (has links)
This dissertation has two topics. The rst one is about rotating a supporting hyperplane on the convex hull of a nite point set to arrive at one of its facets. We present three procedures for these rotations in multiple dimensions. The rst two procedures rotate a supporting hyperplane for the polytope starting at a lower dimensional face until the support set is a facet. These two procedures keep current points in the support set and accumulate new points after the rotations. The rst procedure uses only algebraic operations. The second procedure uses LP. In the third procedure we rotate a hyperplane on a facet of the polytope to a dierent adjacent facet. Similarly to the rst procedure, this procedure uses only algebraic operations. Some applications to these procedures include data envelopment analysis (DEA) and integer programming. The second topic is in the eld of containment problems for polyhedral sets. We present three procedures to nd a circumscribing simplex that contains a point set in any dimension. The rst two procedures are based on the supporting hyperplane rotation ideas from the rst topic. The third circumscribing simplex procedure uses polar cones and other geometrical properties to nd facets of a circumscribing simplex. One application of the second topic discussed in this dissertation is in hyperspectral unmixing.
24

Backward bifurcation in SIR endemic models : this thesis is presented in partial fulfillment of the requirements for the degree of Masters of Information Science in Mathematics at Massey University, Albany, Auckland, New Zealand

Siddiqui, Sameeha Qaiser January 2008 (has links)
In the well known SIR endemic model, the infection-free steady state is globally stable for R0 < 1 and unstable for R0 > 1. Hence, we have a forward bifurcation when R0 = 1. When R0 > 1, an asymptotically stable endemic steady state exists. The basic reproduction number R0 is the main threshold bifurcation parameter used to determine the stability of steady states of SIR endemic models. In this thesis we study extensions of the SIR endemic model for which a backward bifurcation may occur at R0 = 1. We investigate the biologically reasonable conditions for the change of stability. We also analyse the impact of di erent factors that lead to a backward bifurcation both numerically and analytically. A backward bifurcation leads to sub-critical endemic steady states and hysteresis. We also provide a general classi cation of such models, using a small amplitude expansion near the bifurcation. Additionally, we present a procedure for projecting three dimensional models onto two dimensional models by applying some linear algebraic techniques. The four extensions examined are: the SIR model with a susceptible recovered class; nonlinear transmission; exogenous infection; and with a carrier class. Numerous writers have mentioned that a nonlinear transmission function in relation to the infective class, can only lead to a system with an unstable endemic steady state. In spite of this we show that in a nonlinear transmission model, we have a function depending on the infectives and satisfying certain biological conditions, and leading to a sub-critical endemic equilibriums.
25

Accelerating Dense Linear Algebra for GPUs, Multicores and Hybrid Architectures: an Autotuned and Algorithmic Approach

Nath, Rajib Kumar 01 August 2010 (has links)
Dense linear algebra(DLA) is one of the most seven important kernels in high performance computing. The introduction of new machines from vendors provides us opportunities to optimize DLA libraries for the new machines and thus exploit their power. Unfortunately the optimization phase is not straightforward. The optimum code of a certain Basic Linear Algebra Subprogram (BLAS) kernel, which is the core of DLA algorithms, in two different machines with different semiconductor process can be different even if they share the same features in terms of instruction set architecture, memory hierarchy and clock speed. It has become a tradition to optimize BLAS for new machines. Vendors maintain highly optimized BLAS libraries targeting their CPUs. Unfortunately the existing BLAS for GPUs is not highly optimized for DLA algorithms. In my research, I have provided new algorithms for several important BLAS kernels for different generation of GPUs and introduced a pointer redirecting approach to make BLAS run faster in generic problem size. I have also presented an auto-tuning approach to parameterize the developed BLAS algorithms and select the best set of parameters for a given card. The hardware trends have also brought up the need for updates on existing legacy DLA software packages, such as the sequential LAPACK. To take advantage of the new computational environment, successors of LAPACK must incorporate algorithms of three main characteristics: high parallelism, reduced communication, and heterogeneity-awareness. On multicore architectures, Parallel Linear Algebra Software for Multicore Architectures (PLASMA) has been developed to meet the challenges in multicore. On the other extreme, Matrix Algebra on GPU and Multicore Architectures (MAGMA) library demonstrated a hybridization approach that indeed streamlined the development of high performance DLA for multicores with GPU accelerators. The performance of these two libraries depend upon right choice of parameters for a given problem size and given number of cores and/or GPUs. In this work, the issue of automatically tuning these two libraries is presented. A prune based empirical auto-tuning method has been proposed for tuning PLASMA. Part of the tuning method for PLASMA was considered to tune hybrid MAGMA library.
26

Accelerating Dense Linear Algebra for GPUs, Multicores and Hybrid Architectures: an Autotuned and Algorithmic Approach

Nath, Rajib Kumar 01 August 2010 (has links)
Dense linear algebra(DLA) is one of the most seven important kernels inhigh performance computing. The introduction of new machines from vendorsprovides us opportunities to optimize DLA libraries for the new machinesand thus exploit their power. Unfortunately the optimization phase is notstraightforward. The optimum code of a certain Basic Linear AlgebraSubprogram (BLAS) kernel, which is the core of DLA algorithms, in twodifferent machines with different semiconductor process can be differenteven if they share the same features in terms of instruction setarchitecture, memory hierarchy and clock speed. It has become a traditionto optimize BLAS for new machines. Vendors maintain highly optimized BLASlibraries targeting their CPUs. Unfortunately the existing BLAS for GPUsis not highly optimized for DLA algorithms. In my research, I haveprovided new algorithms for several important BLAS kernels for differentgeneration of GPUs and introduced a pointer redirecting approach to makeBLAS run faster in generic problem size. I have also presented anauto-tuning approach to parameterize the developed BLAS algorithms andselect the best set of parameters for a given card.The hardware trends have also brought up the need for updates on existinglegacy DLA software packages, such as the sequential LAPACK. To takeadvantage of the new computational environment, successors of LAPACK mustincorporate algorithms of three main characteristics: high parallelism,reduced communication, and heterogeneity-awareness. On multicorearchitectures, Parallel Linear Algebra Software for MulticoreArchitectures (PLASMA) has been developed to meet the challenges inmulticore. On the other extreme, Matrix Algebra on GPU and MulticoreArchitectures (MAGMA) library demonstrated a hybridization approach thatindeed streamlined the development of high performance DLA for multicoreswith GPU accelerators. The performance of these two libraries depend uponright choice of parameters for a given problem size and given number ofcores and/or GPUs. In this work, the issue of automatically tuning thesetwo libraries is presented. A prune based empirical auto-tuning method hasbeen proposed for tuning PLASMA. Part of the tuning method for PLASMA wasconsidered to tune hybrid MAGMA library.
27

A calculus of loop invariants for dense linear algebra optimization

Low, Tze Meng 29 January 2014 (has links)
Loop invariants have traditionally been used in proofs of correctness (e.g. program verification) and program derivation. Given that a loop invariant is all that is required to derive a provably correct program, the loop invariant can be thought of as being the essence of a loop. Being the essence of a loop, we ask the question “What other information is embedded within a loop invariant?” This dissertation provides evidence that in the domain of dense linear algebra, loop invariants can be used to determine the behavior of the loops. This dissertation demonstrates that by understanding how the loop invariant describes the behavior of the loop, a goal-oriented approach can be used to derive loops that are not only provably correct, but also have the desired performance behavior. / text
28

Backward bifurcation in SIR endemic models : this thesis is presented in partial fulfillment of the requirements for the degree of Masters of Information Science in Mathematics at Massey University, Albany, Auckland, New Zealand

Siddiqui, Sameeha Qaiser January 2008 (has links)
In the well known SIR endemic model, the infection-free steady state is globally stable for R0 < 1 and unstable for R0 > 1. Hence, we have a forward bifurcation when R0 = 1. When R0 > 1, an asymptotically stable endemic steady state exists. The basic reproduction number R0 is the main threshold bifurcation parameter used to determine the stability of steady states of SIR endemic models. In this thesis we study extensions of the SIR endemic model for which a backward bifurcation may occur at R0 = 1. We investigate the biologically reasonable conditions for the change of stability. We also analyse the impact of di erent factors that lead to a backward bifurcation both numerically and analytically. A backward bifurcation leads to sub-critical endemic steady states and hysteresis. We also provide a general classi cation of such models, using a small amplitude expansion near the bifurcation. Additionally, we present a procedure for projecting three dimensional models onto two dimensional models by applying some linear algebraic techniques. The four extensions examined are: the SIR model with a susceptible recovered class; nonlinear transmission; exogenous infection; and with a carrier class. Numerous writers have mentioned that a nonlinear transmission function in relation to the infective class, can only lead to a system with an unstable endemic steady state. In spite of this we show that in a nonlinear transmission model, we have a function depending on the infectives and satisfying certain biological conditions, and leading to a sub-critical endemic equilibriums.
29

Backward bifurcation in SIR endemic models : this thesis is presented in partial fulfillment of the requirements for the degree of Masters of Information Science in Mathematics at Massey University, Albany, Auckland, New Zealand

Siddiqui, Sameeha Qaiser January 2008 (has links)
In the well known SIR endemic model, the infection-free steady state is globally stable for R0 < 1 and unstable for R0 > 1. Hence, we have a forward bifurcation when R0 = 1. When R0 > 1, an asymptotically stable endemic steady state exists. The basic reproduction number R0 is the main threshold bifurcation parameter used to determine the stability of steady states of SIR endemic models. In this thesis we study extensions of the SIR endemic model for which a backward bifurcation may occur at R0 = 1. We investigate the biologically reasonable conditions for the change of stability. We also analyse the impact of di erent factors that lead to a backward bifurcation both numerically and analytically. A backward bifurcation leads to sub-critical endemic steady states and hysteresis. We also provide a general classi cation of such models, using a small amplitude expansion near the bifurcation. Additionally, we present a procedure for projecting three dimensional models onto two dimensional models by applying some linear algebraic techniques. The four extensions examined are: the SIR model with a susceptible recovered class; nonlinear transmission; exogenous infection; and with a carrier class. Numerous writers have mentioned that a nonlinear transmission function in relation to the infective class, can only lead to a system with an unstable endemic steady state. In spite of this we show that in a nonlinear transmission model, we have a function depending on the infectives and satisfying certain biological conditions, and leading to a sub-critical endemic equilibriums.
30

Sobre a produção de significados para a noção de transformação linear em álgebra linear /

Oliveira, Viviane Cristina Almada de. January 2002 (has links)
Orientador: Romulo Campos Lins / Banca: Iole de Freitas Druck / Banca: Marcos Vieira Teixeira / Resumo: Esta pesquisa, baseada no Modelo Teórico dos Campos Semânticos (MTCS), trata da produção de significados para a noção de transformação linear em Álgebra Linear. Foi desenvolvida a partir das análises de: textos matemáticos - alguns considerados históricos e outros contemporâneos - e entrevistas com duas alunas de um curso de Matemática. Neste trabalho, identificamos possíveis significados que podem ser produzidos para a noção de transformação linear, o que pode auxiliar na prática de professores de Álgebra Linear. Além disso, poderá subsidiar discussões mais amplas sobre a formação inicial do professor de Matemática. / Abstract: This research, based on the Theoretical Model of Semantic Fields (TMSF), deals with the production of meanings for the notion of linear transformation in Linear Algebra. It has been developed from the analysis of: mathematics texts - some taken as historical and others as contemporary - and interviews with two undergraduate mathematics students. In this work, we have identified possible meanings that can be produced for the notion of linear transformation. That can help the practice of teachers of Linear Algebra and might also promote more general discussion about the pre-service education of mathematics teachers. / Mestre

Page generated in 0.0422 seconds