• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 1
  • 1
  • 1
  • Tagged with
  • 4
  • 4
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

Research on Parallel Hierarchical Matrix Construction / 階層型行列生成の並列化に関する研究

Bai, Zhengyang 23 March 2023 (has links)
京都大学 / 新制・課程博士 / 博士(情報学) / 甲第24744号 / 情博第832号 / 新制||情||139(附属図書館) / 京都大学大学院情報学研究科システム科学専攻 / (主査)准教授 深沢 圭一郎, 教授 田中 利幸, 教授 石井 信 / 学位規則第4条第1項該当 / Doctor of Informatics / Kyoto University / DFAM
2

High-Performance Matrix Multiplication: Hierarchical Data Structures, Optimized Kernel Routines, and Qualitative Performance Modeling

Wu, Wenhao 02 August 2003 (has links)
The optimal implementation of matrix multiplication on modern computer architectures is of great importance for scientific and engineering applications. However, achieving the optimal performance for matrix multiplication has been continuously challenged both by the ever-widening performance gap between the processor and memory hierarchy and the introduction of new architectural features in modern architectures. The conventional way of dealing with these challenges benefits significantly from the blocking algorithm, which improves the data locality in the cache memory, and from the highly tuned inner kernel routines, which in turn exploit the architectural aspects on the specific processor to deliver near peak performance. A state-of-art improvement of the blocking algorithm is the self-tuning approach that utilizes "heroic" combinatorial optimization of parameters spaces. Other recent research approaches include the approach that explicitly blocks for the TLB (Translation Lookaside Buffer) and the hierarchical formulation that employs memoryriendly Morton Ordering (a spaceilling curve methodology). This thesis compares and contrasts the TLB-blocking-based and Morton-Order-based methods for dense matrix multiplication, and offers a qualitative model to explain the performance behavior. Comparisons to the performance of self-tuning library and the "vendor" library are also offered for the Alpha architecture. The practical benchmark experiments demonstrate that neither conventional blocking-based implementations nor the self-tuning libraries are optimal to achieve consistent high performance in dense matrix multiplication of relatively large square matrix size. Instead, architectural constraints and issues evidently restrict the critical path and options available for optimal performance, so that the relatively simple strategy and framework presented in this study offers higher and flatter overall performance. Interestingly, maximal inner kernel efficiency is not a guarantee of global minimal multiplication time. Also, efficient and flat performance is possible at all problem sizes that fit in main memory, rather than "jagged" performance curves often observed in blocking and self-tuned blocking libraries.
3

Effiziente Vorkonditionierung von Finite-Elemente-Matrizen unter Verwendung hierarchischer Matrizen

Fischer, Thomas 25 October 2010 (has links) (PDF)
Diese Arbeit behandelt die effiziente Vorkonditionierung von Finite-Elemente-Matrizen unter Verwendung hierarchischer Matrizen.
4

Effiziente Vorkonditionierung von Finite-Elemente-Matrizen unter Verwendung hierarchischer Matrizen

Fischer, Thomas 15 September 2010 (has links)
Diese Arbeit behandelt die effiziente Vorkonditionierung von Finite-Elemente-Matrizen unter Verwendung hierarchischer Matrizen.

Page generated in 0.0511 seconds