1 |
趨近一般化資料倉儲與資料探勘之效能評估模型 / Toward a More Generalized Benchmark Workload Model for Data Warehouse and Data Mining邱士涵, Chiu,Shih-Han Unknown Date (has links)
隨著網際網路的發達以及資料庫技術的成熟,人們取得資料變得非常的容易,再加上許多網際網路的應用其實就是一個自動化的資料收集工具,資料量之大已幾近爆炸的程度。資料倉儲便是一種用來儲存大量歷史資料的資料庫,提供彙整或是統計的資訊,以提供決策使用的資訊技術。而資料探勘是從大量的資料當中把對於決策過程中有幫助的規則找出來,提供給管理人員做為決策的參考,開創新的商業契機。資料倉儲的效能表現對於使用者的工作效率有著深遠的影響。因此有些用以衡量與預測資料倉儲之效能與效率之工作量模式便孕育而生,一般稱之為績效評估工具,然而目前所公佈的一般資料倉儲績效評估工具是針對特定範圍領域建構出某些典型的領域規格,並沒有一個使用者需求導向的資料倉儲績效評估工具。在資料探勘方面,探勘結果的準確度比起資料探勘所花費的時間來得重要,目前卻沒有一個有效、使用者需求導向的工具來評估資料探勘結果的準確度。我們針對資料倉儲的效能評估以及資料探勘準確度評估,設計一個以使用者需求為導向的工作量模型,來評估資料倉儲與資料探勘工具。 / As growth of Internet and mature of database technology, people can get the data much easily than before. Many applications on Internet, in fact, are the tools of gather data automatically so that the amount of data is growing bigger and bigger. Data warehouse is one kind of database to store lots of historical data to offer statistical information for the information technology of decisions. Data mining is to find the useful rules for decisions from the amount of data to help the managers make decisions and create the new opportunities of business. The performance of data warehouse is import to user’s work efficiency. Therefore, there are some workload model arise to evaluate and predict the performance and efficiency of data warehouse called benchmark. However, the data warehouse specification announced these days are constructed to some typical domain specific, and the performance evaluation stand on synthetic workload. But, when the difference between the domain of data warehouse user applied and domain of performance evaluation tool is very large, the performance metric may different a lot to the result of benchmark tool. In data mining, the accuracy of mining result is important to business. The accuracy of mining result is more important than the time spend on data mining. However, there is no any useful tool to evaluate the accuracy of mining result and there is no any standard of performance criteria for data mining, either. We design a user requirement-oriented workload to evaluate performance of data warehouse and precision of data mining.
|
2 |
分散式關聯資料庫系統績效評估工作量模式之研究 / Distributed RDBMS Benchmark Workload Modeling韓先良, Han, Sien-Liang Unknown Date (has links)
本研究之主要目標在於建構一個能評估分散式關聯資料庫中之特色的需求導向績效評估方法。在過去的績效評估研究中,已經有許多人對於關聯式資料庫績效評估做了多方面的努力。但是,過去的關聯式資料庫資效評估方法如:Wisconsin、AS3AP、TPC系列的Benchmarks都有著一些限制及不足的地方。
過去的關聯式資料庫績效評估方法並無法完全的評估出分散式資料庫的特殊需求及其表現。所以本研究嘗試要建立出一個能專門適用於分散式資料庫導向的績效評估方法。為了要作出此績效評估方法,本研究採用了工作量模式的研究方法。先建出分散式資料庫績效評估的工作量模式,再以其來實作出績效評估方法。工作量模式分成三部分:資料模式、交易模式、控制模式。 / This thesis is intended to design a requirements-centric database benchmark, which can evaluate the general performance of the distributed relational database systems. In the past, there are many relational database benchmarks. But the relational database benchmarks like Wisconsin, AS3AP, TPC, TP1 have some constraints.
In this study, we aim to design a general-purpose distributed database workload model and implement it. To design this benchmark, we need to build our workload model. The workload model consists of three components:data model, transaction model, control model. Each model has the requirement specification language to accommodate user's workloads.
|
3 |
自由語文資訊檢索資料庫系統績效評估工作量模式之研究-以傳播學網際網路資料庫為例 / Free-Text Information Retrieval Database System Benchmark Workload Model-Web Communication Databases林佳慧, Lin, Chia-Hui Unknown Date (has links)
資訊檢索(Information Retrieval)一直是資訊學界的重要研究領域,但長久以來並未能在其它的學門中發揮其重要性,然而藉由Internet的普及與網路資源的激增,以資訊檢索為基礎的網路搜尋技術,已逐漸成為Internet上相當受到重視的技術之一。雖然全文資訊檢索的技術已存在多時,然而電腦產業一直缺乏一廣泛標準的績效評估,來評估全文資訊檢索系統的產出及價格/效能比,1990年代對全文資訊檢索軟體的大量需求,極需要一個標準、一致的方法,來比較各系統間效能的差異。本研究即針對傳播中文全文資訊檢索資料庫系統為探討的主題,試圖以關連式資料庫績效評估的理論及方法,來建構其工作量模式以進行績效評估。 / Information Retrieval has been a important research domain in Information academic. But it does not play in the other domain for a long time. However the Internet search technology based on Information Retrieval has been an one of the important technologies in the Internet by the spreading of Internet and the growth of network resources. Although the Full-Text Information Retrieval or Free-Text Information Retrieval technology has existed for a long time, the computer industry was lack of a wide, standard benchmark to evaluate the throughput and price/performance of the Full-Text Information Retrieval System. The large demand for Full-Text Information Retrieval System or Free-Text Information Retrieval System software needs a standard method to compare the performance of each system in 90s. This research is focused on the Communication Free-Text Information Retrieval Database System in Chinese, and construct it’s workload model to evaluate by using the theory and method of Relational Database Benchmark.
|
Page generated in 0.0131 seconds