Global ETD Search

1	網路健康社群中資料加值應用之研究 / research of value-added data application in internet health community 林耀弘 Unknown Date (has links) 網際網路的發展打破了時間與空間的限制。在這個虛擬的網際空間裡，逐漸發展出一個又一個得網路社群。在Facebook與Plurk這些社群服務興起之後，更加速了網路社群的產生與其影響力。。網路社群的運作機制與商業模式和個人資料息息相關，許多創新的社群商業模式一再的挑動隱私保護與個人資料保護的敏感神經。然而，流通在社群中的隱私資訊經過整理與加值，帶來了各種嶄新的商業模式。探討網路社群資料加值應用前，本研究先針對網路社群與個人資料進行文獻回顧。其中涵蓋網路社群之定義與特性、隱私權之定義與發展，作為理論發展之基礎。本文以個人資料的觀點分析網路社群與個人資料相關加值服務，深入探討個人資料管理相關服務，研究創新網路公司如何在個人資料管理上找到獲利的機會；並探討網路社群之功能與個人資料之間的關係。藉此找出落實個人資料保護與發展個人資料加值應用的影響因素。探討八個網路上的網路健康社群服務，歸納其功能並進行分類，最後分析網路健康社群之獲利模式。承接前文論述之隱私保護原則、網路社群之特性與個資對網路社群之影響因素，提出網路社群個人資料保護與加值應用建議。以「資料」、「裝置」、「軟體」、「服務」等四個層次分別對網路社群之個人資料保護與加值應用進行論述。最後以網路健康社群為例，對網路健康社群提出具體之發展建議。網路社群網路健康社群資料加值應用
2	應用資料採礦技術於多個資料庫連結與整合劉致琪 Unknown Date (has links) 現今電子化的時代，有些企業雖然擁有數百萬的資料，但要分析起來是相當困難且耗時的，往往又浪費人力與金錢，而又無法得到預期的結果。而利用資料採礦技術，便可以從這大量的資料中，挖掘出隱藏的、有用的訊息及知識，還可以從既有的資料預測未來，使企業可優先獲得商機。對於資料採礦而言，一開始的資料收集便是一項很重要的課題，資料品質的良莠，牽動著結果的正確性及預測的成敗。但每一個研究主題都有其各自的目的、所需的資料變數、適用的演算法等等，所以也有可能無法幸運地在同一個資料庫中得到所需的完整訊息，若是重新進行調查，是很費時、費力的工作。當我們面臨以上問題時，對於部份資料的缺漏該如何補救呢？！這便是我們在本研究中的研究目的。所以我們可以試著從現有的資料庫下，利用兩個其他的資料庫來輔助，利用函數映射的方法來補齊我們所要的資料，如此情況下再來做資料採礦，便能更有效率；對於我們所建立出來的預測模型，也更為準確。在資料庫連結的過程中，我們討論了三種情況，分別為三個資料庫間有相同欄位、兩兩資料庫有相同欄位、三個資料庫間沒有相同欄位。從研究結果發現，不管資料庫之間有無相同欄位可供連結使用，利用函數映射方法為資料庫增加訊息是可行的，而且效能相當不錯，可以提供給資料採礦工作者在蒐集資料時的參考，以及未來的研究方向。資料採礦資料加值函數映射
3	應用資料採礦技術於資料庫加值中的抽樣方法 / THE SAMPLING METHODS FOR VALUE-ADDED DATABASE IN DATA-MINING 陳惠雯 Unknown Date (has links) In the wake of growing database that has already become the trend of today’s business environment within the foreseeable future, reviewing quality information from mountains of data residing on corporations or organizations’ network such as sales figures, manufacturing statistics, financial data and experimental data is clearly costly, time consuming and definitely ineffective approach. Therefore we would need a sound and effective method in obtaining only portions of the data that are representative to the population and which allow us to build the reliable model based upon the sampled data. However, sometimes we have a situation where the database is of limited in size, under such circumstance, we initiate the idea which is relatively new to adding the attributes or values into the database to enhance the quality of the data Follow through such a procedure; it is obvious that implementing a good sampling method is an important groundwork leading us to reach final destination that is obtaining a reliable predictive model. And this is our research goal that is to get an effective and representative value-added sample of by means of sampling method for building an accuracy predictive model. The concept is pretty straightforward that is if we want to get good predictive samples then we need the correct sampling methods. The sampling methods under study are simple random sample, system sample, stratified sample and uniform design. The models used are the C5.0, logistic regression, and neural network for categorical predictive variable and stepwise regression for continuous predictive variable. The results are discussed in the conclusion section. Keywords: Database、Data Mining、Sampling、Value-added database 資料庫資料採礦抽樣方法資料加值 Database Data Mining Sampling Value-added database
4	應用資料採礦技術於資料庫加值中的誤差指標及模型準則 / ERROR INDEX AND MODEL CRITERIA FOR VALUE- ADDED DATABASE IN DATA MINING 包寶茹 Unknown Date (has links) 運用資料來幫助企業做出正確且適當的政策是一個存在已久的觀念，在傳統統計上我們通常會將拿到的資料庫直接去作分析，然而對資料採礦(Data Mining)來說，常面臨資料不夠的瓶頸，亦導致資料庫的價值往往不夠。若，我們能利用調查的樣本，推估出目標資料庫中所欠缺的欄位在調查樣本中與其它欄位的關係，便可回推至目標資料庫將原本所欠缺的欄位補齊，將資料庫加大，亦即資料加值(value-added)，那麼，未來要用到這些欄位來分析資料時只要抽樣進行分析即可，如此，也可有效降低企業的成本支出或浪費。本研究之目的在於整合過去各學者所提出之統計理論與方法，找出誤差指標及模型準則來說明擴充的欄位是有可信度的。由於在目標資料庫擴充欄位時，會產生誤差值，而誤差值的大小往往會影響我們用來判斷此擴充欄位的可行性及可信度，因此本研究並不考慮使用何種抽樣方法，而是假設在簡單隨機抽樣下來進行探討，判別在資料加值前後所造成預測值與實際值之間的差異情形，進一步來做比較。針對欲擴充目標的欄位型態分為連續型和類別型來尋找適當的指標及準備作為我們選擇判斷的指標。類別型欄位利用相似性觀念建立判斷指標，連續型欄位則利用距離觀念、相關性的架構下來討論，如此，可建立合理的誤差指標及模型準則針對欲擴充目標欄位的型態來判斷其擴充的欄位是否具有可信度，並評估其可用價值的高低。本研究實證結果發現資料庫加值為一可行的方法，從推估資料帶入模式後所得預測值與原始觀測值間計算其相似度皆在九成以上，說明擴充的欄位是有可信度的。關鍵詞：資料採礦、資料加值、誤差指標、模型準則、相似性 / In recent years, the application of data mining has received good credits and acceptances from a variety of industries such as the finance industry, the insurance industry, and the electronics industry and so on for its success in extracting valuable information translated to opportunities from the database. Database value-added is a new idea not yet fully mature. Its applications on the different databases will have different effect, therefore, the goal of this research is to find the valid and accountable model criteria as a mean to determine if the added columns make any improvement to the database, hence the overall results in terms of predictions. After selecting the model based upon its appropriateness to the data type, we applied the error index and model criteria to evaluate for the performance of the model, if the model has accurately predicted the added-value column. The criterion used in this research is RMSE for the continuous data type and F-value for the discrete data type. Our findings in this research support our attempts that the error index and model criteria used in this research do give us an accountability measure in determining the reliability of adding the columns to the database. Keywords: Data mining, Database value-added, Database, Error index, Model criteria 資料採礦資料加值誤差指標模型準則相似性 Data mining Database value-added Database Error index Model criteria

1

Page generated in 0.0117 seconds