Return to search

應用文件探勘技術於概念股股價共同移動之研究 / A study of using text mining on the co-movement of concept stock price

證券市場在台灣為相當熱門的投資標的,台灣屬於淺碟式市場,股市投資者以散戶居多,且資訊來源大多為報紙、電視、網路…等媒體,因此外界的訊息易於影響股價波動。近年來股票分類方式除了傳統的產業類別分類,衍生出了一種新的分類方式-概念股。概念股是某種被看好之產品或產業甚至政策相關個股的集合,概念下的股票通常具有相當大的話題性,因此會引發報章媒體的報導,引發投資者的關注。基於以上原因,可推論概念相關的報導會對概念相關個股的漲跌有一定影響。因此本研究以消息面的資訊作為基礎,並以文字探勘技術加以分析,以聚集出人們有興趣概念所相關之個股。
本研究以聯合知識庫2011年1月至4月共86,579篇新聞為資料來源,以iPad2概念為標的,透過文字探勘的技術找出各新聞內容的特徵,並透過關聯分析對新聞做分析,從中找出概念及個股之間的關聯規則,藉此找出和概念相關之個股。接著本研究從台灣證券交易所網站取得2011年2月至5月所有交易日大盤之上漲、下跌個股數量,以其較大值與兩數和相除計算其股價共同移動程度,並取得其累積報酬率與本研究所選出之概念股進行比較。
在研究結果中,本研究方法所選出之概念股在門檻值為0.2時在2月至5月股價共同移動程度分別為79.3%、73.6%、70.2%、68.1%,皆高於MoneyDJ選出之概念股及大盤同期間之股價共同移動程度。而以成對樣本T檢定在顯著水準95%下,顯示本研究選出之概念股顯著有股價共同移動現象。因此也證實了藉由文字探勘技術及關聯規則,能從雜亂無序的新聞中發掘出人們有興趣之概念所相關的個股,以提供投資者做更深入分析。 / In Taiwan stock market, most of investors are individual, as the result, the external information will affect the stock price. Concept stock is an aggregation of many stocks on a relative basis such as industry or particular product. It is usually makes the topic to mass media for report, therefore the investor will pay close attention to it. There are many websites offering digital news so we can obtain easily these from the Internet and analyze them. This paper proposes an method to find stocks that relate to the concept from the digital news.
In this paper, we collected the news from Udndata, using the text mining technique to analyze these data and performing association analysis’s algorithm to find out the association rule between stocks and concept. Then, we use statistical test to test the co-movement pattern between these Concept Stocks to the Taiwan Stock Index. The result illustrate text mining technique is able to find the relation between stocks and concept and proofs the Concept Stocks have co-movement pattern.

Identiferoai:union.ndltd.org:CHENGCHI/G0098356012
Creators吳振和, Wu, Cheng Ho
Publisher國立政治大學
Source SetsNational Chengchi University Libraries
Language中文
Detected LanguageEnglish
Typetext
RightsCopyright © nccu library on behalf of the copyright holders

Page generated in 0.0017 seconds