Return to search

應用大數據於杭州市房地產價格模型之建立 / The Application of Big Data Analytics on Real Estate Price Model of Hangzhou

互聯網的發展與近年來數據平台受到公私部門重視,資訊的取得與流通變得便捷,中國房地產文化目前有別於台灣,尚無實價登錄機制且地域面積廣大,傳統估價模型可能無法直接應用,面對房地產背後眾多的影響因素,本研究將預測建模目標放在泡沫化尚不嚴重且較具有潛力的中國新一線城市杭州市,自新浪二手房網爬取杭州市房地產數據,並自國家統計局取得各地區行政支出數據,作為實證分析資料。結合自動程序爬蟲抓取數據、統計分析與機器學習方法,期望對中國房地產建立一混合非監督式與監督式學習之模型。
在分群結果之後建構模型採用之技術為C5.0、三層CHAID、五層CHAID與Neural Network,挑選出最適合的模型為使用混合模型後的C5.0決策樹方法,達到降低變數維度亦提升或達到相當的預測準確率的雙贏目標,模型中行政地區、面積、總樓層為最頻出現的重要變數。
另外透過集群分析於行政支出的應用,發現2016年度杭州市投入的行政支出集中於余杭區、蕭山區、濱江區,成為賣屋及購屋者的第二項決策標準。 / In recent years, with the growth of the Internet and the importance of data platform on public sector and private sector. Getting and sharing information are made easily. The culture of real estate in China is different from Taiwan. For instance, there is no actual house price registration system. Furthermore, traditional estimate model may not be directly applicable to China which has the vast geographical area of the mainland. There are many factors to influence house price model. This study focus on Hangzhou city. Because the burst of real estate bubbles is not serious as first-tier cities and it is one of new first-tier cities in China. The research data were crawler from Sina second-hand housing website and National Bureau of Statistics. By using auto web crawler skill, statistical analysis, and machine learning method to build a real estate model in China, which was combining unsupervised learning method with supervised learning method.
After clustering Hangzhou second-hand housing data, this study used C5.0, three layers Chi-Square Automatic Interaction Detector(CHAID), five layers CHAID, and Neural Network(NN). The study goal are both reducing dimension and getting better forecast accuracy. Choosing clustering- C5.0 model as appropriate house price model to achieve win-win situation after comparing final result. Administrative region, area, and total floor are the top three high frequency influential factors.
Applying Clustering Analysis to administrative expenses data in Hangzhou, the study found that the government resource focus on Yuhang, Xiaoshan, and Binjiang. It can be the second decision-making criterion for house sellers and house buyers.

Identiferoai:union.ndltd.org:CHENGCHI/G0105354007
Creators郁嘉綾, Yu, Cia-Ling
Publisher國立政治大學
Source SetsNational Chengchi University Libraries
Language中文
Detected LanguageEnglish
Typetext
RightsCopyright © nccu library on behalf of the copyright holders

Page generated in 0.007 seconds