1 |
以圖文辨識為基礎的旅遊路線規劃輔助工具 / Tour Planning Using Landmark Photo Matching and Intelligent Character Recognition黃政明, Huang, Cheng Ming Unknown Date (has links)
智慧型手機的用途已從語音溝通延伸轉變為多功能導向的的生活工具。目 前多數的智慧型手機均具備攝影鏡頭,而此模組更已被公認為基本的標準 配備。使用者透過手機,可以輕易且自然地拍攝感興趣的物體、景色或文 字等,並且建立屬於自己的影像資料庫。在眾多的手機軟體中,旅遊類的 程式是其中一種常見整合內容與多項感測模組的應用實例。在行動平台上, 設計一個影像辨識系統服務可以大幅地協助遊客們在旅途中去瞭解、認識
知名的地標、建築物、或別具意義的物體與文字等。 然而在行動平台上的可用資源是有限的,因此想要在行動平台上開發有效 率的影像辨識系統,是頗具挑戰性的任務。如何在準確率與計算成本之間 取得最佳的平衡點往往是行動平台上開發影像辨識技術的最重要課題。 根據上述的目標,本研究擬於行動平台上設計、開發行動影像搜尋與智慧 型文字辨識系統。具體而言,我們將在影像搜尋上整合兩個全域的特徵描 述子,並針對印刷與手寫字體去開發智慧型文字辨識系統。實驗結果顯示, 在行動影像搜尋與文字辨識的效能測試部分,前三名的辨識率皆可達到的 80%。 / The roles of smart phones have extended from simple voice communications to multi-purpose applications. Smart phone equipped with miniaturized image capturing modules are now considered standard. Users can easily take pictures of interested objects, scenes or texts, and build their own image database. Travel-type mobile app is one example that takes advantage of the array of sensors on the device. A mobile image search engine can bring much convenience to tourists when they want to retrieve information regarding specific landmarks, buildings, or other objects.
However, devising an effective image recognition system for smart phone is a quite challenging task due to the complexity of image search and pattern recognition algorithms. Image recognition techniques that strike a balance between accuracy and efficiency need to be developed to cope with limited resources on mobile platforms.
Toward the above goal, this thesis seeks to design effective mobile visual search and intelligent character recognition systems on mobile platforms. Specifically, we propose two global feature descriptors for efficient image search. We also develop an intelligent character recognition engine that can handle both printed and handwritten texts. Experimental results show that the accuracy reaches 80% for top-3 candidates in visual search and intelligent character recognition tasks.
|
Page generated in 0.1447 seconds