簡易檢索 / 詳目顯示

研究生: 林佳欣
Lin, Chia-Hsin
論文名稱: 基於階層式詮釋資料模式之地理空間影片管理機制
Geospatial Video Management Mechanism Based on Hierarchical Model of Metadata
指導教授: 洪榮宏
Hong, Jung-Hong
學位類別: 碩士
Master
系所名稱: 工學院 - 測量及空間資訊學系
Department of Geomatics
論文出版年: 2023
畢業學年度: 111
語文別: 中文
論文頁數: 197
中文關鍵詞: 地理空間影片階層式詮釋資料三維地理資訊地理標籤影片檢索
外文關鍵詞: Geospatial video, Hierarchical metadata, 3d GIS, Geotag, Video retrieval
相關次數: 點閱:170下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 現今世代資訊技術已從過去單純的文字內容逐步演進到以多媒體為主流,特別是如影片等動態拍攝性質之多媒體,透過連續畫面拍攝的特性,能夠廣泛記錄現實世界於特定時空下之各類現象與狀態。除個人之娛樂用途外,近年更開始應用於例如智慧城市、智慧交通、智慧防救災等應用中,成為重要之參考資料來源。然而,面對不同來源自動生產龐大影片資訊量的情形,如何有效管理、搜尋及應用影片記錄之訊息,成為空間資訊進一步擴展的關鍵課題。
    無論使用目的為何,可由影片內容中搜尋所需內容是有效管理與應用影片資料之關鍵。早期之影片資料僅以拍攝內容為主,隨著衛星定位晶片、多元感測器之尺寸大幅縮小及價格平民化,在拍攝時同步記錄其攝影機位置、空間姿態及所對應之現實世界時間已成為可能,這也讓大多數影片能夠進步成具有地理標記之「地理空間影片(Geospatial Video)」。隨著近年空間資訊之發展,記錄現實世界各類現象之資訊快速增加,若能建立各相機攝影點隨移動軌跡變化之視野與三維物件之間的關係,可由空間之觀點掌握與詮釋影片與拍攝物件之關係,並在與三維地理資訊結合之狀況下,獲得更高之管理效率及為更廣泛之應用參考,兩者之結合更可在提升影片運用可能性及強化空間資訊應用上達到互補之效益。
    本研究提出一個基於階層式架構之詮釋資料運作模式,由:「影片(Video)、片段(Segment)、幀畫面(Frame)」建立影片之分層描述結構,分析之影片內容可區分為一至多片段,每片段再由一至多幀畫面所構成,一幀畫面則為在特定位置及時間所拍攝的結果,若能有效地依分層之狀態建立描述內容,與現實事件之三維地理資訊建立關聯,即可強化管理之效能。本文依此階層架構,分別規劃詮釋資料,在減低重複記錄及建立關聯之基礎上,提供影片有關人、事、時、地、物等不同觀點之描述,並依此發展基於空間、時間及主題的索引機制,滿足多元面向之資料查詢需求。分層索引機制之最大特點在於可因應不同目的之使用者搜尋需求,透過時空及敘述內容等條件,分別在不同的階層檢索影片,快速過濾符合需求之影片、片段或畫面。本研究透過所建置之影片查詢系統,以視覺化方式呈現雙視窗介面(影片&軌跡地圖)以及影片描述資訊,確保使用者能夠透過索引獲取地理空間影片和各層級之影片詮釋資訊,最後實現豐富之影片檢索體驗。
    面對多媒體資訊快速成長之趨勢,本研究由地理空間資訊之面向提出管理及應用之具體策略,重點在於可善加應用自動或人類操控而記錄之現實世界現象,並在檢視及處理相關影片內容時,由三維地理資訊提供豐富之時空參考資訊,活化可能之應用。地理空間觀點能夠強化影片時空內容狀態之詮釋,而階層式影片詮釋資料庫的規劃架構則為描述影片提供了有效的參考,並且幫助有效管理和檢索大量的影片資訊,從而挖掘影片的潛在價值和應用潛力。若能夠善用每日以不同形式產生或專業需求產生之影片資料,不惟可對現實世界之動態變化提供一個新的記錄機制,對於多媒體及地理空間資訊科技之發展帶來相當之助益。

    The way of sharing information has gradually developed from plain text content in the past to multimedia as the mainstream, especially dynamic shooting multimedia such as video. The video could record the state of the environment and the dynamic phenomena of the real world, the information it contains is quite extensive, such as: space, time, and theme content. In the face of the huge amount of video information automatically produced by various sources, how to effectively manage, search and apply the information recorded in the video has become a key issue for the further expansion of spatial information.
    With the size reduction of satellite positioning chips and multi-sensors and the low price, it has become possible to simultaneously record the camera position, spatial attitude, and corresponding real world time during shooting. It enables most videos to be developed into "Geospatial Video" with geotagging. If the relationship between the field of view of each camera shooting point that changes with the movement trajectory and the three-dimensional object can be established, then the objects captured in images and videos can be grasped and interpreted from the perspective of space. In the case of combining with 3D geographic information, higher management efficiency and wider application reference can be obtained. The combination of the two can achieve complementary benefits in enhancing the possibility of video application and strengthening the application of spatial information.
    This paper proposes a hierarchical structure-based interpretation data operation mode, which establishes three levels in the overall video description according to the video hierarchical structure: "Video, Segment, and Frame". If the status of each layer can be effectively recorded, the video can be associated with the 3D geographic information of real events, and applications can be activated. According to the combination relationship of video information between layers, plan the corresponding annotation data. Provide descriptions of different viewpoints on people, events, time, places, objects, etc., and develop indexing mechanisms based on space, time, and themes. The biggest feature of this hierarchical index mechanism is that it can respond to users' search needs for different purposes. Retrieve videos at various levels according to conditions such as time and space and narrative content, to obtain relevant information corresponding to videos, segments, and frames. Finally, through the built video query system, the dual-window interface (video & track map) and video structured metadata information are presented visually.
    The biggest feature of this article is that for videos with geotagging, multi-dimensional geographic information is used to help analyze the content of the captured videos, especially for videos shot for special purposes such as road inspection vehicles. Under the hierarchical structure record, it provides various levels of video description results to meet the video indexing of different usage requirements.

    摘要I 致謝X 目錄XI 表目錄XIII 圖目錄XIV 第一章 緒論1 1.1 研究背景1 1.2 研究目標3 1.3 研究流程4 1.4 論文架構7 1.5 研究限制9 第二章 文獻回顧10 2.1 空間多媒體10 2.1.1 多媒體特色及潛力10 2.1.2 空間多媒體13 2.1.3 空間多媒體之發展與應用16 2.1.4 小結18 2.2 影片檢索19 2.2.1 影片檢索技術19 2.2.2 具空間特性之影片檢索技術27 2.2.3 小結31 2.3 地理空間影片資料庫32 2.3.1 資料庫管理系統32 2.3.2 影片詮釋資料(Video Metadata)39 2.3.3 小結42 2.4 地理空間影片相關應用案例分析42 2.4.1 案例一:ArcGIS Full Motion Video43 2.4.2 案例二:GeoVisuals影片地理敘事系統47 2.4.3 案例三:影片與GIS場景即時融合(虛實整合)51 第三章 階層式詮釋資料架構58 3.1 現況與需求58 3.1.1 影片詮釋資料意涵與發展58 3.1.2 影片特性之剖析影片60 3.1.3 地理空間影片描述之需求分析69 3.2 詮釋資料設計與評估73 3.2.1 參考標準74 3.2.2 詮釋資料設計策略77 3.2.3 欄位項目定義說明81 3.2.4 詮釋資料規格分析及需求總表85 第四章 地理空間影片資料庫系統之運作程序94 4.1 地理空間影片資料分析94 4.1.1 感測器與數據需求分析94 4.1.2 數據提取及處理98 4.1.3 攝影點視野空間建模100 4.2 多維度地理資訊系統建置與分析102 4.2.1. 多維度地理資訊環境建置102 4.2.2. 影片分層切割處理104 4.2.3. 地理標記(Geotag)資訊之建立106 4.3 地理空間影片儲存與索引108 4.3.1 關聯式後端資料庫108 4.3.2 資料表之建立與關聯111 4.3.3 使用者檢索策略規劃115 4.3.4 階層式影片索引122 第五章 系統實作與應用情境之測試126 5.1 系統運作與介面分析126 5.1.1 系統運作環境126 5.1.2 後端資料處理129 5.1.3 前端網頁介面與功能139 5.2 影片案例與詮釋資料描述內容討論145 5.2.1 影片情境規劃說明145 5.2.2 案例拍攝之描述內容與討論149 5.3 影片資訊查詢測試與成果161 5.3.1 影片階層式詮釋資料查詢流程與介面161 5.3.2 不同條件之影片查詢與成果測試168 5.4 實作總結討論175 第六章 結論與未來展望179 6.1 結論179 6.2 未來展望181 參考文獻184

    3dcitydb. Available online: https://github.com/3dcitydb/3dcitydb (accessed on 12 November 2019).
    Aggarwal, Jake K., and Michael S. Ryoo. "Human activity analysis: A review." Acm Computing Surveys (Csur) 43.3 (2011): 1-43
    Aji, A.; Wang, F.; Vo, H.; Lee, R.; Liu, Q.; Zhang, X.; Saltz, J. Hadoop GIS: A high performance spatial data warehousing system over mapreduce. Proc. VLDB Endow. 2013, 6, 1009–1020.
    Alan, Hasan Faik, et al. "Sensor Log: A mobile data collection and annotation application." 2014 22nd Signal Processing and Communications Applications Conference (SIU). IEEE, 2014.
    Alarabi, L.; Mokbel, M.F.; Musleh, M. ST-Hadoop: A mapreduce framework for spatio-temporal data. Geoinformatica 2018, 22, 785–813.
    Ansari, Mohammed & Mohammed, Muzammil. (2015). Content based Video Retrieval Systems -Methods, Techniques, Trends and Challenges. International Journal of Computer Applications. 112. 975-8887.
    Aote SS, Potnurwar A. An automatic video annotation framework based on two level keyframe extraction mechanism. Multimedia Tools Appl. 2019;78(11):14465–84.
    Ardeshir, S., Zamir, A. R., Torroella, A., & Shah, M. (2014). GIS-assisted object detection and geospatial localization. In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part VI 13 (pp. 602-617). Springer International Publishing.
    Aref, Walid G., et al. "A Video Database Management System for Advancing Video Database Research." Multimedia Information Systems. 2002.
    Arsanjani, J.J.; Zipf, A.; Mooney, P.; Helbich, M. (Eds.) OpenStreetMap in GIScience—Experiences, Research, and Applications, Lecture Notes in Geoinformation and Cartography; Springer: Berlin/Heidelberg, Germany, 2015.
    Arslan, U. (2002, January). A Semantic Data Model and Query Language for Video Data. Retrieved 5 14, 2012, from bilkent.
    Bante, Vaidehi K. and Avinash N. Bhute. “A Text Based Video Retrieval Using Semantic and Visual Approach.” (2015).
    Barry B. Mindfull documentary. Massachusetts Institute for Technology. Ph.D. thesis, MIT, Boston, 2005.
    BILL, R. and FRITSCH, D., 1994, Grundlagen der Geo-Informationssysteme Hardware, Software und Daten, Wichmann: Karlsruhe. 2. Auflage. 423 Seiten.
    Chambon S, Crouzil A. Similarity measures for image matching despite occlusions in stereo vision. Pattern Recognit. 2011;44(9):2063–75.
    Chesher, C. (2012) ‘Navigating sociotechnical spaces: Comparing computer games and sat navs as digital spatial media’, Convergence: The International Journal of Research into New Media Technologies, 18(3): 315-330.
    Crampton, J. (2009) ‘Cartography: maps 2.0’, Progress in Human Geography, 33(1): 91– 100.
    D. Brezeale and D. J. Cook, “Automatic video classification: A survey of the literature,” IEEE Trans. Syst., Man, Cybern., C, Appl. Rev., vol. 38, no. 3, pp. 416–430, May 2008.
    D. Cvetković, "Introductory Chapter: Multimedia and Interaction", in Interactive Multimedia - Multimedia Production and Digital Storytelling. London, United Kingdom: IntechOpen, 2019 [Online].
    de Souza e Silva, A. (2013) ‘Location-aware mobile technologies: Historical, social, and spatial approaches’, Mobile Media & Communication, 1(1): 116–121.
    El Saddik, "Digital Twins: The Convergence of Multimedia Technologies," in IEEE MultiMedia, vol. 25, no. 2, pp. 87-92, Apr.-Jun. 2018.
    Eldawy, A.; Mokbel, M.F. SpatialHadoop: A mapreduce framework for spatial data. In Proceedings of the 2015 IEEE 31st International Conference on Data Engineering, Seoul, Korea, 13–17 April 2015.
    Elwood S. and Leszczynski A. (2011) ‘Privacy, reconsidered: New representations, data practices, and the geoweb’, Geoforum, 42: 5–16.
    Esri, “Oriented Imagery Catalog Schema,” User documentation, August 9th, 2022.
    Fan, P. F., & Li, L. L. (2017). A three-dimensional modeling study based on the technique of low-altitude UAV oblique photogrammetry and Smart3D software. Bulletin of Surveying and Mapping, 63(Suppl. 2), 77– 81.
    Favyen Bastani, Oscar Moll, and Sam Madden. 2020. Vaas: video analytics at scale. Proc. VLDB Endow. 13, 12 (August 2020), 2877–2880.
    Franck Jeveme Panta, Mahmoud Qodseya, André Péninou, and Florence Sèdes. 2018. Management of Mobile Objects Location for Video Content Filtering. In Proceedings of the 16th International Conference on Advances in Mobile Computing and Multimedia (MoMM2018). Association for Computing Machinery, New York, NY, USA, 44–52.
    Furht, B., & Marques, O. (Eds.). (2003). Handbook of Video Databases: Design and Applications (1st ed.). CRC Press.
    G. Hauptmann, R. Baron, M. Y. Chen, M. Christel, P. Duygulu, C. Huang, R. Jin, W. H. Lin, T. Ng, N. Moraveji, N. Papernick, C. Snoek, G. Tzanetakis, J. Yang, R. Yan, and H. Wactlar, “Informedia at TRECVID 2003: Analyzing and searching broadcast news video,” in Proc. TREC Video Retrieval Eval., Gaithersburg, MD, 2003.
    G. Lavee, E. Rivlin, and M. Rudzsky, “Understanding video events: A survey of methods for automatic interpretation of semantic occurrences in video,” IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 39, no. 5, pp. 489–504, Sep. 2009.
    Goodchild, M. F. (2007). Citizens as sensors: The world of volunteered geography. GeoJournal, 538 69(4), 211–221.
    Goodchild, M.F. Citizens as sensors: The world of volunteered geography. GeoJournal 2007, 69, 211–221.
    GoPro, “TELEMETRY EXTRACTOR for GoPro Instructions Manual” ,2022.
    Graham, Mark & De Sabbata, Stefano & Zook, Matthew. (2015). Towards a Study of Information Geographies: (Im)Mutable Augmentations and a Mapping of the Geographies of Information. Geo: Geography and Environment. 2. 10.1002/geo2.8.
    H. Li, Z. Ma, H. Fan and J. Liu, "An Efficient Approach based on Image Pixel and Semantic Features Towards Video Retrieval," 2018 2nd International Conference on Imaging, Signal Processing and Communication (ICISPC), 2018, pp. 46-53
    H. Tamura, S. Mori, T. Yamawaki, "Textural Features Corresponding to Visual Perception", IEEE Trans. on Systems, Man and Cyber., vol. 8, no. 6, p. 460–473. 2, 4, , June 1978.
    Hafidh Firmansyah, M., Paul, A., Bhattacharya, D., Malik Urfa, G.: AI based embedded speech to text using deepspeech. arXiv e-prints, arXiv-2002 (2020)
    Han, Z., Cui, C., Kong, Y., Qin, F., & Fu, P. (2016). Video data model and retrieval service framework using geographic information. Transactions in GIS, 20(5), 701–717.
    Hao J, Wang G, Seo B and Zimmermann R 2014 Point of Interest detection and visual distance estimation for sensor-rich video. IEEE Transactions on Multimedia 16: 1929–41
    Helle Sjøvaag (2016) Media diversity and the global superplayers: operationalising pluralism for a digital media market, Journal of Media Business Studies, 13:3, 170-186.
    Hu, Shunfu and Ting Dai. “Online Map Application Development Using Google Maps API, SQL Database, and ASP.NET.” (2013).
    Huang, Yu, Jizhou Gao and Heather Yu. “DYNAMIC PROGRAMMING-BASED OPTIMIZATION FOR AUDIO-VISUAL SKIMS.” (2011).
    J. Sivic and A. Zisserman, “Video Google: Efficient visual search of videos,” in Toward Category-Level Object Recognition. Berlin, Germany: Springer, pp. 127–144, 2006.
    Jamonnak, S., Bhati, D., Amiruzzaman, M. et al. VisualCommunity: a platform for archiving and studying communities. J Comput Soc Sc 5, 1257–1279 (2022).
    Jamonnak, S., Zhao, Y., Curtis, A., Al-Dohuki, S., Ye, X., Kamw, F., & Yang, J. (2020). GeoVisuals: a visual analytics approach to leverage the potential of spatial videos and associated geonarratives. International Journal of Geographical Information Science, 34(11), 2115-2135.
    Jamonnak, Suphanut & Zhao, ye & Curtis, Andrew & AL-Dohuki, Shamal & ye, Xinyue & Kamw, Farah & Yang, Jing. (2020). GeoVisuals: a visual analytics approach to leverage the potential of spatial videos and associated geonarratives. International Journal of Geographical Information Science. 34. 1-21.
    Jeon, Yeseul & Jang, Eun-Young & Jang, Hwan-Seok. (2019). SENTIMENT SCORES OF SENTIMENT KEYWORDS: ANALYSIS OF HOTEL REVIEW DATA. 233-235.
    K. Brown, A rich diet: Data-rich multimedia has a lot in store for archiving and storage companies, Broadband Week, March 5, 2001.
    Kalkowski, Sebastian, et al. "Real-time analysis and visualization of the YFCC100M dataset." Proceedings of the 2015 workshop on community-organized multimodal mining: opportunities for novel solutions. 2015.
    Kayastha, N., Niyato, D., Wang, P., & Hossain, E. (2011). Applications, architectures, and protocol design issues for mobile social networks: A survey. Proceedings of the IEEE, 99(12), 2130-2158.
    Kirstein, S., Wersing, H., Gross, H. M., & Körner, E. (2008, November). A Vector Quantization Approach for Life-Long Learning of Categories. In ICONIP (1) (pp. 805-812).
    Kitchin, R. (2016) Getting Smarter about Smart Cities: Improving Data Privacy and Data Security, Data Protection Unit, Department of the Taoiseach, Dublin, Ireland, (last accessed 1st Feb 2016).
    Kitchin, Rob & Lauriault, Tracey & Wilson, Matthew. (2017). Understanding Spatial Media. DOI: 10.4135/9781526425850.n1.
    Kwan, M.P.; Lee, J. Emergency response after 9/11: The potential of real-time 3D GIS for quick emergency response in micro-spatial environments. Comput. Environ. Urban 2005, 29, 93–113.
    Lee, Jae-Gil, and Minseo Kang. "Geospatial big data: challenges and opportunities." Big Data Research 2.2 (2015): 74-81.
    Lewis J 2006 OGC GeoVideo Web Service Draft Implementation Specification. WWW document, http:// opengeospatial.org/files/?artifact_id=12899
    Lewis P, Fotheringham S, and Winstanley A 2011 Spatial video and GIS. International Journal of Geographical Information Science 5:697–716
    Li, C., Liu, Z., Zhao, Z. et al. A fast fusion method for multi-videos with three-dimensional GIS scenes. Multimed Tools Appl 80, 1671–1686 (2021).
    Li, H., Ma, Z., Fan, H., & Liu, J. (2018, July). An Efficient Approach based on Image Pixel and Semantic Features Towards Video Retrieval. In 2018 2nd International Conference on Imaging, Signal Processing and Communication (ICISPC) (pp. 46-53). IEEE.
    Liang-Hua Chen, Kuo-Hao Chin, Hong-Yuan Liao, "An integrated approach to video retrieval", Proceedings of the nineteenth conference on Australasian databaseVolume 75, 49–55, 2008
    Lin, Y., Abdelfatah, K., Zhou, Y., Fan, X., Yu, H., Qian, H., & Wang, S. (2015). Co-interest person detection from multiple wearable camera videos. In Proceedings of the IEEE International Conference on Computer Vision (pp. 4426-4434).
    Lösel, G. (2021). Tags and tracks and annotations–research video as a new form of publication of embodied knowledge. International Journal of Performance Arts and Digital Media, 17(1), 31-45.
    Lu Y, Shahabi C, Kim S H 2014 An efficient index structure for large-scale geotagged video databases. In Proceedings of the Twenty-second ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Dallas, Texas
    Ma H, Ay S A, Zimmermann R, and Kim S H 2013 Large-scale geotagged video indexing and queries. GeoInformatica 4: 671–97
    Ma, T. Y., Cui, J., & Chu, D. (2020). Research on fusion technology of real 3S scene and video surveillance image based on WebGL. Geomatics & Spatial Information Technology, 43(Suppl. 1), 80–83.
    Maragos, P., Gros, P., Katsamanis, A., Papandreou, G. (2008). Cross-Modal Integration for Performance Improving in Multimedia: A Review. In: Maragos, P., Potamianos, A., Gros, P. (eds) Multimodal Processing and Interaction. Multimedia Systems and Applications, vol 33. Springer, Boston, MA.
    Marcus Banks(林恩慧譯),質性研究的視覺資料運用,韋伯文化國際出版有限公司,2010 年 3 月
    Michael G. Christel, Andreas M. Olligschlaeger, Chang Huang, Interactive Maps for a Digital Video Library[J]. IEEE Multimedia Computing and Systems, 2000:60-67.
    Milosavljević, A., Dimitrijević, A., & Rančić, D. (2010). GIS-augmented video surveillance. International Journal of Geographical Information Science, 24(9), 1415-1433.
    Moreno-Sánchez, Rafael et al. “Design and development strategy for multi-media GIS to support environmental negotiation, administration, and monitoring at the regional level.” Trans. GIS 1 (1996): 161-175.
    Mounika, B.R & Ponnusamy, Palanisamy & Khare, Ashish. (2021). Content Based Video Retrieval—Methods, Techniques and Applications.
    MPEG-7 Overview . (2004 , October ). Retrieved 5 16, 2012, from INTERNATIONAL ORGANISATION FOR STANDARDISATION: http://mpeg.chiariglione.org/standards/mpeg-7/mpeg-7.htm
    Myung-Hee Jo, Developing RIMS (Road Information Management System) using Video GIS method[C]. in Asian Conference on Remote Sensing 2005, 2005. Hanoi Vietnam.
    N. Dimitrova, L. Agnihotri, and G. Wei, “Video classification based on HMMusing text and faces,” in Proc. Eur. Signal Process. Conf., Tampere, Finland,pp. 1373–1376, 2000
    Nack, F. (2009). Video Metadata. In: LIU, L., ÖZSU, M.T. (eds) Encyclopedia of Database Systems. Springer, Boston, MA.
    Navarrete T and Blat J 2002 VideoGIS: Segmenting and indexing video based on geographic information. In Proceedings of the Fifth AGILE Conference on Geographic Information Science, Palma de Mallorca, Spain
    Nezihe BurcuOzgura, M. A. (2009,February26). An intelligent fuzzy object oriented database framework for video database applications. ScienceDirect , pp. 2253–2274.
    Nicu Sebe, Michael S. Lew, Arnold W.M. Smeulders, "Video retrieval and summarization", Computer Vision and Image Understanding, vol. 92, no. 2-3, pg 141-146, 2003
    Oriented Imagery Catalog Schema, August 9th, 2022, Esri User Documentation.
    Park M, Luo J, Collins R T, and Liu Y 2014 Estimating the camera direction of a geotagged image using reference images. Pattern Recognition 47: 2880–93
    R. D. Campos, A. F. De Torres, and F. M. Mota. (2019). A Survey on Multimedia Metadata Annotation Approaches. Multimedia Tools and Applications, 78(18), 25381–25416.
    Rahim, S.T., Zheng, K., Turay, S. et al. Capabilities of multimedia gis. Chin. Geogr. Sci. 9, 159–165 (1999).
    Raj, Bhumika V., and Charul Kandoi. "Content based Video Retrieval.", IRJET Journal (2020).
    Roes, E. (2012). Metadata. In M. J. Bates & M. N. Maack (Eds.), Encyclopedia of Library and Information Sciences (3rd ed., pp. 3560-3570). Taylor and Francis.
    S. Bruyne, D. Deursen, J. Cock, W. Neve, P. Lambert, and R. Walle, “A compressed-domain approach for shot boundary detection on H.264/AVC bit streams,” J. Signal Process.: Image Commun., vol. 23, no. 7, pp. 473–489, 2008.
    S. Steiniger, M. Neun, and A. Edwardes. Foundations of Location Based Services. Lecture Notes on LBS, 1:272, 2006
    Saoudi, E.M., Jai-Andaloussi, S. A distributed Content-Based Video Retrieval system for large datasets. J Big Data 8, 87 (2021).
    Shen, J., Wang, M., Yan, S., & Hua, X. S. (2011, November). Multimedia tagging: past, present, and future. In Proceedings of the 19th ACM international conference on Multimedia (pp. 639-640).
    Shunfu Hu (2003) Multi-media GIS: Analysis and Visualization of Spatio-Temporal and Multimedia Geographic Information, Geographic Information Sciences, 9:1-2, 90-96.
    Söderholm, S.; Bhuiyan, M.Z.H.; Thombre, S.; Ruotsalainen, L.; Kuusniemi, H. A Multi-GNSS Software-Defined Receiver: Design, Implementation, and Performance Benefits. Ann. Telecommun. 2016, 71, 399–410.
    spatial approaches,’ Mobile Media and Communication, 1 (1): 116–21. de Waal, M. (2014) The City as Interface: How New Media Are Changing the City. Rotterdam: nai010 publishers.
    Steiger, E.; Resch, B.; Zipf, A. Exploration of spatiotemporal and semantic clusters of Twitter data using unsupervised neural networks. Int. J. Geogr. Inf. Sci. 2016, 30, 1694–1716.
    Sutko, D. M., & de Souza e Silva, A. (2011). Location-aware mobile media and urban sociability. New Media & Society, 13(5), 807–823.
    T.N.Shanmugham, Priya Rajendran, "An Enhanced Content-Based Video Retrieval System Based on Query Clip", International Journal of Research and Reviews in Applied Sciences, Volume 1, Issue 3, 2009
    Tang, Y., Wu, Y., Wu, M., Wu, W., Hu, X., & Shen, L. (2008). INS/GPS integration: Global observability analysis. IEEE Transactions on Vehicular Technology, 58(3), 1129-1142.
    Tian, Y., Liu, H., & Wang, J. (2017). A multi-object recognition algorithm based on geographic information system. In Proceedings of the 3rd International Conference on Computer and Communication Systems.
    W. Xiu, Z. Gao, W. Liang, W. Qi, and X. Peng, "Information Management and Target Searching in Massive Urban Video Based on Video-GIS" 2018 8th International Conference on Electronics Information and Emergency Communication (ICEIEC), Beijing, China, 2018, pp. 228-232.
    W. Xiu, Z. Gao, W. Liang, W. Qi, and X. Peng, "Information Management and Target Searching in Massive Urban Video Based on Video-GIS," 2018 8th International Conference on Electronics Information and Emergency Communication (ICEIEC), Beijing, China, 2018, pp. 228-232
    Wang, M., Liu, X., Zhang, Y., & Wang, Z. (2017). Camera coverage estimation based on multistage grid subdivision. ISPRS International Journal of Geo-Information, 6(4), 110–128.
    Wang, S., Hu, Q., Zhao, P., Yang, H., Wu, X., Ai, M., & Zhang, X. (2022). Real‐time fusion of multiple videos and 3D real scenes based on optimal viewpoint selection. Transactions in GIS.
    Wang, W., Yang, J., You, X.: Combining ElasticFusion with PSPNet for RGB-D based indoor semantic mapping. In: 2018 Chinese Automation Congress (CAC), pp. 2996–3001. IEEE (2018)
    Wang, W., Zhou, T., Porikli, F., Crandall, D., & Van Gool, L. (2021). A survey on deep learning technique for video segmentation. arXiv e-prints, arXiv-2107.
    Wattanarachothai, Werachard, and Karn Patanukhom. "Key frame extraction for text-based video retrieval using Maximally Stable Extremal Regions." 2015 1st International Conference on Industrial Networks and Intelligent Systems (INISCom). IEEE, 2015.
    Weiming Hu, Nianhua Xie, Li Li, Xianglin Zeng, Maybank S., "A Survey on Visual Content-Based Video Indexing and Retrieval", IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 41-6,797-819, 11/2011
    Wikibooks編者.多媒體技術/多媒體技術的特點[G/OL].Wikibooks,,2019年06月24日02:31
    Wilson, M.W. and Stephens, M. (2015) ‘GIS as media?’, in S. Mains, J. Cupples and C. Lukinbeal (eds) Mediated Geographies / Geographies of Media. Dordrecht: Springer. pp. 209-233.
    Wu, C., Zhu, Q., Zhang, Y., Du, Z., Zhou, Y., Xie, X., & He, F. (2015). An adaptive organization method of geovideo data for spatio-temporal association analysis. ISPRS Annals of the Photogrammetry, Remote Sensing & Spatial Information Sciences, 2, 29–34.
    Wu, X., Lu, Y. J., Peng, Q., & Ngo, C. W. (2011). Mining event structures from web videos. IEEE MultiMedia, 18(1), 38-51.
    Wu, Z.; Chang, Y.; Li, Q.; Cai, R. A Novel Method for Tunnel Digital Twin Construction and Virtual-Real Fusion Application. Electronics 2022, 11, 1413.
    Wu, Zhaohui, Ying Chang, Qing Li, and Rongbin Cai. 2022. "A Novel Method for Tunnel Digital Twin Construction and Virtual-Real Fusion Application" Electronics 11, no. 9: 1413.
    XIE Xiao, ZHU Qing, ZHANG Yeting, ZHOU Yan, XU Weiping, WU Chen. Hierarchical Semantic Model of Geovideo[J]. Acta Geodaetica et Cartographica Sinica, 2015, 44(5): 555-562.
    Xie, Y, Wang, M, Liu, X, et al. Spatiotemporal retrieval of dynamic video object trajectories in geographical scenes. Transactions in GIS. 2021; 25: 450– 467.
    Xiu, W., Gao, Z., Liang, W., Qi, W., & Peng, X. (2018, June). Information management and target searching in massive urban video based on video-GIS. In 2018 8th International Conference on Electronics Information and Emergency Communication (ICEIEC) (pp. 228-232). IEEE.
    Y. Yuan, “Research on video classification and retrieval” Ph.D. dissertation, School Electron. Inf. Eng., Xi‟an Jiaotong Univ., Xi‟an, China, pp. 5–27, 2003.
    Yun J. K., Kim, J. J., Hong, D. S., & Han, K. J. (2005). Development of an embedded spatial MMDBMS for spatial mobile devices. In Web and Wireless Geographical Information Systems: 5th International Workshop, W2GIS 2005, Lausanne, Switzerland, December 15-16, 2005. Proceedings 5 (pp. 1-10). Springer Berlin Heidelberg.)
    Zhang Y, Tao R, Wang Y. Motion-state-adaptive video summarization via spatiotemporal analysis. IEEE Transactions on Circuits and Systems for Video Technology. 2016;27(6):1340–52.
    Zhou, X., Liu, C., & Song, Y. (2019). A Survey of Video Event Understanding: Dataset, Evaluation, and Benchmark. ACM Transactions on Multimedia Computing, Communications, and Applications, 15(3s), 1-22.
    王祥安,2004年08月26日,寬頻時代的幕後推手-影音資料庫系統,《數位典藏與數位學習聯合目錄》Hope Net 科技月刊,卷期:100,頁68-73
    吳宗德,2004,視訊檢索技術及其於多媒體百科全書之應用研究計畫,中國文化大學資訊傳播學系暨研究所,行政院國家科學委員會專題研究計畫成果報告
    李彥賢, 楊錦生, 廖國堯. "以社會性標籤為基礎的擴充搜尋技術支援影音分享網站中之影片檢索." 資訊管理學報 19.3 (2012): 533-565
    李彥賢、楊錦生、廖國堯。2011。以社會性標籤為基礎的擴充搜尋技術支援影音分享網站中之影片檢索。
    李道明。2004。影音數位化問題探討—以台灣社會人文影音資料庫為例。
    林信成、康珮熏,「報紙新聞數位典藏 Metadata 轉換系統之設計與應用」,中文媒體數位典藏與新聞標示語言研討會,頁 B2-1~B2-23,台北國家圖書館,2005/5/11~5/12
    張振魁、顏逸品、侯佳利、陳彥良,1999,高等資料庫-Multimedia Database多媒體資料庫,國立中央大學資管所
    連國能(2006)。〈單元動態影像的複合構成在互動多媒體中之應用研究〉。臺灣師範大學設計研究所在職進修碩士班學位論文。
    陳建銘(2003)。〈互動多媒體對影像存在感表現之探討─以「影像•景象─新竹」創作為例〉。元智大學資訊傳播學系學位論文。1-148。
    陳彥良博士(1999),Multimedia Database多媒體資料庫,國立中央大學管理學院。
    黃純國,殷常鴻(主編),多媒體技術與應用,清華大學出版社,2011.06。
    黃國倫、林金龍. (2007). “第三章-數位典藏系統建置”,數位典藏技術導論. 數位典藏與數位學習國家型科技計畫/ 中央研究院,國立臺灣大學出版中心
    廖泫銘,2009,”地圖與地名資料庫”,數位典藏地理資訊工作坊,<時間與空間的整合—歷史GIS在高中教學之應用>
    賴珮君, 吳蕙盈, & 李蔡彥. (2014). 互動敘事中客製化之虛擬拍攝實驗平台 (Doctoral dissertation, 賴珮君).

    無法下載圖示 校內:2026-08-31公開
    校外:2026-08-31公開
    電子論文尚未授權公開,紙本請查館藏目錄
    QR CODE