| 研究生: |
石琢暐 Shih, Cho-Wei |
|---|---|
| 論文名稱: |
語意導向影像管理實現方法與技術研發 Development of Enabling Methods and Technologies for Semantic-oriented Image Management |
| 指導教授: |
陳裕民
Chen, Yuh-Min |
| 學位類別: |
博士 Doctor |
| 系所名稱: |
電機資訊學院 - 製造資訊與系統研究所 Institute of Manufacturing Information and Systems |
| 論文出版年: | 2012 |
| 畢業學年度: | 100 |
| 語文別: | 英文 |
| 論文頁數: | 80 |
| 中文關鍵詞: | 影像管理 、語意導向影像管理 、影像知識模式 、影像生命週期 |
| 外文關鍵詞: | Semantic-oriented image management, Image management, Image knowledge model, Image life-cycle |
| 相關次數: | 點閱:78 下載:3 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
科技發展與知識經濟加速了資訊、知識與數位內容的流通,雖有助於使用者便利地取得資訊與知識,卻也同時造成資訊過載(Information overload)之現象。資訊過載係指資訊量過多,造成使用者難以適時、確切、迅速地找到需要的資訊,因此,提供一個有效的管理模式,以協助使用者適時獲得並整合正確的資訊,實有其必要性。再者,多數使用者傾向採用影像來記錄生活中的點滴,舉凡人、事、地或物等值得記錄的時刻皆然;且影像可傳達的隱性資訊(包括情感、回憶)較聲音直覺與豐富。是故,影像內容的管理模式更顯其重要性,卻存在幾個關鍵問題:(1)缺乏一般化的影像管理模式;(2)未考量影像之特殊性;(3)忽略影像內隱的語意資訊。
有鑑於此,本研究參考「知識管理(Knowledge management)」之模式與方法,針對影像的生命週期(Image lifecycle),提出一「語意導向影像管理模式」,再依此設計「語意導向影像管理系統架構」與開發「實現技術(Enabling technologies)」,以實現「影像知識管理」之目標。
為實現研究目標,本研究訂立之研究項目如下:
(1) 影像生命週期分析與設計;
(2) 一般化影像管理模式設計;
(3) 語意導向影像管理模式設計;
(4) 語意導向影像管理功能定義與設計;
(5) 語意導向影像管理系統模組開發;
(6) 語意導向影像管理系統整合與測試;
(7) 系統評量(實驗設計)。
The rapid development of technology and advent of knowledge economy accelerate the circulation and access of digital content. However, it also increases the difficulty of knowledge acquirement due to information overload. Therefore, it is necessary to provide an effective management method for information gathering and integration.
Images are widely used to record the happenings of life (including people, events, places or things) due to its ease of use and rich containment of implicit information (including emotions and memories). Therefore, image management has become a remarkable requirement, however, it still exists issues of: (a) lacking of generic image management model; (b) ignoring the characteristics of images; (c) ignoring the implicit semantics of images.
To fulfill the requirements of image management, this research will study lifecycle of images and then propose a "semantic-oriented image management model" based on the concepts and methods of “knowledge management”. System framework and its enabling technologies will also be developed according to the proposed model.
To achieve the objective, the research tasks include:
(a) Design and analysis of image lifecycle,
(b) Design of generic image management model,
(c) Design of semantic-oriented image management model
(d) Planning and design of system framework
(e) Development of enabling technologies and system modules, and
(f) Experiments and assessment.
Al-Obeidat, F., Belacel, N., Carretero, J. A., & Mahanti, P. (2010). Differential Evolution for learning the classification method PROAFTN. Knowledge-Based Systems, 23(5), 418–426.
Balabanovic, M., & Shoham, Y. (1995). Learning Information Retrieval Agents: Experiments with Automated Web Browsing. Proceedings of the AAAI Spring Symposium on Information Gathering from Heterogeneous, Distributed Environments (pp. 13–18).
Belkin, N. J. (2000). Helping people find what they don’t know. Communications of the ACM, 43(8), 58–61.
Belkin, N. J., Kelly, D., Kim, G., Kim, J., Lee, H., & Muresan, G. (2003). Query length in interactive information retrieval. Proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval (pp. 205–212). Toronto, Canada: ACM.
Bezerra, B. L. D., & Carvalho, F. D. A. T. D. (2004). A symbolic approach for content-based information filtering. Information Processing Letters, 92(1), 45–52.
Boiko, B. (2001). Content management bible (illustrated ed.).
Bui, A. A. T., Morioka, C., Dionisio, J. D. N., Johnson, D. B., Sinha, U., Ardekani, S., Taira, R. K., et al. (2007). openSourcePACS: An Extensible Infrastructure for Medical Image Management. IEEE Transactions on Information Technology in Biomedicine, 11(1), 94–109.
Burkey, J., & Kuechler, W. (2003). Web-based surveys for corporate information gathering: A bias-reducing design framework. IEEE Transactions on Professional Communication, 46(2), 81–93.
Cestinik, B. (1990). Estimating Probabilities: A Crucial Task in Machine Learning. Proceedings of European Conference on Artificial Intelligence (pp. 147–149). Stockholm, Sweden.
Chang, H.-T. (2008). Automatic web image annotation for image retrieval systems. Proceedings of the 12th WSEAS international conference on Systems (pp. 670–674). Stevens Point, Wisconsin, USA: World Scientific and Engineering Academy and Society (WSEAS).
Chang, Y.-S., Yuan, S.-M., & Lo, W. (2000). A new multi-search engine for querying data through an Internet search service on CORBA. Computer Networks, 34(3), 467–480.
Chen, G., Wang, F., & Zhang, C. (2009). Collaborative filtering using orthogonal nonnegative matrix tri-factorization. Information Processing & Management, 45(3), 368–379.
Chen, H.-N., Chung, K.-L., & Hung, J.-E. (2010). Novel fractal image encoding algorithm using normalized one-norm and kick-out condition. Image and Vision Computing, 28(3), 518–525.
Chen, M.-Y., Chu, H.-C., & Chen, Y.-M. (2010). Developing a semantic-enable information retrieval mechanism. Expert Systems with Applications, 37(1), 322–340.
Chen, T.-Y., Chen, C.-H., Wang, D.-J., & Chiou, Y.-C. (2009). Real-Time Video Object Segmentation Algorithm Based on Change Detection and Background Updating. International Journal of Innovative Computing, Information and Control, 5(7), 1797–1810.
Chen, Y.-L., Kuo, M.-H., Wu, S.-Y., & Tang, K. (2009). Discovering recency, frequency, and monetary (RFM) sequential patterns from customers’ purchasing data. Electronic Commerce Research and Applications, 8(5), 241–251.
Cheng, F.-T., Chen, Y.-T., Su, Y.-C., & Zeng, D.-L. (2008). Evaluating Reliance Level of a Virtual Metrology System. IEEE Transactions on Semiconductor Manufacturing, 21(1), 92–103.
Chu, H., Chen, M., & Chen, Y. (2009). A semantic-based approach to content abstraction and annotation for content management. Expert Systems with Applications, 36(2), 2360–2376.
Chu, W.-T., Li, C.-J., & Tseng, S.-C. (2011). Travelmedia: An intelligent management system for media captured in travel. Journal of Visual Communication and Image Representation, 22(1), 93–104.
Chu, W.-T., & Lin, C.-H. (2010). Consumer photo management and browsing facilitated by near-duplicate detection with feature filtering. Journal of Visual Communication and Image Representation, 21(3), 256–268.
Cooper, W. S. (1971). A definition of relevance for information retrieval. Information Storage and Retrieval, 7(1), 19–37.
Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20(3), 273–297.
Dashevskiy, M., & Luo, Z.-Y. (2009). Predictions with Confidence in Applications. In P. Perner (Ed.), Machine Learning and Data Mining in Pattern Recognition, Lecture Notes in Computer Science (Vol. 5632, pp. 775–786). Springer Berlin / Heidelberg.
Delany, S. J., Cunningham, P., Doyle, D., & Zamolotskikh, A. (2005). Generating Estimates of Classification Confidence for a Case-Based Spam Filter. In H. Muñoz-Avila & F. Ricci (Eds.), Case-Based Reasoning Research and Development, Lecture Notes in Computer Science (Vol. 3620, pp. 177–190). Springer Berlin / Heidelberg.
Denoyer, L., & Gallinari, P. (2004). Bayesian network model for semi-structured document classification. Information Processing & Management, 40(5), 807–827.
Dudbridge, F. (1995). Least-squares block coding by fractal functions. Fractal image compression (pp. 229–241). London, UK: Springer-Verlag.
Escalante, H. J., Montes-y-Goméz, M., & Sucar, L. E. (2011). An energy-based model for region-labeling. Computer Vision and Image Understanding, 115(6), 787–803.
Falconer, K. (1990). Fractal Geometry: Mathematical Foundations and Applications (1st ed.). John Wiley & Sons.
Fu, H., Chi, Z., & Feng, D. (2010). Recognition of attentive objects with a concept association network for image annotation. Pattern Recognition, 43(10), 3539–3547.
Furao, S., & Hasegawa, O. (2004). A fast no search fractal image coding method. Signal Processing: Image Communication, 19(5), 393–404.
Goh, K.-S., Chang, E. Y., & Li, B. (2005). Using one-class and two-class SVMs for multiclass image annotation. IEEE Transactions on Knowledge and Data Engineering, 17(10), 1333–1346.
Golovchinsky, G., Pickens, J., & Back, M. (2009). A Taxonomy of Collaboration in Online Information Seeking. Proceedings of 1st International Workshop on Collaborative Information Seeking. Pittsburgh, Pennsylvania, USA.
Gong, Z.-G., Liu, Q., & Zhang, J.-B. (2006). Automatic Image Annotation by Mining the Web. In A. M. Tjoa & J. Trujillo (Eds.), Data Warehousing and Knowledge Discovery (Vol. 4081, pp. 449–458). Berlin, Heidelberg: Springer Berlin Heidelberg.
Guerrero-Bote, V., Moya-Anegón, F., & Herrero-Solana, V. (2002). Document organization using Kohonen’s algorithm. Information Processing & Management, 38(1), 79–89.
Hanbury, A. (2008). A survey of methods for image annotation. Journal of Visual Languages and Computing, 19(5), 617–627.
Hastie, T., & Tibshirani, R. (1998). Classification by pairwise coupling. The Annals of Statistics, 26(2), 451–471.
Hosmer, D. W., & Lemeshow, S. (2000). Applied logistic regression. John Wiley and Sons.
Hsu, C.-W., & Lin, C.-J. (2002). A comparison of methods for multiclass support vector machines. IEEE Transactions on Neural Networks, 13(2), 415–425.
Hu, W.-C., Chen, C.-H., Huang, D.-Y., & Ye, Y.-T. (2012). Video object segmentation in rainy situations based on difference scheme with object structure and color analysis. Journal of Visual Communication and Image Representation, 23(2), 303–312.
Huang, T.-K., Weng, R. C., & Lin, C.-J. (2006). Generalized Bradley-Terry Models and Multi-Class Probability Estimates. The Journal of Machine Learning Research, 7, 85–115.
Idrissi, K., Lavoué, G., Ricard, J., & Baskurt, A. (2004). Object of interest-based visual navigation, retrieval, and semantic content identification system. Computer Vision and Image Understanding, 94(1-3), 271–294.
Jansen, B. J., Spink, A., & Saracevic, T. (2000). Real life, real users, and real needs: A study and analysis of user queries on the web. Information Processing and Management, 23(2), 207–227.
Kelly, D., & Fu, X. (2007). Eliciting better information need descriptions from users of information search systems. Information Processing and Management, 43(1), 30–46.
Kim, Y., Kwon, S., & Heun Song, S. (2006). Multiclass sparse logistic regression for classification of multiple cancer types using gene expression data. Computational Statistics & Data Analysis, 51(3), 1643–1655.
Kohonen, T. (1982). Self-organized formation of topologically correct feature maps. Biological Cybernetics, 43(1), 59–69.
Kohonen, T. (2001). Self-organizing maps. Springer.
Kwok, S. H., & Zhao, J. L. (2006). Content-based object organization for efficient image retrieval in image databases. Decision Support Systems, 42(3), 1901–1916.
Laleci, G. B., Aluc, G., Dogac, A., Sinaci, A., Kilic, O., & Tuncer, F. (2010). A semantic backend for content management systems. Knowledge-Based Systems, 23(8), 832–843.
Lancster, F. (1995). Needs, Demands and Motivations in the Use of Sources of Information. Journal of Information , Communication, and Library Science, 1(3), 3–19.
Li, B., Zheng, C.-H., Huang, D.-S., Zhang, L., & Han, K. (2010). Gene expression data classification using locally linear discriminant embedding. Computers in Biology and Medicine, 40(10), 802–810.
Li, M., Liu, L., & Li, C.-B. (2011). An approach to expert recommendation based on fuzzy linguistic method and fuzzy text classification in knowledge management systems. Expert Systems with Applications, 38(7), 8586–8596.
Li, Z.-X., Shi, Z.-P., Liu, X., Li, Z.-Q., & Shi, Z.-Z. (2010). Fusing semantic aspects for image annotation and retrieval. Journal of Visual Communication and Image Representation, 21(8), 798–805.
Li, Z.-X., Shi, Z.-P., Liu, X., & Shi, Z.-Z. (2011). Modeling continuous visual features for semantic image annotation and retrieval. Pattern Recognition Letters, 32(3), 516–523.
Liagkou, V. (2012). A trustworthy architecture for managing cultural content. Mathematical and Computer Modelling, (In Press).
Lim, C. H. (2011). Secure Code Dissemination and Remote Image Management Using Short-Lived Signatures in WSNs. Communications Letters, IEEE, 15(4), 362–364.
Lin, Y.-L., & Wu, M.-S. (2011). An edge property-based neighborhood region search strategy for fractal image compression. Computers & Mathematics with Applications, 62(1), 310–318.
Liu, J., Wang, B., Lu, H.-Q., & Ma, S. (2008). A graph-based image annotation framework. Pattern Recognition Letters, 29(4), 407–415.
Liu, K.-C., Sun, L., & Fu, Y. (2008). Ontological modelling of content management and provision. Information and Software Technology, 50(11), 1155–1164.
Liu, R., & Lin, W. (2003). Mining for interactive identification of users’ information needs. Information Systems, 28(7), 815–833.
London, R. C. (2000). Relating information-need to the cancer experience. European Journal of Cancer Care, 9, 41–47.
Lu, J.-J., Zhao, T.-Z., & Zhang, Y.-F. (2008). Feature selection based-on genetic algorithm for image annotation. Knowledge-Based Systems, 21(8), 887–891.
Makadia, A., Pavlovic, V., & Kumar, S. (2010). Baselines for Image Annotation. International Journal of Computer Vision, 90(1), 88–105.
Mandelbrot, B. B. (1982). The Fractal Geometry of Nature (1st ed.). W. H. Freeman.
Manevitz, L., & Yousef, M. (2007). One-class document classification via Neural Networks. Neurocomputing, 70(7-9), 1466–1481.
Meng, W., Yu, C., & Liu, K.-L. (2002). Building efficient and effective metasearch engines. ACM Computing Surveys, 34(1), 48–89.
Mezaris, V., Kompatsiaris, I., & Strintzis, M. G. (2004). Region-based image retrieval using an object ontology and relevance feedback. EURASIP Journal on Applied Signal Processing, 2004, 886–901.
Miao, D., Duan, Q., Zhang, H., & Jiao, N. (2009). Rough set based hybrid algorithm for text classification. Expert Systems with Applications, 36(5), 9168–9174.
Nikhil, R., Moreno, P. J., & Vasconcelos, N. (2007). Bridging the Gap: Query by Semantic Example. IEEE Transactions on Multimedia, 9(5), 923–938.
Nouali, O., & Blache, P. (2004). A semantic vector space and features-based approach for automatic information filtering. Expert Systems with Applications, 26(2), 171–179.
Øien, G. E., & Lepsøy, S. (1995). A class of fractal image coders with fast decoder convergence. Fractal image compression (pp. 153–175). London, UK: Springer-Verlag.
Pitkow, J., Schütze, H., Cass, T., Cooley, R., Turnbull, D., Edmonds, A., Adar, E., et al. (2002). Personalized search. Communications of the ACM, 45(9), 50–55.
Pivk, A., & Gams, M. (2002). Domain-dependent information gathering agent. Expert Systems with Applications, 23(3), 207–218.
Platt, J. C. (2000). Probabilistic Outputs for Support Vector Machines and Comparison to Regularized Likelihood Methods. Advances in Large Margin Classifiers (pp. 61–74). Cambridge, MA: MIT Press.
Polvere, M., & Nappi, M. (2000). Speed-up in fractal image coding: comparison of methods. IEEE Transactions on Image Processing, 9(6), 1002–1009.
Poppe, C., Martens, G., Mannens, E., & Van de Walle, R. (2009). Personal content management system: A semantic approach. Journal of Visual Communication and Image Representation, 20(2), 131–144.
Rajan, K., Ramalingam, V., Ganesan, M., Palanivel, S., & Palaniappan, B. (2009). Automatic classification of Tamil documents using vector space model and artificial neural network. Expert Systems with Applications, 36(8), 10914–10918.
Rao, K. S., & Prasad, B. E. (1991). KIMS: knowledge-based image management system. Knowledge-Based Systems, 4(4), 209–214.
Robertson, S. E., & Jones, K. S. (1988). Relevance weighting of search terms. Document retrieval systems (pp. 143–160). London, UK: Taylor Graham Publishing.
Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1986). Learning internal representations by error propagation. Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations (pp. 318–362). MIT Press.
Salter, J., & Antonopoulos, N. (2006). CinemaScreen recommender agent: combining collaborative and content-based filtering. Intelligent Systems, IEEE, 21(1), 35–41.
Salton, G., Wong, A., & Yang, C. S. (1975). A vector space model for automatic indexing. Communications of the ACM, 18(11), 613–620.
Samavi, S., Habibi, M., Shirani, S., & Rowshanbin, N. (2010). Real time fractal image coder based on characteristic vector matching. Image and Vision Computing, 28(11), 1557–1568.
Saupe, D. (1995). Accelerating fractal image compression by multi-dimensional nearest neighbor search. Data Compression Conference, 1995. DCC ’95. Proceedings (pp. 222–231). Presented at the Data Compression Conference, 1995. DCC ’95. Proceedings, IEEE.
Shen, X.-H., & Zhai, C.-X. (2005). Active feedback in ad hoc information retrieval. Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR ’05 (pp. 59–66). New York, USA: ACM.
Shenton, A. K., & Dixon, P. (2004). The nature of information needs and strategies for their investigation in youngsters. Library & Information Science Research, 26(3), 296–310.
Shih, C.-W., Chen, M.-Y., Chu, H.-C., & Chen, Y.-M. (2012). Enhancement of information seeking using an information needs radar model. Information Processing & Management, 48(3), 524–536.
Shtykh, R. Y., & Jin, Q. (2008). Capturing user contexts: Dynamic profiling for information seeking tasks. Proceedings of the 2008 third international conference on systems and networks communications (pp. 365–370). Presented at the the international conference on systems and networks communications, IEEE Computer Society.
Singh, M., Curran, E., & Cunningham, P. (2008). Active Learning for Multi-Label Image Annotation. Proceedings of 19th Irish Conference on Artificial Intelligence and Cognitive Science (AICS 2008). Cork, Ireland.
Su, J.-H., Huang, W.-J., Yu, P. S., & Tseng, V. S. (2011). Efficient Relevance Feedback for Content-Based Image Retrieval by Mining User Navigation Patterns. IEEE Transactions on Knowledge and Data Engineering, 23(3), 360–372.
Su, X., & Khoshgoftaar, T. M. (2009). A Survey of Collaborative Filtering Techniques. Advances in Artificial Intelligence, 2009, 1–19.
Tamás, K. (2008). A fast classification based method for fractal image encoding. Image and Vision Computing, 26(8), 1129–1136.
Tan, B., Shen, X.-H., & Zhai, C.-X. (2006). Mining long-term search history to improve search accuracy. Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD ’06 (pp. 718–723). New York, USA: ACM.
Tan, P.-N., Steinbach, M., & Kumar, V. (2006). Introduction to data mining. Pearson Addison Wesley.
Vailaya, A., Figueiredo, M. A. ., Jain, A. K., & Zhang, H.-J. (2001). Image classification for content-based indexing. IEEE Transactions on Image Processing, 10(1), 117–130.
Wang, X.-Y., Wang, Y.-X., & Yun, J.-J. (2010). An improved no-search fractal image coding method based on a fitting plane. Image and Vision Computing, 28(8), 1303–1308.
Wang, Y., Mei, T., Gong, S., & Hua, X.-S. (2009). Combining global, regional and contextual features for automatic image annotation. Pattern Recognition, 42(2), 259–266.
Wong, R. C. ., & Leung, C. H. . (2008). Automatic Semantic Annotation of Real-World Web Images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(11), 1933–1944.
Wu, T.-F., Lin, C.-J., & Weng, R. C. (2004). Probability Estimates for Multi-class Classification by Pairwise Coupling. The Journal of Machine Learning Research, 5, 975–1005.
Xu, Z., & Akella, R. (2010). Improving probabilistic information retrieval by modeling burstiness of words. Information Processing & Management, 46(2), 143–158.
Yang, Y., Zhang, J., Carbonell, J., & Jin, C. (2002). Topic-conditioned novelty detection. Proceedings of the eighth ACM SIGKDD international conference on knowledge discovery and data mining KDD’02 (pp. 688–693). New York, USA: ACM.
Yu, C. T., & Salton, G. (1976). Precision Weighting-An Effective Automatic Indexing Method. Journal of the ACM, 23(1), 76–88.
Zhang, J., Ghahramani, Z., & Yang, Y. (2004). A probabilistic model for online document clustering with application to novelty detection. Presented at the the eighteenth annual conference on neural information processing systems, Vancouver, Canada.
Zhang, S., Li, B., & Xue, X. (2010). Semi-automatic dynamic auxiliary-tag-aided image annotation. Pattern Recognition, 43(2), 470–477.
Zhao, Y.-F., Zhao, Y., & Zhu, Z.-F. (2009). TSVM-HMM: Transductive SVM based hidden Markov model for automatic image annotation. Expert Systems with Applications, 36(6), 9813–9818.
Zhu, S.-H., & Liu, Y.-C. (2009). Semi-Supervised Learning Model Based Efficient Image Annotation. IEEE Signal Processing Letters, 16(11), 989–992.