| 研究生: |
黃文彬 Huang, Win-Bin |
|---|---|
| 論文名稱: |
多媒體系統之壓縮、半色調與浮水印研究 Researches on Compression, Halftone and Watermark in Multimedia System |
| 指導教授: |
郭耀煌
Kuo, Yau-Hwang 蘇文鈺 Su, Wen-Yu |
| 學位類別: |
博士 Doctor |
| 系所名稱: |
電機資訊學院 - 資訊工程學系 Department of Computer Science and Information Engineering |
| 論文出版年: | 2009 |
| 畢業學年度: | 97 |
| 語文別: | 英文 |
| 論文頁數: | 103 |
| 中文關鍵詞: | 易碎型浮水印 、強健型浮水印 、半色調影像處理 、H.264視訊壓縮標準 、視訊壓縮 、離散小波轉換 、靜態影像壓縮 |
| 外文關鍵詞: | Discrete Wavelet Transform, Still Image Compression, Video Compression, H.264 Video Coding Standard, Fragile Watermarking, Robust Watermarking, Set Partitioning in Hierarchical Trees, Halftone Image Processing |
| 相關次數: | 點閱:148 下載:2 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
近年來,高影像品質和高效能的壓縮比率是在多媒體壓縮系統裡的最重要的問題。而且,圖像完整驗證和版權保護在數位生活裡變得越來越必要。本論文將針對多媒體系統中四種重要的議題探討,分別為小波式影像壓縮、半色調影像處理、加速動態影像壓縮、及影像浮水印。在第一項議題研究裡,我們提出一個根據硬體實作上的需要,適當修改的階層式集合分割影像壓縮技術(SPIHT)編碼器並提高其硬體實現效率。階層式集合分割影像壓縮技術(SPIHT)是一種基於離散小波轉換的高效率影像壓縮技術。雖然它的壓縮效率比JPEG2000所採用的最佳方式切割的崁入式區塊編碼技術(EBCOT)稍差,但SPIHT是直接編碼影像的程序並且不需要須事先儲存的表格。這優點使得SPIHT更適合應用於低成本的硬體應用上。我們針對SPIHT演算法修改的部分,包括係數掃描過程的簡化和固定記憶體配置方式,係數掃描過程的簡化是針對小波係數的索引方式,利用一維的位置索引方式代替原先二為位置索引,而固定記憶體配置方式是將原先使用於SPIHT的動態配置方法改成資料列表的方式。雖然影像失真有些微的增加,但這些修改會讓SPIHT在硬體實作上變得更容易。
在第二項議題研究裡,我們提出基於類神經網路混合數位影像的半色調(Halftoning)和反轉半色調(Inverse Halftoning)回復連續影像的演算法。在此演算法中,半色調影像是由一階層類神經網路邏輯(SLPNN)所建立,此外,此半色調影像所對應的連續色調影像是由半徑式基底函數類神經網路 (RBFNN)所重建。這合併訓練程序同時產生半色調影像和其所對應的連續色調影像。利用我們所提出的演算法所取得的連續色調影像的峰值雜訊比(PSNR)效能和視覺影像品質,已與目前文獻上著名的反轉半色調方法相比較。並且本演算法所產生的半色調影像也比利用現存著名的擴散網紋半色調演算法所產生的結果在視覺上有更好的表現。
在第三項議題研究裡,我們提出一個加速完成H.264/AVC編碼壓縮的演算法,是利用其主要在執行眾多內部預測模式或外部預測模式之前,先行評估適當預測模式發生之可能性並省去部分模式的檢驗,即可先一步的將所需壓縮時間。在碼率失真最佳化H.264/AVC(縮寫為H.264/RDO)編碼器中,最佳的運算模式是由最小的Lagrange代價所決定,這是基於在計算壓縮率與失真代價之後,同時考慮可能的失真和所需的編碼位元數。H.264/RDO編碼器可以達到很好的壓縮率和影像失真之間的平衡,但同時需要許多額外的計算時間。我們所提出的方法稱為兩階層編碼模式決策演算法(TSMD),這方法將暫停一編碼區塊所不需要的模式,並且只執行可能成為最佳的編碼模式。TSMD 內含有兩階層的決定過程︰第一個階段將根據先前巨區塊(Macro-Block)和前一影像(Video-Frame)所需要的編碼模式,預測現在巨區塊一些可能的編碼模式。第二個階段將同時利用,預先計算好的統計結果或者預先訓練好的半徑式基底函數類神經網路(BPN)的去重新評估決策。根據我們的實驗結果,與原先的編碼時間相比,超過50%的計算時間被降低,並在峰值雜訊比(PSNR)上只有些微的降低,此輕微的失真是不容易在視覺上被意識到。此研究的實驗部分,全部程式是使用H.264/AVC標準參考軟體,JM 9.2。
在最後的議題研究裡,我們提出一個以特徵點為基礎的數位影像浮水印演算法,這個演算法達到同時具有影像驗証跟版權保護的目標。首先,我們提出的浮水印崁入演算法是利用Hessian-Affine特徵偵測器萃取影像的特徵區域。然後版權浮水印將根據每個像素的區域內的特徵方向被嵌入這些特徵區域。而且,除了版權浮水印的嵌入區域外,我們再利用區塊式易碎型浮水印針對剩餘空間嵌入影像驗証浮水印。相同的,我們所提出的浮水印萃取的演算法將獨立地依循上述程序從浮水印影像去萃取版權浮水印和影像驗証資訊。為了驗證我們提出的演算法的穩定強健性,各種不同的攻擊被應用於本演算法所產生之浮水印影像。在這研究的實驗結果,顯示出我們所提出的演算法所產生的浮水印圖案可以抵抗大多數移動和幾何學攻擊。而且,改變或修改影像將可以反映在我們所隱藏的浮水印上。
Recently, high visual quality and efficient compression rate are the most important issues in multimedia compression system. Besides, image authentication and copyright protection become more and more essential in our digital life. Four important topics, wavelet image compression, halftone image processing, the speedup of video compression and image watermarking, in multimedia system are presented in this paper. In the first research, the hardware implementation of a modified efficient SPIHT coder is presented. Set Partitioning in Hierarchical Trees (SPIHT) is a highly efficient technique for compressing Discrete Wavelet Transform (DWT) decomposed images. Though its compression efficiency is a little less famous than Embedded Block Coding with Optimized Truncation (EBCOT) adopted by JPEG2000, SPIHT has a straight forward coding procedure and requires no tables. These make SPIHT the more appropriate algorithm for lower cost hardware implementation. The modifications include a simplification of coefficient scanning process, a 1-D addressing method instead of the original 2-D arrangement of wavelet coefficients, and a fixed memory allocation for the data lists instead of a dynamic allocation approach required in the original SPIHT. Although the distortion is slightly increased, it facilitates an extremely fast throughput and easier hardware implementation.
In the second research, a hybrid neural network based method for halftoning and inverse halftoning of digital images is presented. The halftone image is performed by Single-Layer Perceptron neural network (SLPNN), and its corresponding continuous-tone image is reconstructed by Radial-Basis Function neural network (RBFNN). The combined training procedure produces halftone images and the corresponding continuous tone images at the same time. The PSNR performance and visual image quality of these contone images achieved is comparable to the well-known inverse halftoning methods. Moreover, the halftone images obtained are also visually good.
In the third research, the speedup of H.264/AVC coders by examining the predicted modes in a code block is described. In a rate-distortion optimized H.264/AVC (H.264/RDO) coder, the best mode with the minimal Lagrange cost is selected after calculating the rate-distortion (R-D) cost which simultaneously considers the distortion and the required number of bits in all possible modes. H.264/RDO coders achieve a good balance between bit rate and image distortion, but lots of additional computing power is required. The proposed method, called two-staged mode decision (TSMD), only executes the modes which might be the best mode used in H.264/RDO coders with other modes disabled. TSMD employs a two-staged decision process: the first stage is to predict some probable encoding modes according to the information when one encodes the preceding macro-blocks and video frames. The second stage refines the decision with either the pre-computed look-up-table or the pre-constructed Back-Propagation Neural (BPN) network. According to our experiment results, over 50% of the computation time is reduced with a small loss in Peak Signal-to-Noise ratio (PSNR) and a slight bit-rate is incremented which are not easily sensed by human eyes. All programs are based on the H.264/AVC standard reference software, JM 9.2.
In the last research, a feature-based robust digital image watermarking algorithm is proposed to achieve the goal of image authentication and protection simultaneously. The Hessian-Affine feature detector is at first adopted to extract characteristic regions of an image in the proposed watermark embedding scheme. Then the copyright watermark is embedded into these characteristic regions according to the local orientation of each pixel. Moreover, the remainder regions are applied for image authentication by using block-wise fragile watermarking method. Similarly, the proposed watermark detection scheme follows the above procedure to extract the copyright watermark and the authentic information independently and blindly from watermarked images. Various attacks are also applied to the watermarked images in order to examine the robustness of our algorithm. The experimental results show that the proposed watermarking algorithm can resist most removal and geometric attacks. Besides, changes of an image will be reflected in our hidden watermarks.
[ALT00] Altera Inc. Available: http://www.altera.com/products/devices/apex/ overview/ apx-overview.html.
[ANA92] M. Analoui and J. P. Allebach, "Model-based halftoning using direct binary search," Proc. SPIE, vol. 1666, pp. 96-108, Feb. 1992.
[ANT92] M. Antonini, M. Barlaud, P. Mathieu, and I. Daubechies, "Image coding using wavelet transform," IEEE Trans. Image Processing, vol. 1, pp. 205-220, Apr. 1992.
[ARS05] E. Arsura, L. D. Vecchio, R. Lancini, and L. Nisti, "Fast Macroblock Intra and Inter Modes Selection for H.264/AVC," Proc. IEEE Int. Conf. Multimedia and Expo, pp. 378-381, July 2005.
[AVC03] Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification (ITU-T Rec. H.264 | ISO/IEC 14496-10 AVC), ISO/IEC JTC1 /SC29/WG11, May 2003.
[BAS02] P. Bas, J. M. Chassery, and B. Macq, "Geometrically invariant watermarking using feature points," IEEE Trans. Image Processing, Vol. 11, No. 9, pp. 1014-1028, Sept. 2002.
[BAY73] B. E. Bayer, "An optimum method for two-level rendition of continuous-tone prictures," Proc. IEEE Int. Conf. Communications, vol. 26, pp. 2611-2615, June 1973.
[BIN00] Available: http://cad.csie.ncku.edu.tw/~huangwb/research.html.
[BJO01] G. Bjontegaard, "Calculation of Average PSNR Difference between RDcurves," ITU-T VCEG, Doc. VCEG-M33, March 2001.
[BOZ99] G. Bozkurt, and A. E. Cetin, "Restoration of error diffused images using POCS," Proc. IEEE Int. Conf. Acoustics, Speech, and Singal Processing, vol. 6, pp. 3225-3228, March 1999.
[CHA01] P. C. Chang, C. S. Yu, and T. H. Lee, "Hybrid LMS-MMSE inverse halftoning technique," IEEE Trans. Image Processing, vol. 10, pp. 95-103, Jan. 2001.
[CHA06] Wei-Chen Chang, Jing-Xin Wang and Alvin W. Y. Su, "Harmonic Structure Reconstruction in Audio Compression Method Based on Spectral Oriented Trees," 120th Convention of Audio Engineering Society, Paper 6809, Paris, France, May 20-23, 2006.
[COX01] I. J. Cox, M. L. Miller, and J. A. Bloom, Digital Watermarking San Francisco, CA: Morgan Kaufman, 2001.
[DAI05] Q. Dai, D. Zhu, and R. Ding, "Fast Mode Decision for Inter Prediction in H.264," Proc. IEEE Int. Conf. Image Processing, vol. 1, pp. 119-122, Oct. 2005.
[DUN01] C. Dunn, "Efficient Audio Coding with Fine-Grain Scalability," presented at the 111th Convention of the Audio Engineering Society, New York, 30 Nov.- 3 Dec. 2001.
[FAN04] Z. Fan and Z. Xudong, "Fast Macroblock Mode Decision in H.264," Proc. IEEE Int. Conf. Signal Processing, vol. 2, pp. 1179-1182, Sept. 2004.
[FAN92] Z. Fan, "Retrieval of images from digital halftones," Proc. IEEE Int. Symp. Circuits systems, pp. 313-316, May 1992.
[FLO76] R. W. Floyd and L. Steinberg, "An adaptive algorithm for spatial grayscale," J. Soc. Inform Display, vol. 17, no.2, pp. 75-77, 1976.
[FRY05] T. W. Fry and S. A. Hauck, "SPIHT image compression on FPGAs," IEEE Tran. On Circuits and Systems for Video Technology, vol. 15, no. 9, pp.1138-1147, Sept. 2005.
[HAL00] http://www.ece.utexas.edu/~bevans/projects/halftoning/ index.html.
[HAN04] K. H. Han and Y. L. Lee, "Fast Macroblock Mode Decision in H.264," Proc. IEEE Int. Conf. TENCON, vol. 1, pp. 347-350, Nov. 2004.
[HAY99] S. Haykin, Neural Networks: A Comprehensive Foundation, 2nd edition, New Jersey: Prentice Hall, 1999.
[HE01] Zhihai He and S. K. Mitra, "A unified Rate-Distortion analysis framework for transform coding," IEEE Tran. On Circuits and Systems for Video Technology, vol. 11, no. 12, pp.1221-1236, Dec. 2001.
[HEI95] S. Hein and A. Zakhor, "Halftone to continuous-tone conversion of error-diffusion coded images," IEEE Trans. Image Processing, vol. 4, pp. 208-216, Feb. 1995.
[HUA03] W. B. Huang, Y. J. Chang, Alvin W. Y. Su and Yau Hwang Kuo, "VLSI Design Of A DWT/Modified Efficient SPIHT Based Image Codec," International Conference on Information, Communications & Signal Processing - Pacific-Rim Conference On Multimedia (ICICS-PCM) Symposium, Singapore, December 2003.
[IMA00] http://ise.stanford.edu/class/psych221/projects/02/deleon/htcolor .html.
[IPR00] Center for Image Processing Research, Electrical, Computer, and Systems Engineering Department Rensselaer Polytechnic Institute. http://www.cipr.rpi.edu.
[JPE00] Information Technology - JPEG 2000 Image Coding System, ISO/IEC FCD15444-1: 2000 (V1.0, Mar. 2000), page 31.
[JPE01] ISO/IEC 15444-1:2000(E), Information technology - JPEG 2000 Image Coding System - Part 1: Core Coding System.
[JVT00] Joint Video Team (JVT) Reference Software [Online]. Available: http://iphome.hhi.de/suehring/tml.
[KAN99] H. R. Kang, Digital Color Halftoning, Bellingham, WA: SPIE Optical Engineering Press, 1999.
[KIM02] S. H. Kim, and J. P. Allebach, "Impact of HVS model on model-based halftoning," IEEE Trans. Image Processing, vol. 11, pp. 258-269, Mar. 2002.
[KIM03] K. P. Lim, S. Wu, D. J. Wu, S. Rahardja, X. Lin, F. Pan, and Z. G. Li, "Fast INTER mode selection," JVT of MPEG and VCEG, Doc. JVT-J033, Dec. 2003.
[KIM04] Y. H. Kim, J. W. Yoo, S. W. Lee, J. Shin, J. Paik and H. K. Jung, "Adaptive Mode Decision for H.264 Encoder," IEE Electronics Letters, vol. 40, pp. 1172-1173, Sept. 2004.
[KIT00] T. D. Kite, N. D.- Venkata, B. L. Evans, and A. C. Bovik, "A fast high quality inverse halftoning algorithm for error diffused halftones," IEEE Trans. Image Processing, vol. 9, pp. 1583-1592, Sept. 2000.
[KUO04-1] C. H. Kuo, M. Shen, and C.-C. J. Kuo, "Fast Inter-Prediction Mode Decision and Motion Search for H.264," Proc. IEEE Int. Conf. Multimedia and Expo, vol. 1, pp. 663-666, June 2004.
[KUO04-2] T. Y. Kuo and C. H. Chan, "Fast Macroblock Partition Prediction for H.264/AVC," Proc. IEEE Int. Conf. Multimedia and Expo, vol. 1, pp. 675-678, June 2004.
[LEE04] J. Lee and B. Jeon, "Fast Mode Decision for H.264," Proc. IEEE Int. Conf. Multimedia and Expo, vol. 2, pp. 1131-1134, June 2004.
[LEE06] H. Y. Lee, H. Kim and H. K. Lee, "Robust image watermarking using local invariant features," Journal SPIE, Optical Engineering, Vol. 45, No. 3, March 2006.
[LI00] P. Li and J. P. Allebach, "Look-Up-Table based halftoning algorithm," IEEE Trans. Image Processing, vol. 9, pp. 1593-1603, Sept. 2000.
[LI04] P. Li, and J. P. Allebach, "Tone-Dependent Error Diffusion," IEEE Trans. Image Processing, vol. 13, pp.201-215, Feb. 2004.
[LIA03] Chung-Jr Lian, Kuan-Fu Chen, Hong-Hui Chen, and Liang-Gee Chen, "Analysis and architecture design of block-coding engine for EBCOT in JPEG 2000," IEEE Tran. On Circuits and Systems for Video Technology, vol. 13, no. 3, pp.219-230, Mar. 2003.
[LIC05] V. Licks, R. Jordan, "Geometric Attacks on Image Watermarking Systems," IEEE Multi-Media, Vol. 12, No. 3, pp. 68-78, Jul-Sept, 2005.
[LIE97] D. J. Lieberman and J. P. Allebach, "Efficient model based halftoning using direct binary search," Proc. Int. Conf. Image Processing, vol. 1, pp. 775-778, Oct. 1997.
[LIN01] C. Y. Lin, M. Wu, J. A. Bloom, I. J. Cox, M. L. Miller, and Y. M. Lui, "Rotation, scale and translation resilient watermarking for image," IEEE Trans. Image Processing, Vol. 10, No. 5, pp. 767-782, May 2001.
[LIN05] Y. K. Lin and T. S. Chang, "Fast Block Type Decision algorithm for Intra Prediction in H.264 FRExt," Proc. IEEE Int. Conf. Image Processing, vol. 1, pp. 585-588, Sept. 2005.
[LIN99] E. T. Lin and E. J. Delp, "Review of fragile image watermarks," in Proc. of Multimedia and Security Workshop (ACM Multimedia '99) , pp. 25-29, Oct. 1999.
[LIS03] P. List, A. Joch, J. Lainema, G. Bjontegaard, and M. Karczewicz, "Adaptive Deblocking Filter," IEEE Trans. Circuits Sys. Video Technol., vol. 13, pp. 614-619, July 2003.
[LU98] Z. Lu, and W. A. Pearlman, "An efficient, lowcomplexity audio coder delivering multiple levels of quality for interactive applications," IEEE Proceedings of Multimedia Signal Processing, pp. 529-534, Dec. 1998.
[LUT00] Available: http://www.systems.caltech.edu/mese/research/lutpaper.html.
[MAR00] M. W. Marcellin, M. J. Gormish, A. Bilgin and M. Boliek, "An overview of JPEG-2000," Proc. Of Data Compression Conference, Snowbird, Utah, pp. 523-541, Mar. 2000.
[MAR03] D. Marpe, H. Schwarz, and T. Wiegand, "Context-based Adaptive Binary Arithmetic Coding in the H.264/AVC Video Compression Standard," IEEE Trans. Circuits Sys. Video Technol., vol 13, pp. 620-636, July 2003.
[MAT02] J. Matas, O. Chum, M. Urban, and T. Pajdla, "Robust wide baseline stereo from maximally stable extremal regions," in Proc. of the British Machine Vision Conference, pp. 384-393, 2002.
[MES01] M. Mese and P. P. Vaidyanathan, "Look Up Table (LUT) for inverse halftoning," IEEE Trans. Image Processing, vol. 10, pp. 1566-1578, Oct. 2001.
[MES02] M. Mese, and P. P. Vaidyanathan, "Tree-Structured method for LUT inverse halftoning and for image halftoning," IEEE Trans. Image Processing, vol. 11, pp. 644-655, June 2002.
[MIK04] K. Mikolajczyk and C. Schmid, "Scale and affine invariant interest point detectors," Int. J. Computer Vision, Vol. 60, No. 1, pp. 63-86, Oct. 2004.
[MIK05] K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas, F. Schaffalitzky, T. Kadir, and L. V. Gool, "A comparison of affine region detectors," Int. J. Computer Vision, Vol.65, No 1-2, pp. 43-72, Nov. 2005.
[MUL92] J. B. Mulligan and A. J. Ahumada, Jr., "Principled halftoning based on human vision models," Proc. SPIE, vol. 1666, pp.109-121, Feb. 1992.
[PAN05] F. Pan, X. Lin, S. Rahardja, K. P. Lim, Z. G. Li, D. Wu and S. Wu, "Fast Mode decision Algorithm for Intraprediction in H.264/AVC Video Coding," IEEE Trans. Circuits Sys. Video Technol., vol. 15, pp. 813-822, July 2005.
[PAP92] T. N. Pappas and D. L. Neuhoff, "Least-squares model-based halftoning," Proc. SPIE vol. 1666, pp. 165-176, Feb. 1992.
[PEN93] W. B. Pennebaker and J. L. Mitchell, "JPEG still image compression standard," Van Nostrand Reinhold, New York, 1993.
[PER00] S. Pereira, and T. Pun, "Robust template matching for affine resistant image watermarks," IEEE Trans. Image Processing, Vol. 9, No.6, pp. 1123-1129, June 2000.
[PET00] Fabien A. P. Petitcolas, "Watermarking schemes evaluation," IEEE Signal Processing, Vol. 17, No. 5, pp. 58-64, Sept. 2000.
[RAA03] M. Raad, A. Mertins and I. Burnett, "Scalable to Lossless Audio Compression Based on Perceptual Set Partitioning in Hierarchical Trees (PSPIHT)," Proc. IEEE Int. Conf. Acoustics Speech Sig. Proc (ICASSP), 2003.
[RAB91] M. Rabbani, and P.W. Hones, Digital Image Compression Techniques, SPIE Opt. Eng. Press, Bellingham, Washington, 1991.
[REA01] M. D. Reavy and C. G. Boncelet, "An algorithm for compression of bilevel images," IEEE Trans. Image Processing, vol. 10, pp. 669-676, May 2001.
[REH05] J. U. Rehman and Y. Zhang, "Fast Intra Prediction Mode Decision Using Parallel Processing," Proc. IEEE Int. Conf. Machine Learning and Cybernetics, vol. 8, pp. 5094-5098, Aug. 2005.
[RIE93] M. Riedmiller and H. Braun, "A direct adaptive method for faster back propagation learning: the RPROP algorithm," Proc. Int. Conf. Neural Networks, San Francisco, 1993.
[RIV92] R. L. Rivest, "The MD5 message digest algorithm," Tech. Rep., 1992.
[SAI96] A. Said and W. A. Pearlman, "A new fast and efficient image codec based on set partitioning in hierarchical trees," IEEE Tran. On Circuits and Systems for Video Technology, vol. 6, no. 12, pp.243-250, June 1996.
[SCH01] R. J. Schilling, J. J. Carroll, Jr., and A. F. Al-Ajlouni, "Approximation of nonlinear systems with radial basis function neural networks," IEEE Trans. Neural Networks, vol. 12, pp. 1-15, Jan. 2001.
[SCH07] H. Schwarz, D. Marpe, and T. Wiegand, “Overview of the Scalable Video Coding Extension of the H.264/AVC Standard”, IEEE Trans. On Circuits and System for Video Technology, vol. 17, no. 9, pp. 1103-1120, Sept. 2007.
[SEO04] J. S. Seo and C. D. Yoo, "Localized image watermarking based on feature points of scale-space representation," Pattern Recognition, Vol. 37, Issue 7, pp. 1365-1375, July 2004.
[SHA93] J. M. Shapiro, "Embedded image coding using zerotrees of wavelet coefficients," IEEE Tran. on Signal Processing, vol. 41, issue. 12, pp. 3445-3462, Dec. 1993.
[SIN98] J. Singh, A. Antoniou, and D. J. Shpak, "Hardware implementation of a wavelet based image compression coder," IEEE Symp. Advances in Digital Filtering and Signal Processing, pp.169-173, 1998.
[STE97] R. L. Stevenson, "Inverse halftoning via MAP estimation," IEEE Trans. Image Processing, vol. 6, pp. 574-583, Apr. 1997.
[SU05] Alvin W. Y. Su, W. C. Chang and J. X. Wang, "A New Audio Compression Method Based On Spectral Oriented Trees," Proceedings of the Audio Engineering Society (AES) 118th Convention, Paper 6042, Barcelona, Spain, May 28-31, 2005.
[SUL01] G. J. Sullivan and G. Bjontegaard, "Recommended Simulation Common Conditions for H.26L Coding Efficiency Experiments on Low-resolution Progressive-scan Source Material," ITU-T VCEG, Doc. VCEG-N81, Sept. 2001.
[SUL91-1] J. R. Sullivan, L. A. Ray, and R. Miller, "Design of minimum visual modulation halftone patterns," IEEE Trans. Syst., Man, Cybern., vol. 21, pp.33-38, Jan. 1991.
[SUL91-2] G. J. Sullivan and R. L. Baker, "Rate-distortion optimized motion compensation for video compression using fixed or variable size blocks," Proc. IEEE Int. Conf. Global Telecomm. (GLOBECOM'91), pp. 85-90, Dec. 1991.
[SUL98] G. J. Sullivan and T. Wiegand, "Rate-distortion optimization for video compression," IEEE Signal Process. Mag., vol. 15, no. 11, pp. 74-90, Nov. 1998.
[SYN04] Synopsys Inc. (2004, Oct 24).Available: http://www.synopsys.com/ cgi-bin/ designware/ipdir.cgi?c=DW8051.
[TAN03] C. W. Tang and H. M. Hang, "A feature-based robust digital image watermarking scheme," IEEE Trans. Signal Processing, Vol. 51, No. 4, pp. 950-959, April 2003.
[TAU00] D. Taubman, "High performance scalable image compression with EBCOT," IEEE Tran. on Image Processing, vol. 9, no. 7, pp.1158-1170, Jul. 2000.
[THA97] N. T. Thao, "Set theoretic inverse halftoning," Proc. IEEE Int. Conf. Image Processing, vol. 1, pp. 783-786, Oct. 1997.
[TIN94] M. Y. Ting and E. A. Riskin, "Error-diffused iamge compression using a binary-to-gray-scale decoder and predictive pruned tree-structured vector quantization," IEEE Trans. Image Processing, vol. 3, pp. 854-858, 1994.
[TSE05] C. H. Tseng, H. M. Wang, and J. F. Yang, "Improved and Fast Algorithms for Intra 4x4 Mode Decision in H.264/AVC," Proc. IEEE Int. Conf. Circuits and Systems, vol. 3, pp. 2128-2131, May 2005.
[TUY04] T. Tuytelaars and L. Van Gool, "Matching widely separated views based on affine invariant regions," Int. J. Computer Vision, Vol. 59, No. 1, pp. 61-85, August 2004.
[VOL99] S. Voloshynovskiy, A. Herrigel, N. Baumgartner, and T. Pun, "A stochastic approach to content adaptive digital image watermarking," in Proc. 3rd Int. Workshop Information Hiding, pp. 211-236, Sept. 1999.
[WHE00] F. W. Wheeler and W. A. Pearlman, " SPIHT image compression without lists," IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Turkey, June 2000.
[WIE03-1] T. Wiegand, G. J. Sullivan, G. Bjontegaard, and A. Luthra, "Overview of the H.264/AVC video coding standard," IEEE Trans. Circuits Sys. Video Technol., vol. 13, pp. 560-576, July 2003.
[WIE03-2] T. Wiegand, H. Schwarz, A. Joch, F. Kossentini, and G. J. Sullivan, "Rate-constrained coder control and comparison of video coding standards," IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 7, pp. 688-703, Jul. 2003.
[WIE96] T. Wiegand, M. Lightstone, D. Mukherjee, T. G. Campbell, and S. K. Mitra, "Rate-Distortion Optimized Mode Selection for Very Low Bit Rate Video Coding and Emerging H.263 Standard," IEEE Trans. Circuits Sys. Video Technol., vol. 6, pp. 182-190, April 1996.
[WIL02] E. Wilson and S. M. Rock, "Gradient-based parameter optimization for systems containing discrete-valued functions," Int. J. Robust Nonlinear Control, vol. 12, pp. 1009-1028, 2002.
[WON01] P. W. Wong, and N. Memon, "Secret and public key image watermarking schemes for image authentication and ownership verification," IEEE Trans. Image Processing, Vol. 10, No. 10, pp. 1593-1601, Oct. 2001.
[WON95] P. W. Wong, "Inverse halftoning and kernel estimation for error diffusion", IEEE Trans. Image Processing, vol. 4, pp. 486-498, Apr. 1995.
[WU05] D. Wu, F. Pan, K. P. Lim, S. Wu, Z. G. Li, X. Lin, S. Rahardja, and C. C. Ko, "Fast Intermode Decision in H.264/AVC Video Coding," IEEE Trans. Circuits Sys. Video Technol., vol. 15, pp. 953-958, July 2005.
[XIL00] Xilinx Inc. Available: http://www.xilinx.com/bvdocs/publications/ ds022.pdf
[XIO96] Z. Xiong, M. T. Orchard and K. Ramchandran, "Inverse halftoning using wavelets," Proc. IEEE Int. Conf. Image Processign, vol. 1, pp. 569-572, Sep. 1996.
[ZHA05] D. Zhang, Y. Shen, S. Lin, and Y. Zhang, "Fast Inter Frame Encoding Based on modes Pre-decision in H.264," Proc. IEEE Int. Conf. Multimedia and Expo, pp. 530-533, July 2005.
[ZHU05-1] H. Zhu, C. K. Wu, Y. L. Wang, and Y. Fang, "Fast Mode Decision for H.264/AVC based on Macroblock Correlation," Proc. IEEE Int. Conf. Advanced Information Networking and Applications, vol. 1, pp. 775-780, March 2005.
[ZHU05-2] X. J. Zhu and S. A. Zhu, "Fast Mode Decision and Reduction of Reference Fames for H.264 Encoder," Proc. IEEE Int. Conf. Control and Automation, vol. 2, pp. 1040-1043, June 2005.