| 研究生: |
林棋勝 Lin, Chi-Sheng |
|---|---|
| 論文名稱: |
低功率全平行內容可定址記憶體電路設計 Design of Low-Power Fully Parallel Content-Addressable Memories |
| 指導教授: |
劉濱達
Liu, Bin-Da |
| 學位類別: |
博士 Doctor |
| 系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
| 論文出版年: | 2004 |
| 畢業學年度: | 92 |
| 語文別: | 英文 |
| 論文頁數: | 106 |
| 中文關鍵詞: | 四元 、預先計算 、低功率 、最長字串符合 、內容可定址記憶體 |
| 外文關鍵詞: | precomputation, LPM, CAM, low power, quadruple |
| 相關次數: | 點閱:88 下載:3 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文提出兩種新穎的電路架構以實現高速度及低功率之全平行內容可定址記憶體設計。這兩種電路架構分別為:1. 以預先計算為基礎之內容可定址記憶體設計及 2. 虛擬-互補式金氧半內容可定址記憶體設計。基於靜態的電路設計,這兩種電路架構不僅可以提升電路效能,也可以克服傳統動態電路中的各種設計問題。為了降低靜態功率及大部分的比較功率消耗,本篇論文提出數種電路設計方法以達到高速度及低功率消耗的內容可定址記憶體設計。
在提出的以預先計算為基礎之內容可定址記憶體設計中,基於預先計算機制,不但可以降低內容可定址記憶體整體的功率消耗,更可以減少內容可定址記憶體細胞所需的電晶體數。整體電路已在 TSMC 0.35 μm 金氧半製程下製作並完成驗證。以一個內容大小為 128 × 30 之內容可定址記憶體,經由實際量測結果得知本論文所提出的電路架構在 3.3 V 的電壓供應下,資料搜尋速度達每秒 100 百萬筆,且功率消耗低於 33 mW。因此,本電路適合使用在OC-768 非同步傳輸模式交換機網路中。
在提出的虛擬-互補式金氧半內容可定址記憶體設計中,基於虛擬-互補式金氧半邏輯設計,不但可以有效提升內容可定址記憶體字組的資料比較速度,而且減少內容可定址記憶體的功率消耗。除傳統的二元內容可定址記憶體,本篇論文提出一種新的四元內容可定址記憶體以提供不同的資料搜尋功能。整體電路已在 TSMC 0.35 μm 金氧半製程下製作並完成驗證。在二元內容可定址記憶體設計方面,以一個內容大小為 128 × 30 之內容可定址記憶體,經由實際量測結果得知本論文所提出的電路架構在 3.3 V的電壓供應下,資料搜尋速度達每秒 110 百萬筆,且功率消耗低於 23 mW。除此之外,由量測中亦可發現本電路在低達 1.5 V 的供應電壓下,資料搜尋速度達每秒 32 百萬筆。在四元內容可定址記憶體設計方面,以一個內容大小為 128 × 30 之內容可定址記憶體,經由實際量測結果得知本論文所提出的電路架構在 3.3 V的電壓供應下,資料搜尋速度達每秒 94 百萬筆,且功率消耗低於 29 mW。除此之外,由量測中亦可發現本電路在低達 1.4 V 的供應電壓下,資料搜尋速度達每秒 15 百萬筆。因此,本電路亦適合使用在OC-768 非同步傳輸模式交換機網路中。
基於所提出的虛擬-互補式金氧半二元內容可定址記憶體設計,本篇論文提出一個高速度及低功率消耗的最長資料比對電路設計。以一個內容大小為 128 × 32 之記憶體容量,經模擬評估結果得知此電路在 3.3 V的電壓供應下,資料搜尋速度達每秒 66 百萬筆,且功率消耗低於 150 mW。因此,本篇論文所提出的最長資料比對電路符合現今高速網路對資料比對速度的需求,例如十億位元乙太網路中。
This thesis presents two novel VLSI architectures for high-speed and low-power fully parallel CAM design. These two proposed architectures are 1) precomputation-based content-addressable memory (PB-CAM) and 2) pseudo-CMOS content-addressable memory (PC-CAM). The proposed PB-CAM and PC-CAM architectures adopt static pseudo-nMOS circuit design that not only improves circuit performance but also solves some design problems of dynamic CMOS circuit design. To reduce the static power and majority part of comparison power dissipations of pseudo-nMOS circuit design, some low power design techniques are used in the proposed PB-CAM and PC-CAM architectures to achieve high-speed and low-power features.
In the proposed PB-CAM architecture, this design is based on a precomputation approach that saves not only power consumption of the CAM system, but also reduces transistor count and operating voltage of the CAM cell. The whole design was fabricated with the TSMC 0.35 μm SPQM CMOS process. With a 128 words by 30 bits CAM size, the measurement results indicate that the proposed circuit works up to 100 MHz with the power consumption of 33 mW at 3.3 V supply voltage and works up to 30 MHz under 1.5 V supply voltage. The PB-CAM circuit is suitable for OC-768 ATM switch.
In the proposed PC-CAM architecture, this design is based on a proposed static pseudo-CMOS logic structure that not only improves comparison speed of the PC-CAM word circuit, but also reduces power dissipation of the PC-CAM system. In addition to the conventional binary CAM (BCAM) circuit design, a new quadruple CAM (QCAM) circuit design is proposed to achieve alternative search function. The proposed PC-BCAM and PC-QCAM were fabricated with the TSMC 0.35 μm single-poly quadruple-metal (SPQM) CMOS process. With a 128 words by 30 bits CAM size, the measurement results indicate that the proposed PC-BCAM and PC-QCAM circuits work up to 110 Msearches/sec and 94 Msearches/sec with the power consumption of 23 mW and 29 mW, respectively. Therefore, The PC-BCAM and PC-QCAM circuits are suitable for OC-768 ATM switch.
Based on the proposed PC-BCAM architecture, a high-speed, low-power, and low-cost LPM search engine was proposed in the thesis. With a 128 prefix data by 32 bits size, the estimation results show that the searching speed of the circuit achieves up to 66 MHz with power consumption less than 150 mW at 3.3 V. The estimated results show that the proposed LPM search engine meets multi-gigabit/s high-speed networking requirement, for example gigabit Ethernet.
[1] S. Panchanathan and M. Goldberg, “A content-addressable memory architecture for image coding using vector quantization,” IEEE Trans. Signal Processing, vol. 39, no. 9, pp. 2066 – 2078, Sept. 1991.
[2] G. Iyengar and S. Panchanathan, “Systolic array architecture for Gabor decomposition,” IEEE Trans. Circuits Syst. Video Tech., vol. 5, no. 4, pp. 355 – 359, Aug. 1995.
[3] J. Jiang and C. V. Brett, “CAM based real-time lossless image compression,” in Proc. IEEE Int. Conf. Electron., Circuits Syst., Sept. 1998, vol. 3, pp. 515 – 518.
[4] R. Y. Yang and C. Y. Lee, “High-throughput data compressor designs using content addressable memory,” in Proc. IEEE Int. Symp. Circuits Syst., June 1994, vol. 4, pp. 147 – 150.
[5] K. J. Lin, C. W. Wu, and S. Member, “A low-power CAM design for LZ data compression,” IEEE Trans. Comput., vol. 49, no. 10, pp. 1139 – 1145, Oct. 2000.
[6] S. R. Jones, I. P. Jalowiecki, S. J. Hedge, and R. M. Lea, “A 9-kbit associative memory for high-speed parallel processing applications,” IEEE J. Solid-State Circuits, vol. 23, no. 2, pp. 543 – 548, Apr. 1988.
[7] Y. C. Shin, R. Sridha, V. Demjanenko, P. W. Palumbo, and S. N. Srihari, “A special-purpose content addressable memory chip for real-time image processing,” IEEE J. Solid-State Circuits, vol. 27, no. 5, pp. 737 – 744, Apr. 1992.
[8] T. Ogura, M. Nakanishi, T. Baba, Y. Nakabayshi, and R. Kasai, “A 336-k-bit content-addressable memory for highly parallel image processing,” in Proc. IEEE Cust. Int. Circuits Conf., May 1996, pp. 13.4.1 – 13.4.4.
[9] T. Ikenaga and T. Ogura, “A DTCNN universal machine based on highly parallel 2-D cellular automata CAM2,” IEEE Trans. Circuits and Syst. –I, vol. 45, no. 5, pp. 538 – 546. May 1998.
[10] T. Ikenaga and T. Ogura, “A fully-parallel 1 Mb CAM LSI for real-time pixel-parallel image processing,” IEEE J. Solid-State Circuits, vol. 35, no. 4, pp. 536 – 544, Apr. 2000.
[11] I. Darmawan, W. T. Hartono, E. Mozef, S. Sutikno, and Kuspriyanto, “VHDL design and simulation of MAM memory for LAPCAM parallel architecture for image processing,” in Proc. IEEE Asia-Pacific Conf. Circuits Syst., Oct. 2002, vol. 1, pp. 389 – 392.
[12] A. K. Goksel, R. H. Krambeck, P. P. Thomas, M. S. Tsay, C. Y. Chen, D. G. Clemons, F. D. LaRocca, and L. P. Mai, “A content addressable memory management unit with on-chip data cache,” IEEE J. Solid-State Circuits, vol. 24, no. 3, pp. 592 – 596, June 1989.
[13] K. Nogami, T. Sakurai, K. Sawada, K. Sakaue, Y. Miyazawa, S. Tanaka, Y. Hiruta, K. Katoh, T. Takayanagi, T. Shirotori, Y. Itoh, M. Uchida, and T. Iizuka, “A 9-ns hit-delay 32-kbyte cache macro for high-speed RISC,” IEEE J. Solid-State Circuits, vol. 25, no. 1, pp. 100 – 108, Feb. 1990.
[14] K. Arimoto, M. Asakura, H. Hidaka, Y. Matsuda, and K. Fujishama, “A circuit design of intelligent cache DRAM with automatic write-back capability,” IEEE J. Solid-State Circuits, vol. 26, no. 4, pp. 560 – 565, Apr. 1991.
[15] P. F. Lin and J. B. Kuo, “A 1-V 128-kb four-way set-associative CMOS cache memory using wordline-oriented tag-compare (WLOTC) structure with the content-addressable-memory (CAM) 10-transistor tag cell,” IEEE J. Solid-State Circuits, vol. 36, no. 4, pp. 666 – 675, Apr. 2001.
[16] A. Efthymiou and J. D. Garside, “A CAM With Mixed Serial-Parallel Comparison for Use in Low Energy Caches,” IEEE Trans. VLSI Syst. vol. 12, no. 3, pp. 325 – 329, Mar. 2004.
[17] P. B. Berra, A. Ghafoor, P. A. Mitkas, S. J. Marcinkowski, and M. Guizani, “The impact of optics on data and knowledge base systems,” IEEE Trans. Knowledge Data Eng., vol. 1, no. 1, pp. 111 – 132, Mar. 1989.
[18] I. Okabayashi, H. Kotani, and H. Kadota, “A proposed structure of a 4 Mbit content-addressable and sorting memory,” in Proc. Symp. VLSI Circuits Dig. Tech. Papers, June 1990, pp. 109 – 110.
[19] T. Yamagata, M. Mihara, T. Hamamoto, T. Kobayashi, and M. Yamada, “A 288-kbit fully parallel content addressable memory using stacked capacitor cell structure,” in Proc. IEEE Cust. Int. Circuits Conf., May 1991, pp. 10.3.1 – 10.3.4.
[20] C. D. Stormon, “The Coherent Processor: an associative processor architecture and applications,” in Proc. Compcon Spring Dig. Papers, Mar. 1991, pp. 270 – 275.
[21] S. Akyokus and P. B. Berra, “Optical content addressable memories for data/knowledge base processing,” in Proc. Int. Symp. Parallel Processing, Apr. 1991, pp. 202 – 207.
[22] H. Yamada, M. Hirata, H. Nagai, and K. Takahashi, “A high-speed string-search engine,” IEEE J. Solid-State Circuits, vol. 22, no. 5, pp. 829 – 834, Oct. 1987.
[23] R. J. Marks, S. Oh, and L. E. Atlas, “Alternating projection neural networks,” IEEE Trans. Circuits Syst., vol. 36, no. 6, pp. 846 – 857, June 1989.
[24] M. Kam,R. Cheng, A. Guez, “Pattern retrieval and learning in nets of asynchronous binary threshold elements,” IEEE Trans. Circuits Syst. vol. 36, no. 3, pp.353 – 364, Mar. 1989.
[25] M. Verleysen, B. Sirletti, A. M. Vandemeulebroecke, and P. G. A. Jespers, “Neural networks for high-storage content-addressable memory: VLSI circuit and learning algorithm,” IEEE J. Solid-State Circuits, vol. 24, no. 3, pp. 562 – 569, Jun. 1989.
[26] S. V. Kartalopoulos, “An associative RAM-based CAM and its application to broadband communications systems,” IEEE Trans. Neural Networks, vol. 9, no. 5, pp.1036 – 1041, Sept. 1998.
[27] M. Meribout, T. Ogura, M. Nakanishi, “On using the CAM concept for parametric curve extraction,” IEEE Trans. Image Processing, vol. 9, no. 12, pp.2126 – 2130, Dec. 2000.
[28] L. R. Tamura, T. S. Yang, D. E. Wingard, M. A. Horowitz, and B. A. Wolley, “A 4-ns BiCMOS translation-lookaside buffer,” IEEE J. Solid-State Circuits, vol. 25, no. 5, pp. 1093 – 1101, Oct. 1990.
[29] H. Higuchi, S. Tachibana, M. Minami, and T. Nagano, “A 2-ns, 5-mW, synchronous-powered static-circuit fully associative TLB,” in Proc. IEEE Int. Solid-State Circuits Conf. Dig. Tech. Papers, June 1995, pp. 21 – 22.
[30] M. Sumita, “A 800 MHz single cycle access 32 entry fully associative TLB with a 240 ps access match circuit,” in Proc. Symp. VLSI Circuits Dig. Tech. Papers, June 2001, pp. 231 – 232.
[31] H. H. S. Lee, and C. S. Ballapuram, “Energy efficient D-TLB and data cache using semantic-aware multilateral partitioning,” in Proc. Int. Symp. Low Power Electron. Design, Aug. 2003, pp. 306 – 311.
[32] J. H. Lee, G. H. Park, S. B. Park, and S. D. Kim, “A selective filter-bank TLB system [embedded processor MMU for low power],” in Proc. Int. Symp. Low Power Electron. Design, Aug. 2003, pp. 312 – 317.
[33] M. Sato, K. Kubota, and T. Ohtsuki, “A hardware implementation of gridless routing based on content addressable memory,” in Proc. ACM/IEEE Design Automat Conf., June 1990, pp. 24 – 28.
[34] J. G. Delgado-Frias, J. Nyathi, and D. H. Summerville, “A programmable dynamic interconnection network router with hidden refresh,” IEEE Trans. Circuits Syst. –I, vol. 45, no. 11, pp. 1182 – 1190, Nov. 1998.
[35] I. Y. L. Hsiao, and C. W. Jen,” A new hardware design and FPGA implementation for Internet routing towards IP over WDM and terabit routers,” IEEE Int. Symp. Circuit Syst., May 2000, pp. 28 – 31.
[36] J. Nyathi, and J. G. Delgado-Frias, “A hybrid wave pipelined network router,” IEEE Trans. Circuits Syst. –I, vol. 49, no. 12, pp. 1764 – 1772, Dec. 2002.
[37] D. E. Taylor, J. S. Turner, J. W. Lockwood, T. S. Sproull, and D. B. Parlour, “Scalable IP lookup for Internet routers,” IEEE J. Selected Areas Commun., vol. 21, no. 4, pp. 522 – 534, May 2003.
[38] Z. Liang, J. Wu, and K. Xu, “A TCAM-based IP lookup scheme for multi-nexthop routing,” in Proc. Int. Comput. Networks Mobile Comput., Oct. 2003, pp. 128 – 135.
[39] F. Zane, N. Girija, and A. Basu, “Coolcams: power-efficient TCAMs for forwarding engines,” in Proc. IEEE INFOCOM’03, Mar. 2003, vol. 1, pp. 42 – 52.
[40] G. Bucci, A. Del Bimbo, S. Santini, “Fast filtering in bridging Ethernet and FDDI networks: a CAM approach,” in Proc. Comput. Commun., Mar. 1990, pp. 866 – 867.
[41] G. Bucci, A. Del Bimbo, and S. Santini, “Design and performance evaluation of a high throughput Ethernet/FDDI interconnect,” in Proc. Advanced Computer Technol. Reliable Syst. Applicat., May 1991, pp. 89 – 93.
[42] G. Bucci, A. Del Bimbo, and S. Santini, “Modified filtering algorithm (MFA) for backbone interconnection-hardware design and implementation issues,” in Proc. Comput. Commun., Mar. 1991, pp. 689 – 695.
[43] K. Tanno, A. Koyama, S. Mirza, T. Taketa, and S. Noguchi, “Performance evaluation of high-speed self-token ring LAN,” in Proc. Int. Symp. Parallel Architectures, Algorithms and Networks, Dec. 1994, pp. 294 – 301.
[44] D. Riddoch, K. Mansley, and S. Pope, “Distributed computing with the CLAN network,” in Proc. Local Comput. Networks, Nov. 2002, pp. 609 – 618.
[45] T. Matsuda and K. Matsuda, “A new protocol processing architecture for high-speed networks,” in Proc. IEEE Global Telecommun. Conf., Nov. 1996, pp. 798 – 803.
[46] S. V. Kartalopoulos, “Associative RAM-based CAM applicable to packet-based broadband systems,” in Proc. IEEE Global Telecommun. Conf., Nov. 1998, vol. 5, pp. 2888 – 2891.
[47] J. Fried, “A VLSI chip set for burst and fast ATM switching,” in Proc. IEEE Int. Conf. Commun., June 1989, pp. 128 – 135.
[48] M. Akata, S. Karube, T. Sakamoto, T. Saito, S. Wakasugi, S. Yoshida, H. Matsuno, and H. Shibata, “A Scheduling Content-addressable Memory For ATM Space-division Switch Control,” in Proc. IEEE Int. Solid-State Circuits Conf. Dig. Tech. Papers, Feb. 1991, pp. 244 –245.
[49] K. J. Schultz, and P. G. Gulak, “CAM-based single-chip shared buffer ATM switch,” in Proc. IEEE Int. Conf. Commun., May 1994, pp. 1190 – 1195.
[50] H. J. Chao, H. Cheng, Y. R. Jenq, and D. Jeong, “Design of a generalized priority queue manager for ATM switches,” IEEE J. Selected Areas Commun., vol. 15, no. 5, pp. 867 – 880, June 1997.
[51] R. J. Glaise, and J. M. Munie, “A low cost searching device for an ATM adapter,” in Proc. Comput. Commun. Networks, Sept. 1997, pp. 488 – 493.
[52] J. W. Heo, and D. K. Sung, “A new fast cell resequence mechanism for multipath ATM switches,” IEEE Commun. Lett., vol. 3, no. 10, pp. 297 – 299, Oct. 1999.
[53] C. S. Lin and B. D. Liu, “Design of a shared buffer management scheme for ATM switches,” in Proc. 15th Annual IEEE Int. ASIC/SOC, Sept. 2002, pp. 261-264.
[54] C. A. Zukowski and S. Y. Wang, “Use of selective precharge for low-power content-addressable memories,” in Proc. IEEE Int. Symp. Circuits Syst., June. 1997, vol. 3, pp. 1788 – 1791.
[55] G. Thirugnanam, N. Vijaykrishnan, and M. J. Irwin, “A novel low power CAM designs,” in Proc. IEEE Int. ASIC/SOC Conf., Sept. 2001, pp. 198 – 202.
[56] F. Shafai, K. J. Schultz, G. F. R. Gibson, A. G. Bluschke, and D. E. Somppi, “Fully parallel 30-MHz, 2.5Mb CAM,” IEEE J. Solid-State Circuits, vol. 33, pp. 1690 – 1696, Nov. 1998.
[57] H. Miyatake, M. Tanaka, and Y. Mori, “A design for high-speed low-power CMOS fully parallel content-addressable memory macros,” IEEE J. Solid-State Circuits, vol. 36, no. 11, pp. 956 – 968, June 2001.
[58] P. F. Lin and J. B. Kuo, “A 1-V 128-kb four-way set-associative CMOS cache memory using wordline-oriented tag-compare (WOTC) structure with the content-addressable-memory (CAM) 10-transistor tag cell,” IEEE J. Solid-State Circuits, vol. 36, no. 4, pp. 666-675, Apr. 2001.
[59] S. C. Liu, F. A. Wu, and J. B. Kuo, “A novel low-voltage content-addressable-memory (CAM) cell with a fast tag-compare capability using partially depleted (PD) SOI CMOS dynamic-threshold (DTMOS) techniques,” IEEE J. Solid-State Circuits, vol. 36, no. 4, pp. 712 – 716, Apr. 2001.
[60] J. P. Wade and C. G. Sodini, “A ternary content-addressable search engine,” IEEE J. Solid-State Circuits, vol. 24, no. 4, pp. 1003 – 1013, Aug. 1989.
[61] M. Morisue, and K. Suzuki, “A Josephson ternary associative memory cell,” IEEE Trans. Magn., vol. 25, no. 2, pp. 849 – 852, Mar. 1989.
[62] S. R. Ramirez-Chavez, “Encoding don't cares in static and dynamic content-addressable memories,” IEEE Trans. Circuits Syst. –II, vol. 39, no.8, pp. 575 – 578, Aug. 1992.
[63] G. A. Betzos, and J. V. Oldfield, “Fast skew number computation using a CAM and related hardware for an n-bit context predictor,” Electron. Lett., vol. 29, no. 14, pp. 1265 – 1267, July 1993.
[64] J. G. Delgado-Frias, A. Yu, and J. Nyathi, ” A dynamic content addressable memory using a 4-transistor cell,” in Proc. Int Workshop Design Mixed-Mode Int. Circuits Applicat., July 1999, pp. 110 – 113.
[65] V. Lines, A. Ahmed, P. Ma, S. Ma, R. McKenzie, H. S. Kim, and C. Mar, “66 MHz 2.3 M ternary dynamic content addressable memory,” in Proc. IEEE Int. Workshop Memory Technol. Design Testing, Aug. 2002, pp. 101 – 105.
[66] A. Efthymiou and J. D. Garside, “A CAM with mixed serial-parallel comparison for use in low energy caches,” IEEE Trans. VLSI Syst., vol. 12 no. 3, pp. 325 – 329, Mar. 2004.
[67] I. Arsovski, T. Chandler, and A. Sheikholeslami, “A ternary content-addressable memory (TCAM) based on 4T static storage and including a current-race sensing scheme,” IEEE J. Solid-State Circuits, vol. 38, no. 1, pp. 155 – 158, Jan. 2003.
[68] I. Arsovski, and Sheikholeslami, “A mismatch-dependent power allocation technique for match-line sensing in content-addressable memories,” IEEE J. Solid-State Circuits, vol. 38, no. 11, pp. 1958 – 1966, Nov. 2003.
[69] H. Noda, K. Inoue, H. J. Mattausch, T. Koide, and K. Arimoto, “A cost-efficient dynamic Ternary CAM in 130 nm CMOS technology with planar complementary capacitors and TSR architecture,” in Proc. Symp. VLSI Circuits Dig. Tech. Papers, June 2003, pp. 83 – 84.
[70] K. Pagiamtzis, and A. Sheikholeslami, “Pipelined match-lines and hierarchical search-lines for low-power content-addressable memories,” in Proc. IEEE Cust. Int. Circuits Conf., Sept. 2003, pp. 383 – 386.
[71] N. H. E. Weste and K. Eshraghian, Principles of CMOS VLSI Design: A Systems Perspective. Reading, MA: Addison Wesley, 1985.
[72] T. Hanyu, N. Kanagawa, and M. Kameyama, “One-transistor-cell multiple-valued CAM for a collision detection VLSI processor,” in Proc. IEEE Int. Solid-State Circuits Conf. Dig. Tech. Papers, Feb. 1996, pp. 264-265, 457.
[73] T. Miw, H. Yamada, Y. Hirota, T. Satoh, and H. Hara, “A 1-Mb 2-Tr/b nonvolatile CAM based on flash memory technologies”, IEEE J. Solid-State Circuit, vol. 31, no. 11, pp. 1601-1609, Nov. 1996.
[74] T. Hayashi, and T. Miyazaki, “High-speed table lookup engine for IPv6 longest prefix match,” in Proc. Global Telecommun. Conf., 1999, vol. 2, pp. 1576 – 1681.
[75] N. F. Huang, and S. M. Zhao, “A novel IP-routing lookup scheme and hardware architecture for multigigabit switching routers,” IEEE J. Selected Areas in Commun., vol. 17, no. 6, pp. 1093-1104, June 1999.
[76] H. Liu, “Routing table compaction in ternary CAM,” IEEE Micro, vol. 22, no. 1, pp. 58 - 64, Feb. 2002.
[77] H. Lim, J. H. Seo, and Y. J. Jung, “High speed IP address lookup architecture using hashing,” IEEE Commun. Lett., vol. 7, no. 10, pp. 502 - 504, Oct. 2003.
[78] K. I. Kitayama, and M. Murata, “Versatile optical code-based MPLS for circuit, burst, and packet switchings,” J. Lightwave Technol., vol. 21, no. 11, pp. 2753 - 2764, Nov. 2003.
[79] R. Sangireddy, and A. K. Somani, “High-speed IP routing with binary decision diagrams based hardware address lookup engine,” IEEE J. Selected Areas in Commun., vol. 21, no. 4, pp. 513 - 521, May 2003.
[80] A. J. McAuley and P. Francis, “Fast routing table lookup using CAMs,” in Proc. IEEE INFOCOM’93, Mar. 1993, vol. 3, pp. 1382 – 1391.
[81] M. Kobayashi, T. Murase, and A. Kuriyama, “A longest prefix match search engine for multi-gigabit IP processing,” in Proc. IEEE Int. Conf. Commnu., June 2000, vol. 3, pp. 18 – 22.
[82] S. G. Teresa and L. B. Linares, “A high-precision current-mode WTA-MAX circuit with multichip capability,” IEEE J. Solid-State Circuits, vol. 33, no. 2, pp. 280-286, Feb. 1998.
[83] P. F. Sum, C. S. Leung, K. S. Tam, G. H. Young, W. K. Kan, and L. W. Chan, “Analysis for a class of winner-take-all model,” IEEE Trans. Neural Networks, vol. 10, no. 1, pp. 64-71, Jan. 1999.
[84] Y. H. Tseng and J. L. Wu, “On a constant-time, low-complexity winner-take-all neural network,” IEEE Trans. Comput., vol. 44, no. 4, pp. 601-604, Apr. 1995.
[85] C. Y. Lee, S. C. Juan, and W. W. Yang, “A parallel bit-level maximum/minimum selector for digital and video signal processing,” IEEE Trans. Circuits Syst. -II, vol. 41, no. 10, pp. 693-695, Oct. 1994.
[86] C. S. Lin, J. C. Chang, and B. D. Liu, “A low-power precomputation-based fully parallel content-addressable memory,” IEEE J. Solid-State Circuits, vol. 38, no. 4, pp. 654 – 662, Apr. 2003.
[87] C. S. Lin, J. C. Chang, and B. D. Liu, “Design for low-power, low-cost, and high-reliability precomputation-based content addressable memory,” in Proc. IEEE Asia-Pacific Conf. Circuits Syst., Oct. 2002, vol. 2, pp. 319 – 324.
[88] J. C. Chang, “Design of low-power, precomputation-based fully parallel content-addressable memory,” Master thesis, National Cheng Kung University, Taiwan, June 2002.
[89] K. Roy and S. Prasad, Low-Power CMOS VLSI Circuit Design. New York, MA: John Wiley & Sons, 2000.
[90] C. S. Lin, K. H. Chen, and B. D. Liu, “Low-power and low-voltage fully parallel content-addressable memory,” in Proc. IEEE Int. Symp. Circuits Syst., May 2003, vol. 5, pp. 373 – 376.
[91] C. S. Lin, K. H. Chen, and B. D. Liu, “Low-Power Fully Parallel Quadruple Content-Addressable Memory Based on Pseudo-CMOS Logic Structure,” IEEE Trans. VLSI Syst. (Submitted)
[92] C. S. Lin, S. H. Ou, and B. D. Liu, “Design of k-WTA/sorting network using maskable WTA/MAX circuit,” in Proc. IEEE Int. Symp. VLSI-TSA, Apr. 2001, pp. 69 – 72.
[93] S. H. Ou, C. S. Lin, and B. D. Liu, “A scalable sorting architecture based on maskable WTA/MAX circuit,” in Proc. IEEE Int. Symp. Circuits Syst., May 2002, pp. 209-212.
[94] M. Alidina, J. Monteiro, S. Davadas, A. Ghost, and M. Papaefthymiou, “Precomputation-based sequential logic optimization for low power,” in Proc. IEEE Int. Conf. Comput.-Aided Design, 1994, pp. 74 – 81.
[95] P. C. Wang, C. T. Chan, S. C. Hu, Y. C. Shin, and Y. C. Chen, “Hardware-based IP routing lookup with incremental update,” in Proc. Int. Conf. Parallel Distributed Syst., Dec. 2003, pp. 183 – 188.
[96] P. Newman, G. Minshall, T. Lyon, and L. Huston, “IP switching and gigabit routers,” IEEE Commun. Mag., vol. 35, no. 1, pp. 64 – 69, Jan. 1997.
[97] S. Jean, S. H. Chung, J. Kim, and H. Yoon, “Scalable IP lookup scheme with small forwarding table for gigabit routers,” Electron. Lett., vol. 38, no. 6, pp. 298 – 300, Mar. 2002.
[98] R. D. Adams, J. Connor, G. S. Koch, and L. J. Ternullo, “A 370-MHz memory built-in self-test state machine, “ in Proc. European Design Test Conf. Mar. 1995, pp. 139 – 141.
[99] Y. S. Kang, J. C. Lee, and S. Kang, “Parallel BIST architecture for CAMs,” Electron. Lett., vol. 33, no. 1, pp. 30 – 31, Jan. 1997.
[100] Y. S. Kang, J. C. Lee, and S. Kang, “Built-in self test for content addressable memories,” in Proc. Int. Conf. Comput. Design, Oct. 1997, pp. 48 – 53.
[101] P. R. Sidorowicz, and J. A. Brzozowski, “Verification of CAM tests for input stuck-at faults,” in Proc. Int. Memory Technol. Design Testing, Aug. 1998, pp. 76 – 82.
[102] T. Chadwick, T. Gordon, R. Nadkarni, and J. Rowland, “An ASIC-embedded content addressable memory with power-saving and design for test features,” in Proc. IEEE Cust. Int. Circuits Conf., May 2001, pp. 183 – 186.
[103] X. Du, S. M. Reddy, J. Rayhawk, and W. T. Cheng, “Testing delay faults in embedded CAMs,” in Proc. Asian Test Symp., Nov. 2003, pp. 378 – 383.
[104] S. K. Lu, “Built-in self-repair techniques for embedded RAMs,” IEE Proc. -Comput. Digit. Tech., vol. 150, no. 4, pp. 201 – 208, July 2003.