| 研究生: |
劉岱恩 Liu, DaiEn |
|---|---|
| 論文名稱: |
人與人工智慧協作激發設計創造力的認知研究 Catalyzing Creativity in Human-AI Co-creation: A Cognitive Design Approach |
| 指導教授: |
鄭泰昇
Jeng, TaySheng |
| 學位類別: |
碩士 Master |
| 系所名稱: |
規劃與設計學院 - 建築學系 Department of Architecture |
| 論文出版年: | 2026 |
| 畢業學年度: | 112 |
| 語文別: | 英文 |
| 論文頁數: | 158 |
| 中文關鍵詞: | 人機共創 、設計協議分析 、生成式人工智慧 、以人為本設計 |
| 外文關鍵詞: | Human-AI Co-creation, Design Protocol Analysis, Generative AI, Human-centered design |
| ORCID: | 0009-0005-2331-7291 |
| 相關次數: | 點閱:10 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
生成式人工智慧(Gen‑AI)進入設計現場後,關鍵不再只是「能生成什麼」,而是「如何激發設計創造力」。研究初期觀察顯示,Gen‑AI 的圖像生成迭代在操作與認知功能上,與建築設計中的草圖繪製(sketching)展現出若干相似性:兩者皆為支援構想外化(externalizing ideas)、記憶保持 (memory retention)、迭代探索(iterative exploration)與精煉想法(refinement)的動態平台。然而,不同於草圖媒介的手動回饋與線性發展,Gen‑AI 所引發的創意轉向往往帶有突發性與非線性邏輯。本研究聚焦於此激發創意的「觸發機制」,探討建築概念設計初期中,設計者如何在多模態(multimodality)互動中與 Gen‑AI 共創。
為回應當前 AEC(Architecture, Engineering, Construction)領域普遍偏重輸出效率、較少探討人機互動中的設計認知歷程,本研究以回溯式口語協議(retrospective verbal protocols)與設計行為編碼進行質性分析。共招募八位具建築訓練背景之研究生/初階設計者參與(2 次前導、6 次主實驗),於相同基礎影像的建築改建提案情境下,完成三種 30 分鐘任務(文字提示限定、ControlNet 草圖/影像引導限定、自主混合)。研究透過 LiblibAI 平台整合 Stable Diffusion 與 ControlNet,以蒐集螢幕錄影、影像序列與回溯敘述資料。
在分析層面,研究改寫 Suwa 等人(1998)提出之 DCOCS 認知編碼架構,增補 Wow/Good/Fail 三種設計反應作為創意觸發點(trigger points),並搭配時間軸、影像流程圖等視覺化方式進行跨個案比較。結果發現:Fail 為最常見反應,常呈現連續失敗迴圈並驅動策略修正與模態切換,形成「修正式循環」(corrective loops);Wow 雖低頻,卻穩定標誌概念樞紐,能導致重新詮釋甚至目標重設。這類 Wow 反應可對應 Akin and Akin(1996)所稱設計歷程中的「突現性心智洞察」(Sudden Mental Insight, SMI),亦即在資訊不斷重組中突發的轉譯時刻。研究並進一步歸納出三種人機共創取向: Machine‑Led、Human‑Led、Human‑Led–Machine‑Led,顯示創意轉向並非單純仰賴工具熟練度,更受制於設計者的模態敏捷(modal agility)與對不確定性的詮釋能動性(interpretive agency)。
本研究亦延伸現有設計認知研究,從微觀互動與反應歷程中揭示創意觸發與策略轉換的關聯性。綜合而言,本研究提出一種以「觸發點(trigger)」與「策略切換」為核心的人機共創認知觀點,回應現行設計工具評估過度聚焦生成結果、而忽略設計歷程的偏向。研究亦指出,目前 Gen-AI 工具在設計節奏掌握、隨機性調整與多模態銜接上仍存在操作限制,亟需介面優化。最後,本文針對 Gen-AI 工具設計與建築教育提出具體建議,包括:建構具可解釋性的輸入—輸出對應機制、提供可調式的隨機性控制,以及促進模態間無縫轉換的工作流程,以更有效支持設計者在 AI 輔助創作中的作者性與反思迭代能力。
The emergence of Generative Artificial Intelligence (Gen-AI) in design has shifted the critical question from what can be generated to how creativity is triggered and redirected. Early observations in this study suggest that Gen-AI–driven image iteration shares several cognitive and operational parallels with sketching in architectural design—both serve as media for externalizing ideas, supporting memory retention, and enabling iterative exploration and refinement. However, unlike sketching’s linear and manually controlled flow, Gen-AI interactions often unfold in abrupt and nonlinear shifts. This research focuses on the triggering mechanisms of creative shifts, investigating how designers co-create with Gen-AI across multimodal workflows in early-stage architectural ideation.
To address a gap in the Architecture, Engineering, and Construction (AEC) domain, where Gen-AI tools are commonly evaluated by output fidelity or process efficiency rather than cognitive mechanisms, this study adopts a protocol-based analysis of design cognition. Eight participants with architectural training (two pilot and six main sessions) completed three consecutive 30-minute tasks based on a shared renovation scenario using Stable Diffusion and ControlNet via the LiblibAI platform. Each session was conducted under modality constraints: prompt-only, image/sketch-only (ControlNet-guided), and free multimodal interaction. Screen recordings, generated image sequences, and retrospective verbal protocols were collected for analysis.
The study extends Suwa et al.’s (1998) DCOCS framework by incorporating cross-cutting reaction tags: Wow, Good, and Fail, as cognitive trigger points. These were visualized through time-based and image-flow diagrams to enable cross-case comparison. Findings show that Fail reactions are most frequent, often forming iterative failure loops that trigger corrective strategies and modality switching; Wow reactions, though rare, consistently mark conceptual pivots that prompt reinterpretation or goal resets. These Wow moments resemble what Akin and Akin (1996) described as “Sudden Mental Insight” (SMI), a cognitive leap triggered by the restructuring of internal information.
Synthesizing these dynamics, the study identifies three distinct co-creation orientations: Machine-Led, Human-Led, and Hybrid (Human-Led–Machine-Led), demonstrating that creative outcomes depend less on tool fluency than on designers’ modal agility and interpretive agency under uncertainty. In doing so, this research contributes to the broader field of design cognition by illuminating the micro-level linkages between triggering events and strategic adaptation during human–AI interaction.
This thesis proposes a trigger-oriented cognitive account of human–Gen-AI co-creation, shifting the evaluation lens from output quality to the underlying generative dynamics. It also surfaces current interface limitations, such as disjointed creative rhythms, imprecise control over randomness, and restricted multimodal handoffs, that hinder smooth co-creation. Accordingly, the study offers concrete design and pedagogical recommendations: building interpretable input–output mappings to support sense-making, introducing tunable stochasticity to regulate exploratory uncertainty, and enabling seamless modality switching to preserve authorship and support reflective iteration in AI-assisted workflows.
Akin, O. (1978). HOW DO ARCHITECTS DESIGN?
Akin, Ö. (1986). Psychology of architectural design. Pion.
Akin, Ö., & Akin, C. (1996). Frames of reference in architectural design: analysing the hyperacclamation (A-h-a-!). Design studies, 17(4), 341–361. https://doi.org/10.1016/S0142-694X(96)00024-5
Akin, Ö., & Lin, C. (1995). Design protocol data and novel design decisions. Design studies, 16(2), 211–236.
ArchitectureNow Editorial Desk. (2024, 2024/03/14). How MVRDV is using AI to design its buildings. https://architecturenow.co.nz/articles/how-mvrdv-is-using-ai-to-design-its-buildings/
Baduge, S. K., Thilakarathna, S., Perera, J. S., Arashpour, M., Sharafi, P., Teodosio, B., Shringi, A., & Mendis, P. (2022). Artificial intelligence and smart vision for building and construction 4.0: Machine and deep learning methods and applications. Automation in construction, 141, 104440. https://doi.org/10.1016/j.autcon.2022.104440
Bilda, Z. (2001). Designers' cognition in traditional versus digital media during conceptual design [MFA (Master of Fine Arts), Bilkent University]. http://hdl.handle.net/11693/29213
Bilda, Z., & Demirkan, H. (2003). An insight on designers’ sketching activities in traditional versus digital media. Design studies, 24(1), 27–50. https://doi.org/10.1016/S0142-694X(02)00032-7
Castelo-Branco, R., Caetano, I., & Leitão, A. (2022). Digital representation methods: The case of algorithmic design. Frontiers of Architectural Research, 11(3), 527–541. https://doi.org/10.1016/j.foar.2021.12.008
Chang, K.-H., Cheng, C.-Y., Luo, J., Murata, S., Nourbakhsh, M., & Tsuji, Y. (2021, 2021/10). Building-GAN: Graph-Conditioned Architectural Volumetric Design Generation. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV),
Cho, K., Van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., & Bengio, Y. (2014). Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078.
Cross, N. (1997). Creativity in design: analyzing and modeling the creative leap. Leonardo, 30(4), 311–317.
Cross, N. (2001). Design cognition: Results from protocol and other empirical studies of design activity. Design Knowing and Learning: Cognition in Design Education, 79–103.
Davis, N., Hsiao, C.-P., Singh, K. Y., & Magerko, B. (2016). Co-creative drawing agent with object recognition. Proceedings of the AAAI conference on artificial intelligence and interactive digital entertainment,
Do, E. Y.-L., Gross, M. D., Neiman, B., & Zimring, C. (2000). Intentions in and relations among design drawings. Design studies, 21(5), 483–503. https://doi.org/10.1016/S0142-694X(00)00020-X
Dorst, K., & Cross, N. (2001). Creativity in the design process: co-evolution of problem–solution. Design studies, 22(5), 425–437. https://doi.org/10.1016/S0142-694X(01)00009-6
Eapen, T., Finkenstadt, D. J., Folk, J., & Venkataswamy, L. (2023). How generative AI can augment human creativity. Harvard Business Review, 101(4), 56–64.
Eastman, C. (2001). Chapter 8 - New Directions in Design Cognition: Studies of Representation and Recall. In C. M. Eastman, W. M. McCracken, & W. C. Newstetter (Eds.), Design Knowing and Learning: Cognition in Design Education (pp. 147–198). Elsevier Science. https://doi.org/10.1016/B978-008043868-9/50008-5
Eastman, C. M. (1968). Explorations of the cognitive processes in design.
Eastman, C. M. (1970). On the analysis of intuitive design processes. Emerging Methods in Environtmental Design and Planning, 21–37.
Ericsson, K. A., & Simon, H. A. (1993). Protocol analysis: Verbal reports as data Cambridge. EUA Massachusetts Inst. Technol.
Fan, C. F. (2009). 設計早期階段的知覺場現象與創造力的關係 [The relationship between perceptual field phenomena and creativity in early design stages] [Master's thesis, National Chiao Tung University]. Taiwan. https://hdl.handle.net/11296/3ffx53
Finke, R. A., Ward, T. B., & Smith, S. M. (1992). Creative Cognition: Theory, Research, and Applications. The MIT Press. https://doi.org/10.7551/mitpress/7722.001.0001
Gaver, W. W., Beaver, J., & Benford, S. (2003). Ambiguity as a resource for design Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, Ft. Lauderdale, Florida, USA. https://doi.org/10.1145/642611.642653
Goel, V. (1995). Sketches of Thought. The MIT Press. https://doi.org/10.7551/mitpress/6270.001.0001
Goel, V., & Pirolli, P. (1992). The structure of design problem spaces. Cognitive science, 16(3), 395–429.
Goldschmidt, G. (1990). Linkography: assessing design productivity. Cyberbetics and System'90, Proceedings of the Tenth European Meeting on Cybernetics and Systems Research, Singapore,
Goldschmidt, G. (1991). The dialectics of sketching. Creativity Research Journal, 4(2), 123–143. https://doi.org/10.1080/10400419109534381
Goldschmidt, G. (1994). On visual design thinking: the vis kids of architecture. Design studies, 15(2), 158–174. https://doi.org/10.1016/0142-694X(94)90022-1
Goldschmidt, G. (2003). The backtalk of self-generated sketches. Design issues, 19(1), 72–88.
Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., & Bengio, Y. (2020). Generative adversarial networks. Communications of the ACM, 63(11), 139–144.
Guilford, J. P. (1967). The nature of human intelligence. McGraw-Hill.
Gürel, A. (2018). Cognitive comparison of using hand sketching and parametric tools in the conceptual design phase [MS (Master of Science), Bilkent University]. http://hdl.handle.net/11693/49002
Howard-Jones, P. A. (1998). The Variation of Ideational Productivity over Short Timescales and the Influence of an Instructional Strategy to Drfocus Attention. Proceedings of the Twentieth Annual Conference of the Cognitive Science Society,
Huang, Y. H. (2005). 從專家、風格到創造力的形成過程之認知行為探討 [Cognitive processes from expertise and style to creativity formation] [Doctoral dissertation, National Chiao Tung University]. Taiwan. https://hdl.handle.net/11296/abn3e8
Jansson, D. G., & Smith, S. M. (1991). Design fixation. Design studies, 12(1), 3–11. https://doi.org/10.1016/0142-694X(91)90003-F
Kapur, M. (2008). Productive Failure. Cognition and Instruction, 26(3), 379–424. https://doi.org/10.1080/07370000802212669
Kavakli, M., & Gero, J. S. (2001). Sketching as mental imagery processing. Design studies, 22(4), 347–364.
Kavakli, M., Suwa, M., Gero, J., & Purcell, T. (1999). Sketching interpretation in novice and expert designers. Visual and Spatial Reasoning in Design,
Kirsh, D. (1995). The intelligent use of space. Artificial intelligence, 73(1), 31–68. https://doi.org/10.1016/0004-3702(94)00017-U
Kounios, J., & Beeman, M. (2009). The Aha! moment: The cognitive neuroscience of insight. Current directions in psychological science, 18(4), 210–216.
Kudless, A. (2024). Stop Making Sense: Complexity and Contradiction in AI and Architecture. Architectural Design, 94(3), 86–93. https://doi.org/10.1002/ad.3059
Lee, J. H., Gu, N., & Ostwald, M. J. (2015). Creativity and parametric design? Comparing designer's cognitive approaches with assessed levels of creativity. International Journal of Design Creativity and Innovation, 3(2), 78–94.
Li, C., Zhang, T., Du, X., Zhang, Y., & Xie, H. (2024). Generative AI Models for Different Steps in Architectural Design: A Literature Review. In.
Li, J., Luo, Y., Lu, S., Zhang, J., Wang, J., Guo, R., & Wang, S. (2024). ChatDesign: Bootstrapping generative floor plan design with pre-trained large language models. Proceedings of the 29th International Conference of the Association for Computer Aided Architectural Design Research in Asia (CAADRIA), Hongkong,
Li, P., Li, B., & Li, Z. (2024). Sketch-to-architecture: Generative ai-aided architectural design. arXiv preprint arXiv:2403.20186.
Lloyd, P., Lawson, B., & Scott, P. (1995). Can concurrent verbalization reveal design cognition? Design studies, 16(2), 237–259.
Maher, M. L., Poon, J., & Boulanger, S. (1996). Formalising Design Exploration as Co-Evolution. In J. S. Gero & F. Sudweeks (Eds.), Advances in Formal Design Methods for CAD: Proceedings of the IFIP WG5.2 Workshop on Formal Design Methods for Computer-Aided Design, June 1995 (pp. 3–30). Springer US. https://doi.org/10.1007/978-0-387-34925-1_1
Mathieu, J. E., Heffner, T. S., Goodwin, G. F., Salas, E., & Cannon-Bowers, J. A. (2000). The influence of shared mental models on team process and performance. Journal of applied psychology, 85(2), 273.
Mohammad, A. S. (2019). Hybrid elevations using GAN networks [Master's thesis, The University of North Carolina at Charlotte].
Murphy, J. (2025/06/12). Muddy robots: Ronald Rael speaks with AN about combining ancestral intelligence with artificial intelligence. https://www.archpaper.com/2025/06/ronald-rael-combining-ancestral-intelligence-artificial-intelligence/
Nauata, N., Chang, K.-H., Cheng, C.-Y., Mori, G., & Furukawa, Y. (2020). House-GAN: Relational Generative Adversarial Networks for Graph-constrained House Layout Generation. CoRR, abs/2003.06988.
Norman, D. A. (1993). Things that make us smart: defending human attributes in the age of the machine. Addison-Wesley Longman Publishing Co., Inc.
Ploennigs, J., & Berger, M. (2022). AI Art in Architecture. arXiv.org. https://doi.org/10.48550/arxiv.2212.09399
Purcell, A., Gero, J., Edwards, H., & Matka, E. (1994). Design fixation and intelligent design aids. Artificial Intelligence in Design’94,
Reitman, W. R. (1964). Heuristic Decision Procedures, Open Contraints [sic], and the Structure of Ill-defined Problems. Graduate School of Industrial Administration, Carnegie Institute of Technology.
Rittel, H. W., & Webber, M. M. (1973). Dilemmas in a general theory of planning. Policy sciences, 4(2), 155–169.
Salem, A. A., Mansour, Y., & Eldaly, H. (2024). Generative vs. Non-Generative AI: Analyzing the Effects of AI on the Architectural Design Process. Engineering Research Journal (Shoubra), 53(2), 119–128. https://doi.org/10.21608/erjsh.2024.255372.1256
Scheutz, M., DeLoach, S. A., & Adams, J. A. (2017). A framework for developing and using shared mental models in human-agent teams. Journal of Cognitive Engineering and Decision Making, 11(3), 203–224.
Schön, D. A. (1983). The reflective practitioner. Basic Books.
Schön, D. A. (1992). Designing as reflective conversation with the materials of a design situation. Research in Engineering Design, 3(3), 131–147. https://doi.org/10.1007/BF01580516
Schön, D. A., & Wiggins, G. (1992). Kinds of seeing and their functions in designing. Design studies, 13(2), 135–156. https://doi.org/10.1016/0142-694X(92)90268-F
Seeber, I., Bittner, E., Briggs, R. O., de Vreede, T., de Vreede, G.-J., Elkins, A., Maier, R., Merz, A. B., Oeste-Reiß, S., Randrup, N., Schwabe, G., & Söllner, M. (2020). Machines as teammates: A research agenda on AI in team collaboration. Information & Management, 57(2), 103174. https://doi.org/10.1016/j.im.2019.103174
Shi, M., Seo, J., Cha, S. H., Xiao, B., & Chi, H.-L. (2024). Generative AI-powered architectural exterior conceptual design based on the design intent. Journal of Computational Design and Engineering, 11(5), 125–142. https://doi.org/10.1093/jcde/qwae077
Shi, Y., Gao, T., Jiao, X., & Cao, N. (2023). Understanding design collaboration between designers and artificial intelligence: a systematic literature review. Proceedings of the ACM on Human-Computer Interaction, 7(CSCW2), 1–35.
Simon, H. A. (1973). The structure of ill structured problems. Artificial intelligence, 4(3-4), 181–201.
Sutskever, I., Vinyals, O., & Le, Q. V. (2014). Sequence to sequence learning with neural networks. Advances in neural information processing systems, 27.
Suwa, M., Gero, J., & Purcell, T. (2000). Unexpected discoveries and S-invention of design requirements: important vehicles for a design process. Design studies, 21(6), 539–567.
Suwa, M., Purcell, T., & Gero, J. (1998). Macroscopic analysis of design processes based on a scheme for coding designers' cognitive actions. Design studies, 19(4), 455–483. https://doi.org/10.1016/S0142-694X(98)00016-7
Suwa, M., & Tversky, B. (1997). What do architects and students perceive in their design sketches? A protocol analysis. Design studies, 18(4), 385–403.
Tang, H.-H., & Gero, J. S. (2000). Content-oriented coding scheme for protocol analysis and computer-aided architectural design. CAADRIA2000, CASA, Singapore, 265–275.
Tversky, B. (1999). What Does Drawing Reveal about Thinking? Visual and Spatial Reasoning in Design.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30.
Verstijnen, I. M., van Leeuwen, C., Goldschmidt, G., Hamel, R., & Hennessey, J. (1998). Sketching and creative discovery. Design studies, 19(4), 519–546.
Wallas, G. (1926). The art of thought. Harcourt, Brace.
Wang, A., Dong, J., Lee, L.-H., Shen, J., & Hui, P. (2024). A Survey on Deep Learning for Design and Generation of Virtual Architecture. ACM Comput. Surv., 57(2), Article 29. https://doi.org/10.1145/3688569
Wang, S.-Y., Su, W.-C., Chen, S., Tsai, C.-Y., Misztal, M., Cheng, K. M., Lin, A., Chen, Y., & Chen, M. Y. (2024). RoomDreaming: Generative-AI Approach to Facilitating Iterative, Preliminary Interior Design Exploration Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems, Honolulu, HI, USA. https://doi-org.tudelft.idm.oclc.org/10.1145/3613904.3642901
Weisberg, D. E. (2008). The Engineering Design Revolution: The People, Companies and Computer Systems That Changed Forever the Practice of Engineering.
Weisberg, R. W. (1993). Creativity : beyond the myth of genius. W.H. Freeman.
Weisz, J. D., Muller, M., He, J., & Houde, S. (2023). Toward General Design Principles for Generative AI Applications. arXiv.org. https://doi.org/10.48550/arxiv.2301.05578
Zhang, C., Wang, W., Pangaro, P., Martelaro, N., & Byrne, D. (2023). Generative Image AI Using Design Sketches as input: Opportunities and Challenges Proceedings of the 15th Conference on Creativity and Cognition, Virtual Event, USA. https://doi-org.tudelft.idm.oclc.org/10.1145/3591196.3596820
Zhang, L., Rao, A., & Agrawala, M. (2023). Adding conditional control to text-to-image diffusion models. Proceedings of the IEEE/CVF international conference on computer vision,
Zhou, E., & Lee, D. (2024). Generative artificial intelligence, human creativity, and art. PNAS Nexus, 3(3). https://doi.org/10.1093/pnasnexus/pgae052