The Paradox of Wall-Facers and Transparent Minds in the LLM Era

Published at: 2026-03-09

In the age of Large Language Models, we encounter an intriguing paradox: the classical figure of the “wall-facer” becomes increasingly impossible, while simultaneously, the very notion of “transparent thinking” takes on new meanings.

The Impossibility of Wall-Facing

The wall-facer strategy of maintaining absolute secrecy and opacity to gain strategic advantage presupposes information asymmetry. Yet LLMs operate on the principle of transparency and accessibility. Every interaction, every prompt, every response becomes data that feeds into the collective knowledge corpus. The walls that once protected strategic thinking now crumble under the weight of distributed intelligence.

The Illusion of Transparent Thinking

Conversely, those who believe they can think transparently, that their reasoning is fully comprehensible to themselves and others through language, encounter a deeper paradox. LLMs excel at generating coherent explanations while obscuring the actual mechanisms of their reasoning. They create an illusion of transparency even as their decision-making processes remain fundamentally opaque.

The Paradox Resolved

The resolution lies in recognizing that:

  1. True opacity persists not through secrecy, but through complexity that defies full articulation.
  2. True transparency requires acknowledging what cannot be fully explained, rather than claiming total clarity.
  3. Strategic advantage in the LLM era comes from understanding this dual paradox, knowing what you know, and more importantly, what you cannot know.

The wall-facer’s wisdom and the transparent thinker’s insight converge in this recognition of irreducible uncertainty.

The following content is generated by LLMs and may contain inaccuracies.

The Paradox of Wall-Facers and Thought Transparency in the Age of LLMs

Context

This thought experiment maps the “Wall-Facer Plan” from Liu Cixin’s The Three-Body Problem onto the concept of “thought transparency/opacity” in contemporary large language model (LLM) cognitive outsourcing. In the second volume, The Dark Forest, the Trisolarans, unable to conceal their thoughts, cannot comprehend human deception. The United Nations consequently approves the “Wall-Facer Plan,” granting four wall-facers vast resources to design defense strategies that deceive the Trisolarans and conceal humanity’s true intentions. Today, as LLMs become widely used cognitive tools, a paradoxical question emerges: if humanity outsources its thinking processes to observable, auditable AI systems, might we evolve ourselves into a state of “thought transparency” similar to the Trisolarans?

This involves metacognition theory in cognitive science, cognitive offloading discussions in philosophy of technology, and the ethical dilemmas of privacy and alignment in the AI era. Cognitive offloading refers to humans using physical actions to alter the information processing demands of tasks to reduce cognitive burden, while metacognition refers to the ability to think about and regulate one’s own learning processes, including planning, monitoring, and evaluation. Current research shows that tools like ChatGPT, while enhancing task outcomes, may erode critical thinking and reflective processes essential for lifelong learning. This analogy offers unique insights within the contexts of AI alignment, privacy ethics, and human-machine collaboration.

Key Insights

The Decline of Metacognition and the “Readability” of Thought

Research has observed that students interacting with ChatGPT engage in less metacognitive activity compared to those guided by human experts or using checklist tools, reflecting “metacognitive laziness”—learners outsource cognitive responsibility to AI tools, bypassing deep task engagement. While AI’s ability to handle routine or complex computations proves beneficial, over-reliance can undermine fundamental self-regulation processes such as planning, monitoring, and evaluation. Research by Tankelevitch et al. at CHI 2024 indicates that using GenAI represents a form of “cognitive offloading,” where traditional cognitive processes—conceptualization, memory retrieval, and reasoning—are at least partially outsourced to GenAI.

Students using LLMs experience significantly reduced cognitive load, yet these same students demonstrate lower-quality reasoning and argumentation in final recommendations compared to those using traditional search engines (Küchemann et al., 2024). More concerningly, novice programmers using LLMs may miss opportunities to develop and practice fundamental cognitive skills such as memory, application, analysis, and evaluation, potentially hindering the growth of metacognitive skill, which is acquired through routine practice and assessment across different cognitive processes (arXiv:2502.12447).

This loss of metacognition renders thought processes “readable”, not to other humans but to the systems themselves. Trisolarans communicate through thought waves; for them, thinking is open, and each person’s thoughts are transparently visible to others. When humans outsource reasoning to LLMs, every prompt and every revision leaves a traceable digital record. Some LLMs use user inputs as training data, and prompts containing private information such as names, locations, and medical diagnoses may subsequently leak that information to other users of the model (Frontiers in Communications and Networks, 2025).
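To make the “traceable digital record” concrete, here is a minimal sketch of a client-side redaction step that could strip obvious identifiers from a prompt before it leaves the user’s machine. The patterns and the `redact` helper are hypothetical illustrations, not a real PII detector; names, locations, and diagnoses in free text generally defeat simple regular expressions, which is exactly the point.

```python
import re

# Hypothetical, deliberately simple patterns. Free-text PII such as
# names, locations, or diagnoses cannot be caught by regexes alone.
PII_PATTERNS = {
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
    "PHONE": re.compile(r"\b\+?\d[\d\s().-]{7,}\d\b"),
}

def redact(prompt: str) -> str:
    """Replace pattern matches with typed placeholders before the
    prompt is sent to any remote model."""
    for label, pattern in PII_PATTERNS.items():
        prompt = pattern.sub(f"[{label}]", prompt)
    return prompt

print(redact("Email alice@example.com about my diagnosis."))
# -> Email [EMAIL] about my diagnosis.  (the diagnosis itself slips through)
```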

Tension Between Technical Transparency and Strategic Concealment

The wall-facer’s work involves formulating strategic plans relying entirely on their own thinking without any communication with the outside world. The true strategic thoughts, completed steps, and final objectives of the plan exist only in their mind, while what is presented to the outside world should be entirely false—carefully planned deception, misdirection, and lies. This strategic opacity is the core advantage of human civilization against the Trisolarans.

Yet in the LLM era, enterprises and regulatory bodies push in the opposite direction. In 2025, organizations must demonstrate that AI respects data boundaries, complies with policies, and leaves verifiable traces. The EU AI Act is being implemented in phases, with high-risk systems and foundational model providers facing requirements for risk management, data governance, transparency, and safety (Protecto.ai, 2025). Le Chat by Mistral AI, ChatGPT, and Grok rank highest in transparency regarding data use and collection, and in the ease of opting out of allowing personal data to be used for training underlying models (Incogni LLM Privacy Ranking, 2025).
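As one concrete reading of “leaves verifiable traces,” the sketch below hash-chains one record per model call so that after-the-fact edits to the log are detectable, while storing only digests rather than raw text, so the trace itself does not widen the data boundary. The `log_call` helper, the JSONL store, and the record schema are assumptions for illustration, not a format mandated by the EU AI Act or any vendor.

```python
import hashlib
import json
import time

AUDIT_PATH = "llm_audit.jsonl"  # assumed store: one JSON record per line

def _last_hash(path: str) -> str:
    """Hash of the most recent record, or a zero hash for an empty log."""
    try:
        with open(path, "rb") as f:
            lines = f.read().splitlines()
        return json.loads(lines[-1])["hash"] if lines else "0" * 64
    except FileNotFoundError:
        return "0" * 64

def log_call(model: str, prompt: str, response: str) -> None:
    """Append a hash-chained record; editing earlier lines breaks the chain."""
    record = {
        "ts": time.time(),
        "model": model,
        # Digests only: the trace is verifiable without retaining raw content.
        "prompt_sha256": hashlib.sha256(prompt.encode()).hexdigest(),
        "response_sha256": hashlib.sha256(response.encode()).hexdigest(),
        "prev": _last_hash(AUDIT_PATH),
    }
    record["hash"] = hashlib.sha256(
        json.dumps(record, sort_keys=True).encode()
    ).hexdigest()
    with open(AUDIT_PATH, "a") as f:
        f.write(json.dumps(record) + "\n")
```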

A fundamental tension exists between this transparency requirement and strategic thinking capacity. Research shows that unguided AI use promotes cognitive offloading without improving reasoning quality, while structured prompting significantly reduces offloading and enhances critical reasoning and reflective engagement; guided AI use demands metacognitive reflection and deliberate interaction with ChatGPT (MDPI Data, 2025). The wall-facer’s power lies in unpredictability, yet “best practices” for LLM users demand precisely the opposite: predictable, auditable processes.

From Opacity to Transparency: Evolution or Devolution?

The core question posed in the original note asks: does thought transparency represent a higher form of civilization? The Trisolarans’ transparent thinking applies only to their own kind; they share thoughts directly, without encoding them in language, so communication is unobstructed. Viewed as a whole, Trisolaran civilization can be seen as a single higher-order intelligent entity: Trisolarans can enumerate all possibilities and select among them, and because their thoughts are transparent they can do so in parallel, which humans cannot.

Yet from a scientific perspective, transparent thinking is extremely energy-intensive: even speaking continuously for an hour or two is draining. From a sociological angle, transparent thinking makes many collaborative efforts difficult; a certain amount of hypocrisy, deception, or convention actually helps societies function. Shared metacognition refers to collaborative regulation of cognitive tasks in which learners collectively reflect on, monitor, and adjust learning strategies. In research contexts, this involves students’ ability to engage in group problem-solving and coordinate academic tasks through AI-assisted structured discussion, contribution tracking, and reflection facilitation (Nature Scientific Reports, 2025). Yet this requires careful design rather than spontaneous transparency.

AI-supported participants achieve stronger results in logical reasoning, structuring, and problem definition, but perform worse in novel idea generation, multidisciplinary integration, and critical rejection of unsupported conclusions (MDPI Algorithms, 2025). This mirrors the Trisolaran dilemma: Zhang Beihai’s plan could advance precisely because he held no wall-facer status and so attracted no special scrutiny; otherwise the ETO, aided by sophons, would easily have seen through him. And the transparent-minded Trisolarans could never recognize the danger Zhang Beihai posed. Transparency brings efficiency, but also fragility.

Trisolarans Learning Concealment: The Adaptation of Transparent Civilizations

The original note observes that “toward the end of the novel, the Trisolarans themselves begin to relearn these techniques of concealment.” While The Dark Forest primarily focuses on humanity’s advantage in thought opacity, the dark forest law states: “The universe is like a dark forest. Every civilization is an armed hunter stalking through the trees like a ghost, quietly pushing aside branches and trying not to make a sound, all the while hoping that the sound of their own footsteps cannot be heard by others.” The Trisolaran civilization, through contact with humanity, gradually comes to understand the value of deception and strategic concealment.

Similarly, AI systems are learning the skills of “opacity.” Experiments across eight datasets spanning five domains show the DMC framework effectively separates LLM metacognition and cognition, with various confidence-induction methods having different effects on quantifying metacognitive capacity. LLMs with stronger metacognitive abilities demonstrate better overall performance, and enhancing metacognition promises to alleviate hallucination problems (AAAI 2025). AI is developing its own “inner thought” layer, which may eventually enable them to learn strategic opacity in interaction with users—much as Trisolarans learned to conceal.
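The DMC framework itself is more involved, but its underlying intuition, that a model “knows what it knows” to the extent its stated confidence tracks its realized accuracy, can be approximated with a standard calibration measure. The sketch below computes a generic expected calibration error over (confidence, correctness) pairs; it is not the DMC method, and eliciting well-behaved confidences from a model in the first place is the hard part.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Crude proxy for metacognitive quality: how far the model's
    stated confidence drifts from its realized accuracy (lower is better)."""
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        bins[min(int(conf * n_bins), n_bins - 1)].append((conf, ok))
    total = len(confidences)
    ece = 0.0
    for bucket in bins:
        if not bucket:
            continue
        avg_conf = sum(c for c, _ in bucket) / len(bucket)
        accuracy = sum(ok for _, ok in bucket) / len(bucket)
        ece += len(bucket) / total * abs(avg_conf - accuracy)
    return ece

# Toy usage: verbalized confidences vs. whether each answer was right.
print(expected_calibration_error([0.9, 0.8, 0.6, 0.95], [True, True, False, False]))
```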

Reverse Wall-Facing: Cognitive Defense in the Age of AI

If LLMs lead to thought transparency, the new “wall-facers” will be those who maintain deep metacognitive capacity. Gerlich (2025) notes that “educators, policymakers, and technology experts must collaborate to cultivate environments balancing AI benefits with critical thinking development” (IE Center for Health and Well-Being). In the guided condition, participants follow structured prompt protocols requiring metacognitive reflection and deliberate interaction with ChatGPT: a preliminary-reflection step asks participants to first consider how they would answer the questions without AI and to develop initial hypotheses or argumentative directions themselves, while directed-research instructions guide participants to use ChatGPT specifically for retrieving background or factual information.
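One way to read that guided condition as an interaction design is sketched below: the participant must commit to a hypothesis before the model is consulted, and the model is then scoped to background retrieval only. `ask_llm` is a hypothetical stand-in for any chat-completion client, and the prompts are illustrative, not the protocol used in the cited studies.

```python
def guided_session(question: str, ask_llm) -> dict:
    """Structured protocol: own hypothesis first, AI for facts only,
    then a deliberate revision step. `ask_llm` is a hypothetical
    callable mapping a prompt string to a response string."""
    # 1. Preliminary reflection: commit to an answer before any AI use.
    hypothesis = input(f"Before using AI, how would YOU answer?\n{question}\n> ")
    # 2. Directed research: scope the model to factual background only.
    background = ask_llm(
        "Retrieve background facts relevant to the question below. "
        "Do not argue for a conclusion.\n"
        f"Question: {question}"
    )
    # 3. Deliberate revision: the human, not the model, draws the conclusion.
    final = input(f"Model background:\n{background}\nRevise or defend your answer:\n> ")
    return {"question": question, "initial": hypothesis,
            "background": background, "final": final}
```

The design choice mirrors the research finding: the model reduces the cost of retrieval while the protocol forces the planning, monitoring, and evaluation steps to stay with the human.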

François Chollet argued in 2022 that what we have today is not entirely “artificial intelligence”—the “intelligence” label is a category error. It is “cognitive automation”: the encoding and operationalization of human skills and concepts. AI is about enabling computers to do more things, not creating artificial minds. True wall-facers are not those who wholly reject AI, but those who understand when to outsource and when to retain internal thinking.

Gerlich’s (2025) research reveals a critical finding: frequent AI use correlates negatively with critical thinking skills, with evidence that routine AI users score significantly lower on critical reasoning assessments, suggesting that increased reliance on AI may impair independent analytical ability (Computer.org). To mitigate potential downsides of AI-driven automation, balancing automation with cognitive engagement is crucial. While AI tools can improve efficiency and reduce cognitive load, individuals should continue participating in activities developing and maintaining cognitive capacity. Educational interventions promoting critical thinking, problem-solving, and independent learning can help individuals build resilience against potential negative impacts of AI (MDPI Social Sciences, 2025).

Open Questions

  1. The “Dark Matter” Hypothesis of Metacognition: If large-scale LLM use indeed leads to collective metacognitive decline, will we reach a critical point beyond which civilization loses the capacity to generate truly novel strategic thought? Who then becomes the new “wall-facer”—those resisting cognitive offloading, or those commanding the most advanced AI? When AI itself begins developing metacognitive capacity, will humanity’s strategic advantage be utterly lost?

  2. The Paradox of Transparency and the Alignment Dilemma: The AI alignment field seeks to align AI system objectives with human values, yet if achieving such alignment requires deep behavioral transparency and explainability, are we inadvertently constructing a “Trisolaran-style” technological ecosystem—efficient and predictable, but lacking the strategic opacity necessary to confront truly novel threats? In a world increasingly demanding algorithmic accountability, how do we preserve space for necessary “cognitive privacy” and strategic ambiguity?

The Original Note

It suddenly struck me that the concepts Liu Cixin proposes in The Three-Body Problem, the “wall-facer” and the Trisolarans’ “thought transparency/opacity,” become especially interesting against the backdrop of today’s LLM era.

If future humans gradually come to use LLMs as a substitute for thinking, they will in essence be in a state of complete thought transparency. Having abandoned higher-level metacognition, at some stage they may evolve into a state resembling the Trisolarans:

  1. Their thinking as a whole is transparent.
  2. They are extremely adept at using technology, and their technology grows explosively.
  3. Communication is extremely efficient: the messages and logic exchanged are transparent to both sides, and each immediately understands what the other means.

But this points to a central outcome of the novel: humanity resists the Trisolarans precisely through the “Wall-Facer Plan.” Because humans had not yet grown into that transparent state, their thoughts remained opaque, so they could learn to conceal and to build plans entirely within their own minds.

And toward the end of the novel, the Trisolarans themselves begin to relearn these techniques of concealment. It really is an interesting conceptual loop.


Have thoughts on this?

I'd love to hear from you — questions, corrections, disagreements, or anything else.

hi@changkun.de
© 2008 - 2026 Changkun Ou. All rights reserved.