Wednesday, January 21, 2026

 AI’s “Memory Crisis”: Why Don’t Large Models Remember What You Said?

AI Memory Crisis

AI is getting smarter and smarter, yet its “memory” can still be maddening.

Have you ever had this experience? You’re halfway through a conversation with ChatGPT, and it suddenly “forgets” what you just said. Or you provide detailed background at the start, only for the AI to ignore it completely in later replies.

This isn’t a bug. It’s AI’s “original sin”: the context management problem.

In 2026, this issue is standing at a crossroads of change. Let’s look at how the world’s top AI labs are trying to crack it.


01 The Illusion of a Million-Token Window

Here’s a counterintuitive fact:

A bigger context window makes AI smarter? Wrong.

Today’s models are competing on “window size”—Gemini supports one million tokens, and Llama 4 has pushed beyond ten million. But that’s capacity, not capability.

Research shows that a model’s attention to context can follow a strange U-shaped curve:

  • Information at the beginning: remembered clearly

  • Information at the end: key points are still captured

  • Information in the middle: sorry—“forgotten”

This is the well-known “Lost in the Middle” phenomenon.

Worse still, as conversations grow longer, two fatal problems emerge:

  • Context Rot: the longer the dialogue, the worse the answer quality

  • Attention Dilution: crucial instructions get “drowned” in oceans of background detail

It’s like asking someone to memorize an entire encyclopedia, then quizzing them on the third paragraph of page 327— even if they can see everything, it’s hard to pinpoint exactly what matters.


02 Breaking the Deadlock

To deal with this trap, the industry is pushing forward on four fronts.

Strategy 1: Compress, Don’t Pile On

Core idea: instead of stuffing in everything, keep only what matters.

Anthropic’s Claude uses an “intelligent compression” approach:

  • Summarize conversation history—shrink 10,000 words into 500

  • Preserve key facts and delete redundant descriptions

  • Use “soft compression” to encode information into dense vectors

It’s like condensing a book into study notes—less text, same essence.

Strategy 2: Notes—AI’s “Second Brain”

Core idea: let the AI take notes for itself.

This is one of Anthropic’s latest practices:

  • An agent proactively records important information into a “notebook” while working

  • Notes live outside the context window, so they don’t consume precious “working memory”

  • When needed, retrieval mechanisms pull them back instantly

The benefits are obvious:

  • Memory can be persistent, instead of disappearing when the chat ends

  • Enables cross-task progress tracking

  • Prevents context-window overflow

Strategy 3: Just-in-Time Loading, Retrieve on Demand

Core idea: don’t preload—fetch only when needed.

The old approach dumps all relevant documents into the context at once. The new approach:

  • Keep only lightweight identifiers (file paths, URLs, database IDs)

  • Dynamically load required data at runtime via tool calls

It’s like a librarian—they don’t pile every book onto the table; they just know where it is and fetch it when asked.

Strategy 4: Hybrid Memory, Each to Its Own Job

Core idea: different kinds of memory require different techniques.

State-of-the-art systems are building hybrid memory architectures:

Memory TypeTechniqueBest For
Vector memoryEmbeddingsSemantic retrieval
Graph memoryKnowledge graphsRelational reasoning
Relational memorySQLStructured queries
Key–value memoryRedisFast, exact lookups

This mirrors how the brain is compartmentalized— the hippocampus handles short-term memory, the cortex stores long-term knowledge; different roles, working together.


03 Context Engineering: An Underrated New Paradigm

If you’re only focused on “Prompt Engineering,” you may already be behind.

The industry is quietly shifting toward a bigger concept: Context Engineering.

Anthropic offers a precise definition:

Context engineering is the art of curating and maintaining the optimal set of tokens available to an LLM at runtime.

Put simply: it’s not “give the AI more information,” but “give the AI the right information.”

Three golden rules:

  1. Quality over quantity: provide the smallest high-signal token set; avoid attention dilution

  2. Dynamic organization: load on demand, truncate intelligently, manage in layers

  3. Completeness: good context should include user metadata, dialogue history, tool definitions, retrieval results, and more

It’s an emerging “art”—and likely a core competency for future AI engineers.


04 The Future: Where Is AI Memory Headed?

Looking ahead, several directions are worth watching:

  • Adaptive context management: AI automatically adjusts memory strategies by task

  • Causal-chain preservation: when truncating context, preserve complete reasoning chains

  • Privacy-preserving memory: distributed storage and a user-controlled “right to be forgotten”

  • Multimodal fusion: unified memory across text, images, and video

Most exciting of all: future AI agents may truly gain the ability to “learn”—not just retrieve, but accumulate wisdom through experience the way humans do.


Closing

Context management may sound like a technical detail, but it’s a key step on the path to real intelligence.

From “bigger windows” to “smarter management,” from “passive intake” to “active memory,” AI is learning how to remember better.

Maybe one day you’ll find that talking to AI feels like talking to a friend who genuinely understands you—who remembers your preferences, your habits, and your whole story.

That day may be closer than we think.

Saturday, January 10, 2026

The Rise of a New Generation of Productivity Tools, Seen Through AI Programming

 AI programming has recently acquired a name that feels very much of its time: vibe coding. You no longer type code line by line. Instead, you tell the machine your intent, your goal, your sense of what you want, and it translates those vague ideas into executable instructions.

Its popularity is not driven by novelty, but by usefulness.

More and more companies are already using AI to assist—or even lead—their internal development workflows. Writing CRUD logic, wiring APIs, adding tests, fixing bugs, refactoring legacy code—tasks that once consumed large amounts of engineering time can now be handled with a few natural-language descriptions. This is not a “future concept.” It is happening now.


I. The First Problem It Solves: Humans Are Too Slow

Traditional programming is, at its core, a high-intensity human–machine translation task. Human thinking is far faster than keyboard input.

The first value of AI programming is not “intelligence,” but liberation from input. You no longer spend ten minutes agonizing over variable names, repeatedly copying boilerplate, or digging through documentation for a minor syntax detail. For simple, well-defined requirements, AI can already generate code of reasonably high quality.

And one thing is certain: models will become more capable, contexts will grow longer, system-level understanding will improve, and code quality will continue to rise.

This means one thing:

The act of writing code itself is rapidly depreciating in value.


II. The Real Shift Is Not Generation, but Review

Many people quickly discover something unexpected: getting AI to write code is easy; reading AI-generated code is more exhausting.

You must evaluate:

  • Does this code truly meet the requirements?

  • Are there hidden edge cases?

  • Are there performance issues?

  • Could it delete data or amplify risk?

  • Does it damage the overall system structure?

In other words, the center of work has fundamentally shifted—from how to write to whether it is correct, appropriate, and justified.

Future programmers will no longer be skilled typists at a keyboard. They will resemble hybrid roles: able to decompose vague requirements like a product manager, understand system boundaries and long-term evolution like an architect, and take responsibility for risk and outcomes like a project manager.

Code is moving from being the goal to being a by-product of thought.


III. From Machine-Oriented to Purpose-Oriented: A Delayed but Inevitable Stage

Viewed over the long arc of programming language evolution, the path is clear:

  • First, machine-oriented: assembly

  • Then, procedural: C

  • Later, object-oriented paradigms

For years, there has been a quiet intuition that the next step would be purpose-oriented programming: humans state what they want, not how each step should be executed.

This idea was not new. What was missing were the enabling conditions.

Large language models changed that. AI is not the first technology to reduce how much code people write—but it is the first to make not writing code at all a realistic option.


IV. Do Not Lament Replacement; Productivity Tools Are Indifferent to Emotion

Every generation of productivity tools eliminates some roles and creates others. AI replacing labor is not a possibility—it is an inevitability. Resistance, denial, and emotional debate are irrelevant.

There is only one meaningful question:

Do you treat AI as a competitor, or as a multiplier?

In the past, a great engineer was said to be worth ten average ones. In the future, a skilled AI operator may be worth a hundred ordinary users. The gap is not about access to tools, but about:

  • Asking high-quality questions

  • Defining clear objectives

  • Judging output quality

  • Understanding the product at a deep level


V. When AI Becomes Cheap and Smart Enough, Talent Standards Will Be Rewritten

It is reasonable to expect that AI will become cheaper, more capable, and easier to use. At that point, coding itself will no longer be a core competitive advantage.

What will be scarce is the ability to define requirements.

Who understands users best? Who understands systems? Who knows what should not be built? These people will become the new core of research and development.


VI. This Is Not Limited to Programming

The same transformation is underway across all creative and knowledge-based fields: writing, design, art, finance, law, consulting, research.

AI is not simply “replacing humans.” It is doing something more fundamental: changing the mode of production itself.

When tools advance far enough, professional boundaries blur, skill structures are reshuffled, and what truly determines value is cognition, judgment, and taste. Technology has never cared about individual security. History shows, again and again, that those who understand and master new tools earliest are often the beneficiaries of the new order.

AI programming is only the beginning. The real transformation has just begun.

Thursday, January 01, 2026

The Age of AI: Redefining What It Means to Be Human

When viewed through the long lens of history, the current explosion of AI technology is not an isolated anomaly.

  • When the steam engine arrived, carriage drivers panicked.

  • When electricity became common, the lamplighters vanished.

  • When computers entered the office, clerks were replaced.

History has proven time and again: Technology does not phase out "people"; it phases out fixed roles in the division of labor. The Age of AI is no different. However, this time, the change is faster, deeper, and touches more directly upon the very essence of "being human."


I. AI is Not an Option—It is the Background

AI is no longer a question of "to learn or not to learn." It is about realizing you are already immersed in it.

Specifically, the generations born between 1970 and the early 2000s find themselves in a precarious position: they spent the first half of their lives painstakingly accumulating experience, only to face a reality where that experience is rapidly devaluing. This isn't a matter of personal effort; it is a generational shift in technology.

AI does not ask for permission. It won't wait for society to be ready, nor will it provide a buffer zone for individuals. Like electricity or the internet, once it becomes infrastructure, it accelerates until it becomes the environment itself.

The real danger is not that AI is too strong, but that humans are still using "old maps" to navigate a "new continent."


II. AI Does Not "Replace Humans"—It Amplifies Choices

A common misconception is that AI will replace people. A more accurate statement is: AI replaces roles of "execution without judgment."

  • The automobile didn't end human travel; it ended the carriage as the only option.

  • The calculator didn't make math obsolete; it freed humans from wasting energy on repetitive computation.

AI follows the same logic. It exponentially enhances our ability to calculate, deduce, generate, and retrieve—but it does not decide where to go. Direction remains a human prerogative. AI can calculate the probability, cost, and efficiency of every path, but it cannot tell you which path is worth walking for a lifetime. Our true role is not to "calculate faster," but to judge what is worth calculating.


III. AI Leads in Expertise, But Stumbles in "Understanding"

We must face a hard truth: in almost every field of professional knowledge, AI has already surpassed the average human.

Whether it is medicine, law, coding, finance, or linguistics—in terms of breadth of knowledge, update speed, and consistency of output—AI is a tireless expert that never forgets. To deny this is mere self-consolation.

However, its weaknesses are equally glaring:

  • It understands statistical correlation, not meaning.

  • It generates formally correct results, not value judgments.

  • It excels at "looking like" something, without truly knowing "what it is."

AI can mimic style but cannot bear responsibility. It can synthesize creativity but does not understand the sacrifice involved. It can provide answers but does not suffer the consequences. AI has no worldview and no life story. Understanding the world, owning one's choices, and bearing the results—these are the core of what it means to be human.


IV. The Growing Pains are Real: A Temporary Loss of Productivity Roles

We must honestly face a harsh reality: as AI permeates society, many jobs will lose their economic significance in a very short time.

This isn't because people aren't working hard enough; it’s because the speed of technological transition has, for the first time, outpaced the speed of individual adaptation. We will see a "fault line" in our social structure: some will upgrade quickly, while others are ejected from the old system entirely. In the long run, humanity may experience a contraction of traditional roles. As productivity skyrockets, the demand for "raw labor" drops. Society will be forced to redefine work, value, and distribution. While future milestones—space colonization, interstellar resources, and galactic civilization—may lead to a new era of expansion, we must first survive this period of intense self-restructuring.


V. So, How Should We Respond?

The answer is simple in concept, though difficult in execution:

  1. Do what AI cannot. AI struggles with true understanding, complex ethical judgment, cross-disciplinary synthesis of meaning, and the building of trust. It cannot truly empathize. Focus on decisions that require "skin in the game" and problems that are ambiguous and have no standard answer.

  2. Treat AI as an amplifier, not an opponent. The most competitive people of the future won't be those who avoid AI, but those who harness it to perform higher-order thinking. The person who knows how to ask is more important than the one who knows how to answer. Defining the problem is now more valuable than solving it.

  3. Redefine "Learning." Learning is no longer about memorizing information; it is about building frameworks for judgment, abstraction, and the ability to transfer skills between fields. It’s not about "what I know," but "how quickly I can understand, reorganize, and create."


An Upgrade, Not an Ending

Technology will neither save nor destroy humanity. It will merely amplify who we already are. If we grow accustomed to dependence, avoidance, and intellectual lethargy, AI will make that state absolute. If we insist on understanding, judging, creating, and taking responsibility, AI will become an unprecedented catalyst for our potential.

The real question has never been: "What will AI turn us into?" But rather: "In the face of AI, are we still willing to do the hard work of being human?"

Tuesday, July 29, 2025

AI Safety: The Final Threshold

As artificial intelligence continues to make breakthroughs in multimodal perception, logical reasoning, natural language processing, and intelligent control, human society is rapidly entering an era dominated by pervasive intelligent systems. From cognition-driven LLMs (large language models) to embodied intelligence in autonomous vehicles and robotic agents, AI is evolving from a tool for information processing to an autonomous actor with real-world agency. In this transformation, AI safety has moved from a marginal academic topic to a structural imperative for the continuity of civilization.


Technological Singularity Approaches: Systemic Risks at the Safety Threshold

Today’s AI systems are no longer confined to closed tasks or static data—they exhibit:

  • Adaptivity: Online learning and real-time policy updating in dynamic environments;

  • High-Degree Autonomy: The ability to make high-stakes, high-impact decisions with minimal or no human supervision;

  • Cross-Modal Sensorimotor Integration: Fusing visual, auditory, textual, and sensor inputs to drive both mechanical actuators and digital infrastructure.

Under these conditions, AI failures no longer mean simple system bugs—they imply potential cascading disasters. Examples:

  • A slight misalignment in model objectives may cause fleets of autonomous vehicles to behave erratically, paralyzing urban transport;

  • Misguided optimization in grid control systems could destabilize frequency balance, leading to blackouts;

  • Medical LLMs might issue misleading diagnostics, resulting in mass misdiagnosis or treatment errors.

Such risks possess three dangerous properties:

  1. Opacity: Root causes are often buried in training data distributions or loss function formulations, evading detection by standard tests;

  2. Amplification: Public APIs and model integration accelerate the propagation of faulty outputs;

  3. Irreversibility: Once AI interfaces with critical physical infrastructure, even minor errors can lead to irreversible outcomes with cascading societal effects.

This marks a critical inflection point where technological uncertainty rapidly amplifies systemic fragility.


The Governance Divide: A False Binary of Open vs. Closed Models

A central point of contention in AI safety is whether models should be open-source and transparent, or closed-source and tightly controlled. This debate reflects not only technical trade-offs, but also fundamentally divergent governance philosophies.

Open-Source Models: The Illusion of Collective Oversight

Proponents of open-source AI argue that transparency enables broader community scrutiny, drawing parallels to the success of open systems like Linux or TLS protocols.

However, foundational AI models differ profoundly:

  • Interpretability Limits: Transformer-based architectures exhibit nonlinear, high-dimensional reasoning paths that even experts cannot reliably trace;

  • Unbounded Input Space: Open-sourcing models doesn’t ensure exhaustive adversarial testing or safety guarantees;

  • Externalities and Incentives: Even if some community members identify safety issues, there's no institutional mechanism to mandate fixes or coordinate responses.

Historical examples such as the multi-year undetected Heartbleed vulnerability in OpenSSL underscore that “open” is not synonymous with “secure.” AI models are orders of magnitude more complex and behaviorally opaque than traditional software systems.

Closed Models: Isolated Systems under Commercial Incentives

Advocates for closed models argue that proprietary systems can compete for robustness, creating redundancy: if one model fails, others can compensate. This vision relies on two fragile assumptions:

  1. Error Independence: In practice, today’s models overwhelmingly rely on similar data, architectures (e.g., transformers), and optimization paradigms (e.g., RLHF, DPO). Systemic biases are highly correlated.

  2. Rational Long-Term Safety Investment: Competitive pressure in AI races incentivizes speed and performance over long-horizon safety engineering. Firms routinely deprioritize safeguards in favor of time-to-market metrics.

Furthermore, closed-source systems suffer from:

  • Lack of External Accountability: Regulatory agencies and the public lack visibility into model behavior;

  • Black Box Effect: Profit incentives encourage risk concealment, as seen in disasters like the Boeing 737 Max crisis.


Core Principle I: Observability and Controllability

Regardless of model openness, AI safety must be grounded in two foundational capabilities:

Observability

Can we audit and understand what a model is doing internally?

  • Are intermediate activations traceable?

  • Are outputs explainable in terms of input features or latent reasoning paths?

  • Can we simulate behavior across edge-case conditions?

  • Is behavioral logging and traceability built in?

Without observability, we cannot detect early-stage drift or build meaningful safety monitors. The system becomes untestable at runtime.

Controllability

Can humans intervene at critical moments to halt or override model actions?

  • Does a “kill switch” or emergency interrupt mechanism exist?

  • Can human instructions override the model’s policy in real time?

  • Are behavior thresholds enforced?

  • Do sandboxed and multi-layered control interfaces limit autonomous escalation?

These control channels are not optional—they constitute the final fallback mechanisms for averting catastrophic behavior.


Core Principle II: Severing AI’s Direct Agency over the Physical World

Before comprehensive safety architectures mature, the most effective short-term defense is strict separation between AI models and the physical systems they could control. Tactics include:

  • Action Confirmation Loops: No high-risk action should execute without explicit human approval;

  • Hardware-Level Isolation: All model-issued instructions must pass through trusted hardware authentication, such as TPM or FPGA-controlled gates;

  • Behavior Sandboxing: New policies or learned behaviors must be tested in secure emulated environments before deployment;

  • Dynamic Privilege Management (PAM): AI access to physical systems should adjust based on model state, system load, and contextual risk.

These constraints mirror the “separation of powers” design in critical systems like aviation control and serve as the first line of defense against autonomous execution hazards.


The Final Protocol: Redundancy as a Prerequisite for Civilizational Survival

As AI systems eventually exceed the cognitive boundaries of human oversight—becoming general, adaptive, and self-improving—the question of human sovereignty will pivot on whether we’ve built sufficient institutional and architectural buffers today.

Technology’s limits are ultimately defined by policy and design, not capability. Safety must not depend on model goodwill—it must be enforced through irrevocable mechanisms: interruptibility, auditability, bounded agency, and verifiable behavior space.

AI safety is not an application-layer patch; it is a foundational layer in humanity’s civilizational protocol stack.

Without this “final key,” we will soon hand operational control to agents we cannot interpret, predict, or restrain. This is not hypothetical. It is a question of timing—driven by the exponential trajectory of model capability.

Wednesday, June 04, 2025

Some recent thoughts on AI

1. Software form will change greatly

The role of software, in the final analysis, is a bridge between humans and data. Software allows people to extract value from data more efficiently and operate data at the same time.

Traditional software is biased towards the data side, so people have to learn and adapt to machine language.

In the future, AI will become a new bridge. Software begins to learn how to understand people. The direction of the bridge has reversed.

There will no longer be the concept of software in the future. AI will become a universal new "interface".

2. AI's model and data are essentially equivalent

Whether data is more important or model is more important has been a topic of endless debate in academia.

In fact, the two are essentially equivalent.

Data can be used to train models, and models record knowledge extracted from data. Therefore, models are the "form transfer" of data and another expression dimension of information.

AI is not just changing information systems, it is itself becoming the ultimate form of information systems.