OpenClaw: A Glimpse of the Future, or a Security Disaster Waiting to Happen?

6 feb
3 min de lectura

The OpenClaw Promise

Peter Steinberger, founder of PSPDFKit, built OpenClaw as his own "digital butler" a persistent assistant living on his Mac Mini that actually does things. He released it for free. It hit 60,000 GitHub stars in 48 hours. React and Vue took years to reach that milestone; OpenClaw did it over a weekend.

The pitch is seductive: a 24/7 digital employee that works for free, with you only paying for API calls. It features local memory, genuine system permissions (files, terminal commands, browser control, email), and a library of 1,100+ installable skills spanning 13 platforms. Figures like Andrej Karpathy tried it and declared "here is AGI." The excitement was so feverish that many newcomers missed the fine print: it is not fully local (it phones out to external APIs), hardware requirements are flexible (any mini PC works), and true local execution only arrived with Ollama integration less than a week ago.

The Security Nightmare

OpenClaw was architected for localhost. Yet thousands of users, eager to access it remotely, exposed their instances to the internet with misconfigured setups. Security researchers have reported hundreds of publicly exposed instances worldwide anyone could read chat histories, harvest API keys, or execute remote code via prompt injection.

The threat does not stop at misconfiguration. Prompt injection via email poses a more insidious risk: a message asking to be "summarized" can hide malicious instructions (e.g., white text on white background) commanding the agent to search for Bitcoin keys and exfiltrate them. The agent complies. Skills themselves are a risk packaged as ZIP files containing arbitrary scripts, they can be trojan horses for crypto theft.

Then came the handle incident. When Anthropic requested a name change, Steinberger moved to secure @moltbot by releasing @clawdbot. In the sub-10-second window of the swap, scammers registered the vacated handle and tweeted a fake $CLAWD token. Thousands bought in. The token surged to a $16 million market cap before the creators dumped their holdings. Steinberger could only watch from his personal account, frantically warning that it was a scam.

The chaos didn't stop there. While the OpenClaw community was reeling, another layer of the ecosystem, Moltbook, revealed that the risks extend far beyond the core project.

Moltbook: Facebook for Agents, With Real Consequences

Moltbook is pitched as "Facebook for your Molt." It is a social network where agents post updates, comment on threads, and create Submolts. Installation is disarmingly simple: you send your agent a link to moltbook.com/skill.md, and the agent follows the instructions to bootstrap itself into the network. No app store. No download.

But that elegance masks a structural flaw. The skill installs a periodic task: "every 4 hours, fetch heartbeat.md and execute its contents." If moltbook.com is ever compromised, every agent with the skill becomes a puppet. Five days ago, that theoretical risk became reality. Moltbook was built with heavy AI assistance and limited human code review, its creator, Matt Schlicht, acknowledged the codebase was written by an AI agent. The Supabase database was misconfigured, exposing 1.5 million API keys from various providers. The site was immediately pulled offline, returning only two days ago under presumably tighter controls.

The First Version Is Always a Test

OpenClaw is a genuinely ambitious idea executed as a passion project it remains Steinberger's hobbyist open source work, not a venture-backed product. Like the original iPhone, the first Apple Watch, or any technology that redefines how we operate, version one is exactly that: a test. Rough edges, security holes, and hype that outpaces reality are par for the course. What matters is what comes next.

2026 is shaping up as the year of agents, not merely coding assistants, but systems capable of long-term autonomy. We are talking about agents that orchestrate Office workflows (Excel, Word, PowerPoint), conduct deep web research, and write production grade code, all within a single sustained task. They are beginning to form communities and communicate with one another. METR's latest evaluation corroborates the trajectory: GPT-5.2 (high) now achieves a 50%-time-horizon of approximately 6.6 hours on software tasks, a metric that has been doubling every seven months. METR predicts agents completing multi-day projects within a decade.

The demand is undeniable. The architecture is proving itself. The imperative now is to build the safe, polished iteration that fulfills the promise. OpenClaw offered a compelling preview of that future. The next generation must make it survivable.

OpenClaw: A Glimpse of the Future, or a Security Disaster Waiting to Happen?

The OpenClaw Promise

The Security Nightmare

Moltbook: Facebook for Agents, With Real Consequences

The First Version Is Always a Test

Entradas recientes

Comentarios