Research Note | April 23, 2026

Action Logs Are Not Task Progress

Complex tasks need more than a list of recent actions. They need a task skeleton: an explicit sense of the goal, the current phase, the remaining work, and the blocker that explains why the next move makes sense. AaronCore is still exploring this boundary, but the distinction already matters.

A surprising number of agents can look busy without looking coherent. They open files, search the web, click buttons, read logs, and stream back a comforting trail of activity. That can feel impressive for a few turns. Then the user asks the simplest question in the world: where are we in the task? Very often the system has no real answer. It has an action log, but not a task shape.

AaronCore does not claim to have fully solved that yet. This page is not a victory lap. It is a working thesis. If agents are going to handle larger jobs honestly, they cannot stop at emitting traces of motion. They need a clearer skeleton for the work itself.

An action log can prove movement. It cannot, by itself, prove progress.

An action log answers the wrong question

Logs are useful. They tell you what the system just did. Opened a page. Read a file. Searched a query. Saved a draft. Retried a tool. For short tasks, that may be enough. The user mostly wants confirmation that something concrete happened.

But complex tasks are not just chains of actions. They have stages, dependencies, unresolved choices, and moments when the system needs to stop, ask, verify, or back up. Once the work stretches across multiple turns, "what happened last" stops being the same question as "where is the task now." That is where pure action logs start to run out of meaning.

A task skeleton is the shape of the work

By task skeleton, we do not mean a heavy project management panel. We mean a lightweight but explicit structure that says what the job is, which phase is active, what remains unfinished, and what kind of blocker is currently in the way. The skeleton should make the task legible before the user has to reconstruct it from a pile of recent tool calls.

If the user asks for a report, the skeleton might say the system is collecting sources, outlining the argument, drafting the first pass, or waiting for a publication decision. If the user asks for a coding change, the skeleton might say the system is locating the target file, implementing the fix, running verification, or blocked on an ambiguous requirement. Those states are not just nicer labels for actions. They tell the user what kind of work is underway.

Progress needs goal, phase, and blocker

This is why progress cannot be inferred from motion alone. A system may take six actions and still be stuck in the same phase. It may also take one action and resolve the real blocker. Without an explicit frame for goal, phase, and blocker, all actions start looking flatter than they are. The user sees activity, but not the structure that gives the activity meaning.

That is also where trust comes in. A useful agent should be able to say not only what it just touched, but why that step belonged to the task and what it unlocked next. Otherwise the user is left guessing whether the system is actually converging or just generating plausible bustle.

This is still an open research direction

AaronCore is still exploring what the right task skeleton should look like in practice. The current runtime can already show steps, keep traces, and carry some explicit state forward. That matters. But it is not yet the same thing as a stable, user-legible skeleton for larger work. Pretending otherwise would only blur the line this page is trying to draw.

The more honest position is that action logs are a necessary layer, not a sufficient one. They help the user inspect motion. They do not yet give the user a durable map of the task. The skeleton problem begins exactly where raw traces stop answering the user's real question.

Why the distinction matters now

This matters before full implementation because it changes what we optimize for. If we keep treating complex-task UX as a prettier action feed, we will keep making agents look active without making them easier to trust. If we treat task skeletons as first-class runtime structure, then step logs, verification, ask-user moments, and continuation all start attaching to something more meaningful than a recent stream of operations.

That is the claim underneath this note. Complex tasks do not just need more actions. They need a clearer shape. Until agents can show that shape, users will keep doing the planning in their own heads while the software performs the movement below it.