People who just started using Codex and want it to help them track what they did each day; people who have many AI conversations and file changes and often feel disoriented by evening; people who have heard about Hermes or work mirrors but are not sure whether they need them.
After using AI to get work done, the traces of that work are scattered across conversations, files, and folders. By evening you can't clearly describe what you finished, and the next morning you don't know where to pick up.
One decision principle (Codex-only means use its built-in automation), one upgrade principle (multiple agents means add a work mirror), three copy-ready prompts, and a three-stage practice moving from manual to automatic.
Start with One Question: Single Agent or Multiple Agents?
I now split AI work logging into two situations. Decide which one you are in first, then choose the tool. Picking the tool too early makes the system heavy; once you know where you stand, the right tool naturally follows.
Path A · Codex Only
Take this path if most of your answers below are "yes"
- I mainly work inside Codex
- Most of my file edits are made by Codex
- I just want to know what I did each day
- I don't need to track how different agents divided the work
- I prefer the simplest possible setup
Path B · Multi-Agent Work Mirror
Take this path if most of your answers below are "yes"
- I regularly use Claude Code, Codex, Gemini, or other agents together
- I often can't tell which agent changed what
- I want to track how much time and how many tokens each task consumed
- I want to know whether a task has any follow-up
- I need to write work traces back into a knowledge base or work log
Most People Start by Building a System That Is Too Big
When people first learn AI automation, they often want to do everything at once. Should I install a dedicated monitoring tool? Should I auto-summarize every 20 minutes? Should I track tokens, focus levels, and operation frequency? Should I connect Claude, Codex, and Gemini all at the same time?
All of those are possible. But for most people just starting out, the first step is simply to be able to see their daily work trail. Three questions are enough to begin:
- What did I do today?
- Which files did I change?
- What is the next step tomorrow?
If you can answer those three questions every day, your AI workflow already has a shape.
Codex-Only Path: Built-In Automation Is Enough
If you only use Codex, I recommend starting with three stages. Build the habit first, then hand it over to automation.
When a stretch of work wraps up, ask Codex to summarize what you just did. The goal is to build the habit that work needs to leave a trace.
Once the manual habit is steady, have Codex produce a daily report each day based on the conversations, edited files, and documents produced.
Once you know what a good log looks like, set up a scheduled task to record automatically at fixed intervals. Default is every 30 minutes; you can adjust to 15 minutes or 1 hour.
Stage 1: Manually Ask Codex to Write a Work Log
Don't automate anything at first. Build the habit: when a block of work finishes, ask Codex to help you summarize what just happened. You can paste this prompt directly:
Please help me write a work log for this last stretch of work. Organize it into five sections: 1. What I originally intended to do 2. What was actually completed 3. Which files were changed or what outputs were produced 4. Where I got stuck along the way 5. What the next step should be Only record what has already happened. Do not infer results. If you are unsure about something, mark it as "unconfirmed".
Many people think work logs are written for a manager. In practice, a work log is written first for your future self. When you use AI to get work done, a single session today might change a dozen different things. Without a record, it's easy to wake up tomorrow with no idea where this thread left off.
Stage 2: Let Codex Read Today's File Changes
Once manual logging is consistent, have Codex check what you did each day. Update the prompt to something like this:
Please help me write today's work log. Based on today's conversations, edited files, and produced documents, compile a daily report. Include: 1. Today's main tasks 2. Which files or outputs each task touched 3. What has been completed 4. What has not yet been completed 5. The three best things to continue with tomorrow Constraints: - Do not write this as a reflective essay - Do not add outcomes that did not actually happen - Mark anything uncertain as "unconfirmed" - If there are file paths, list them directly
This stage will help you realize that AI can function as a retrospective tool, looking back at your actions from the day and pulling together context that had scattered.
Stage 3: Use Codex Scheduled Tasks for Automatic Logging
Once you know what a good log looks like, turn it into a scheduled task that records automatically at fixed intervals. I personally default to every 30 minutes; adjust to 15 minutes or 1 hour based on your own work rhythm. The prompt can look like this:
Please compile a Codex work log for this recent interval. Review the main activity in this workspace since the last log entry and organize it into a reviewable work log. Output format: # Work Log (this interval) ## What Was Completed - List tasks completed during this interval ## Files and Outputs - List important files added or modified during this interval ## Blockers - List errors, waiting periods, or unfinished items that came up ## Next Steps - The 3 best things to continue with next ## Confidence Level - High: backed by file evidence or clear conversation record - Medium: conversation context exists but nothing was committed to a file - Low: only scattered clues available Rules: - Only record what actually happened during this interval - Do not add outcomes I did not mention - Do not write this as a motivational reflection - If there is not enough data, write "insufficient records to compile for this interval"
That is a complete Codex-only automatic work log. For many people, this version is more than sufficient.
When Do You Actually Need a Multi-Agent Work Mirror?
When your work starts spreading across multiple agents, new problems emerge.
For example: Codex edits your files, Claude Code polishes your drafts, Gemini looks things up, NotebookLM organizes your sources, and Hermes handles periodic reviews. At that point you realize each tool only knows its own slice. Codex doesn't necessarily know what Claude Code just did; the data Gemini retrieved might be sitting somewhere else entirely.
That's when you need a cross-agent work mirror. Its job is to bring together the traces left by different tools and answer a more detailed set of questions:
- Which agents were involved during this period? What did each one do?
- Which files were actually changed? Which actions stayed in conversation and never landed in a file?
- Does this task have a work log? What is the full context around it?
- Roughly how many tokens, operations, and focus-cost units did it take?
A Minimal Practice for Beginners
If I were teaching a complete beginner, I would give them one very small exercise. Don't try to build a full automation today. Do just this one thing: at the end of a work session, say to Codex:
Please summarize this last stretch of work in 10 lines or fewer: 1. What I did 2. What was produced 3. What the next step is Facts only. No reflections.
This exercise looks small, but it teaches you something important: an AI workflow needs two actions, not one. You give instructions when a task starts, and you write a record when it ends. Those two actions together are what creates a workflow that compounds over time.
One Step Further: Turning Work Logs into Skills
Once someone has been writing work logs consistently for a while, they start to notice repeating patterns. For instance: every time they write an article, they first dictate a rough draft, then ask AI to distill the structure. Every time they build a website, they modify it locally first, then deploy, then review. Those repeating patterns are the raw material for Skills.
That's why I treat work logs as the foundation of an AI workflow. Without work logs, Skills tend to become SOPs invented from thin air. With work logs, Skills grow out of real work.
Common Pitfalls
Four places people most often stumble. Expand each one to see the fix.
Pitfall 1: Building the system too big from the start
At the beginning, all you need is one work log per day. Don't rush to track tokens, focus levels, or operation frequency. Those belong to a more advanced stage.
Pitfall 2: Writing the work log as a reflective essay
A work log's primary job is to record facts. Reflection is fine, but don't let it crowd out the facts. This matters especially for automatically generated logs: they need to answer "what was done" before anything else.
Pitfall 3: Trusting the "success" status display
Every automation needs to be verified. Don't just check that the system says it succeeded. Check whether it actually wrote to the file, wrote to the right place, and didn't mistake an error message for work output.
Pitfall 4: Leaving out the next step
A good work log always ends with a next step. Tomorrow's version of you needs a clear entry point to pick up from.
A Positioning Reminder for Yourself
I think of the Codex-only path as the entry route. Help everyone build their own work log with Codex first. Once they start using multiple agents at the same time, introduce the multi-agent work mirror. This keeps the learning cost low and fits the pace of real work.
The essence of an AI workflow is capturing each piece of work so it becomes a process you can continue next time, reuse, and teach to the AI.
Two free talks every month.
Join the LINE Community ↗