Agentic harnesses and prompting

Turns out that a lot of manual work has moved toward using LLMs and agents to drive some parts of our daily workflows. At some point in your work you are probably using them, because they are highly effective and, frankly, addictive. If you are not, you are probably in some state of Nirvana.

Your work starts by opening your terminal, favorite agentic IDE, or a GUI like CoWork. I am assuming you are using some SoTA LLM model and wrapped in an agentic harness. If you are doing this in the web interface, you are probably about a year behind what is currently possible. I know that sounds a bit like AI-bro advice, but I mean it.

The key is that the agent running your prompt has access to tools, skills, and MCP servers it can operate on. That means it has hands to perform tasks for you. That combination of model, harness, and tools is what makes it an agent.

Good, now we get to the fun part: the agent doing some work for you. Jake has sent you a message, you have seen the start of it and a link inside, and you decide that the agent should deal with it:

Take a look at the message Jake sent to the infra channel. There is a link in the message. Review it and do as much work as you can autonomously.

The big question is: how should the agent think about this task?

Context engineering

The agent operates purely on context. It is like a working memory it can process, and that memory is highly limited, so it needs to stay lean and optimized. You have given the agent a task through a prompt it needs to interpret, but there is not much context in it.

So the question is: what is the context in this case?

It is actually:

Infra channel
Message and message content
Link and link context
Review task
Permission to operate autonomously

That is the context you see from the prompt you have written. There is also a system prompt and more, which goes under the topic of prompt engineering, and I will not go into that rabbit hole here, but it is important that different levels of prompts exist.

Now the question becomes: what context is not visible from your prompt? As an employee in a company, you probably carry a lot of context based on the current state of the company, so the task sounds easy to you.

Here are some examples of what is not visible:

Who are you inside the organization, what are your role, responsibilities, actions you can take?
What is Jake’s role, responsibilities, and actions he can take?
What decision is driving Jake’s message?
How does the task contribute to company goals, metrics, or the unit as a whole?

That is the kind of context that can be captured, and it goes into the topic of context engineering because it is exactly the context that is currently missing from your prompt.

The question is: where are you getting that context from, and how are you retrieving it?

Context graphs

At this point in time there are several approaches you can take to solve this problem of retrieval, but if we are talking about enterprise knowledge bases, that knowledge is stored in multiple systems and formats, and the question is how all of it connects under a single umbrella. Just think about how you would connect documentation to a Git repository and an internal chat system, and there is already sufficient chaos to handle.

As I have discussed before, it turns out that the graph model is great for modeling in the AI space due to its flexibility and its ability to represent structure. You can pretty much express any problem as a graph problem. These days, graph databases support vectors, text, edge, and property indexes, so it is easier to connect different sources while optimizing storage for actual search.

This leads me to a naming point: if we are solving a context-engineering problem by using a graph as the model, we are talking about a context graph. My definition is simple: a context graph is a knowledge graph organized specifically for agents to consume and update. It is an AI-first graph solution to a given problem and a use case. I use “knowledge graph” here to clarify that it is modeled with intent, not just complete chaos.

In the end, humans can remain operators of intent while agents handle execution, search, and storage with context graphs.

Context graph use cases

Use cases are already appearing everywhere, but in my current state of mind, here are some examples with short descriptions and questions:

Agent Memory graphs -> What is the work the agent has done prior to this? What do agents already know about this problem?
Agent Decision trees/traces -> Storing, retrieving, and evolving decision traces. What were the decisions and why?
Agent Actions graphs -> What actions can the agent take, what is the right action to take, and what responsibility or ability is attached to that action?
Agent Code graphs -> How are source files and components composed together, and what are their dependencies?
Skills graphs -> What skills need to evolve or be used to solve a particular problem?
Agent entity/ontology/data graphs -> What is the actual data or problem the agent is working on, and what problem is it trying to solve relative to organizational data?
…

Obviously this list can keep going, so in my mind there is no single context graph use case. It is really about the application of graph models and graph algorithms to solve context-engineering problems for agents.

Technical challenges and ambiguity

We are still mostly operating in single-player mode, where you run agents toward a goal using graph data, memory, tools, and components that are mostly local to you. The harder problem is making this work in an environment where multiple agents, teams, and systems are operating at the organizational level (multiplayer mode). That is where context graphs become much more valuable, and much harder to build at the same time.

Turns out from time to time we really like to explain some theoretical concept with different names, which introduces ambiguity. The reason usually comes from the background of the author, or from the fact that the theory itself stands on shaky legs, so people reuse the same theory in conjecture while trying to fit it to the actual narrative they are working through.

Thinking now about GraphRAG, I see context graphs as the broader framing. GraphRAG usually describes a retrieval pattern: use graph structure to improve retrieval and generation. A context graph is a wider operational concept that covers things from agents first perspecitve.

That is why I think context graph is the more useful term for the workloads we are starting to face. The point is not just to retrieve better context. The point is to organize the environment around the agent in a way that makes reasoning and action more reliable and efficient.

Some of the sources I came across while thinking about context graphs before writing this blog, but I find them missing the bigger picture:

The Context Graph

Agentic harnesses and prompting

Context engineering

Context graphs

Context graph use cases

Technical challenges and ambiguity

Enjoy Reading This Article?