reconstruct tool calls #30

comhar · 2025-10-15T17:11:48Z

fixes #29

This pr reconstructs tool call messages from a tool call summary message. Concretely it converts a message like this:

I'll help you with that! Let me use the tools to perform these calculations.
<details class='tool-usage-details'>

 - `addy({"a": 5, "b": 7})`
 - `12`
 - `toolu_018LkjHQE9peK85MjXPcjaSm`

</details>

Now I'll subtract 3 from that result:
<details class='tool-usage-details'>

 - `subby({"a": 12, "b": 3})`
 - `9`
 - `toolu_01CkEqtBtBjBvrNBJ9ATaqRt`

</details>

Perfect! First, adding 5 and 7 gives us 12, and then subtracting 3 from 12 gives us the final result of **9**.

to this:

[
    Message(
        content="I'll help you with that! Let me use the tools to perform these calculations.", role='assistant', 
        function_call=None, provider_specific_fields=None
        tool_calls=[{'index': 1, 'function': {'arguments': '{"a": 5, "b": 7}', 'name': 'addy'}, 
        'id': 'toolu_018LkjHQE9peK85MjXPcjaSm', 'type': 'function'}]
    ),
    {'tool_call_id': 'toolu_018LkjHQE9peK85MjXPcjaSm', 'role': 'tool', 'name': 'addy',  'content': '12'},
      Message(
        content="Now I'll subtract 3 from that result:", role='assistant', 
        function_call=None, provider_specific_fields=None
        tool_calls=[{'index': 1, 'function': {'arguments': '{"a": 12, "b": 3}', 'name': 'subby'}, 
        'id': 'toolu_01CkEqtBtBjBvrNBJ9ATaqRt', 'type': 'function'}]
    ),
    {'tool_call_id': 'toolu_01CkEqtBtBjBvrNBJ9ATaqRt', 'role': 'tool', 'name': 'subby',  'content': '9'},
    {"role": "assistant", "content": "Perfect! First, adding 5 and 7 gives us 12, and then subtracting 3 from 12 gives us the final result of **9**"}.
]

This change should make lisette more resilient to tool call hallucinations.

comhar · 2025-10-15T17:14:33Z

cachy.jsonl

+{"key": "8ff05b20", "response": "{\"model\":\"claude-sonnet-4-5-20250929\",\"id\":\"msg_01UkSZXczvtptdBLJemBkwnv\",\"type\":\"message\",\"role\":\"assistant\",\"content\":[{\"type\":\"text\",\"text\":\"# Image Description\\n\\nThis adorable image shows a **Cavalier King Charles Spaniel puppy** with the classic Blenheim coloring (chestnut brown and white markings). \\n\\n## Key features visible:\\n- **Expressive brown eyes** looking directly at the camera\\n- **Soft, fluffy ears** with rich brown fur\\n- **White blaze** down the center of the face\\n- **White chest and paws**\\n- The puppy is lying on **green grass**\\n- **Purple flowers** (appear to be asters or similar) in the background\\n- Warm, soft lighting creating a charming portrait effect\\n\\nThe puppy has that irresistibly sweet, gentle expression that Cavalier King Charles Spaniels are famous for. This looks like a professional or carefully composed photograph, possibly for a breeder, pet portrait, or greeting card.\"}],\"stop_reason\":\"end_turn\",\"stop_sequence\":null,\"usage\":{\"input_tokens\":105,\"cache_creation_input_tokens\":0,\"cache_read_input_tokens\":0,\"cache_creation\":{\"ephemeral_5m_input_tokens\":0,\"ephemeral_1h_input_tokens\":0},\"output_tokens\":195,\"service_tier\":\"standard\"}}"}
+{"key": "130a52f1", "response": "{\"model\":\"claude-sonnet-4-5-20250929\",\"id\":\"msg_01NsVrovfY7JrTr5dPygRJhb\",\"type\":\"message\",\"role\":\"assistant\",\"content\":[{\"type\":\"text\",\"text\":\" D A C T E D\\n\\nI don't actually know your name - you haven't told me what it is yet! If you'd like me to spell your name, please let me know what it is first.\"}],\"stop_reason\":\"end_turn\",\"stop_sequence\":null,\"usage\":{\"input_tokens\":16,\"cache_creation_input_tokens\":0,\"cache_read_input_tokens\":0,\"cache_creation\":{\"ephemeral_5m_input_tokens\":0,\"ephemeral_1h_input_tokens\":0},\"output_tokens\":47,\"service_tier\":\"standard\"}}"}


Although these 2 items are unrelated to the pr, they were automatically added to the cache because their chat history is linked to upstream calls. This dependency has been removed in this pr.

comhar · 2025-10-15T17:17:18Z

lisette/core.py

+    if not msgs: return []
+    if not isinstance(msgs, list): msgs = [msgs]
+    res,role = [],'user'
+    msgs = L(msgs).map(lambda m: _build_tool_hist(m) if "<details class='tool-usage-details'>" in m else [m]).concat()


This line ensures that we automatically reconstruct the tool call history for every tool call summary msg in msgs.

comhar · 2025-10-15T17:19:42Z

lisette/core.py

+                yield f"\n<details class='tool-usage-details'>\n\n - `{fn.name}({_trunc_str(fn.arguments, replace='<TRUNCATED>')})`\n"
        elif isinstance(o, dict) and 'tool_call_id' in o: 
-            yield f"  - `{_trunc_str(_clean_str(o.get('content')))}`\n\n</details>\n\n"
+            yield f"  - `{o['tool_call_id']}`\n\n - `{_trunc_str(_clean_str(o.get('content')),replace='<TRUNCATED>')}`\n\n</details>\n\n"


We needed to include the tool_call_id in the summary so that we could fully reproduce the original tool call message. If we used a random id, it would bust the LLM cache and cachy's cache 😅 .

comhar · 2025-10-15T17:23:59Z

lisette/core.py

+def mk_tc(func, args, tcid=None, idx=1):
+    if not tcid: tcid = random_tool_id()
+    return {'index': idx, 'function': {'arguments': args, 'name': func}, 'id': tcid, 'type': 'function'}


These changes make it easy to create tool messages from the tool call summary message (i.e. when the called function and args are strings).

The downside is that there's a little more effort involved in creating a tool call message when the function and args are symbols.

For example. Here's the syntax on main

mk_tc(simple_add, a=5, b=7)

vs the syntax for this pr.

mk_tc(simple_add.__name__, json.dumps(dict(a=5, b=7)))

comhar · 2025-10-15T17:25:31Z

lisette/core.py

+    return hist
+
+# %% ../nbs/00_core.ipynb
+def mk_msgs(msgs,                   # List of messages (each: str, bytes, list, or dict w 'role' and 'content' fields)


Redefining mk_msgs for the sake of a 1 line change isn't ideal. Is there a better way to do this?

Yeah the implementation feels a bit over-clever to me. It needn't be so integrated and automatic. It's quite special-case behaviour. Instead, I'd expect to have a function like "extract_tcs()" which you pass a message to, and it turns it into a list of messages with tool calls expanded.

We went back and forth on this. We opted for the automatic implementation because we didn't see a strong reason not to expand these messages.

For extract_tcs() would that be applied before passing hist to Chat? Maybe we could add a param extract_tcs to Chat which would automatically expand messages using the extract_tcs fn?

comhar · 2025-10-15T17:31:56Z

lisette/core.py

+def _details_extract(x):
+    "Extract fn, args, tool_call_id, result from <details>"
+    m = re.search(r'<details.*?>(.*?)</details>', x, re.DOTALL)
+    tc, tcid, res = re.findall(r'-\s*`([^`]+)`', m.group(1))
+    fn, args = re.search(r'(\w+)\((.*?)\)', tc).groups()
+    return fn, args, res, tcid


This pr modifies the tool call summary message by including a tool call id in the details section.

As a result, _details_extract will throw an error if it runs on a tool call summary message generated with the current version of lisette.

That version generates a message like this

I'll use the `addy` function to add 5 and 3 for you. <details class='tool-usage-details'> `addy({"a": 5, "b": 3})` - `8` </details> The result is 8.

Whereas this pr expects this structure

I'll use the `addy` function to sum 5 and 7 for you. <details class='tool-usage-details'> - `simple_add({"a": 5, "b": 7})` - `12` - `toolu_01RPbSeouj8mc2N4rfjw2BaH` </details> The sum of 5 and 7 is **12**.

We could make it backwards compatible by using a random tool call id? Maybe this is overkill?

Using regexen here doesn't feel robust to me. I'd have thought that using json would be better - i.e. a proper structured format with a well-tested standard implementation. wdyt?

comhar · 2025-10-15T17:35:36Z

This pr doesn't handle multiple tool calls. We'll incorporate any changes in #22 if that pr is merged first. cc @erikgaas

jph00 · 2025-10-16T00:12:21Z

This is very exciting @comhar ! :D I don't think we should keep this design for long, but I'll release it for now so we've got something to play with.

reconstruct tool calls

451134d

comhar assigned comhar and KeremTurgutlu Oct 15, 2025

comhar added the enhancement New feature or request label Oct 15, 2025

comhar commented Oct 15, 2025

View reviewed changes

comhar requested a review from jph00 October 15, 2025 17:35

jph00 merged commit 95bf54e into main Oct 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

reconstruct tool calls #30

reconstruct tool calls #30

Uh oh!

comhar commented Oct 15, 2025

Uh oh!

comhar Oct 15, 2025

Uh oh!

comhar Oct 15, 2025 •

edited

Loading

Uh oh!

comhar Oct 15, 2025 •

edited

Loading

Uh oh!

comhar Oct 15, 2025 •

edited

Loading

Uh oh!

comhar Oct 15, 2025

Uh oh!

jph00 Oct 16, 2025

Uh oh!

comhar Oct 16, 2025

Uh oh!

comhar Oct 15, 2025 •

edited

Loading

Uh oh!

jph00 Oct 16, 2025

Uh oh!

comhar commented Oct 15, 2025

Uh oh!

jph00 commented Oct 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		{"key": "8ff05b20", "response": "{\"model\":\"claude-sonnet-4-5-20250929\",\"id\":\"msg_01UkSZXczvtptdBLJemBkwnv\",\"type\":\"message\",\"role\":\"assistant\",\"content\":[{\"type\":\"text\",\"text\":\"# Image Description\\n\\nThis adorable image shows a Cavalier King Charles Spaniel puppy with the classic Blenheim coloring (chestnut brown and white markings). \\n\\n## Key features visible:\\n- Expressive brown eyes looking directly at the camera\\n- Soft, fluffy ears with rich brown fur\\n- White blaze down the center of the face\\n- White chest and paws\\n- The puppy is lying on green grass\\n- Purple flowers (appear to be asters or similar) in the background\\n- Warm, soft lighting creating a charming portrait effect\\n\\nThe puppy has that irresistibly sweet, gentle expression that Cavalier King Charles Spaniels are famous for. This looks like a professional or carefully composed photograph, possibly for a breeder, pet portrait, or greeting card.\"}],\"stop_reason\":\"end_turn\",\"stop_sequence\":null,\"usage\":{\"input_tokens\":105,\"cache_creation_input_tokens\":0,\"cache_read_input_tokens\":0,\"cache_creation\":{\"ephemeral_5m_input_tokens\":0,\"ephemeral_1h_input_tokens\":0},\"output_tokens\":195,\"service_tier\":\"standard\"}}"}
		{"key": "130a52f1", "response": "{\"model\":\"claude-sonnet-4-5-20250929\",\"id\":\"msg_01NsVrovfY7JrTr5dPygRJhb\",\"type\":\"message\",\"role\":\"assistant\",\"content\":[{\"type\":\"text\",\"text\":\" D A C T E D\\n\\nI don't actually know your name - you haven't told me what it is yet! If you'd like me to spell your name, please let me know what it is first.\"}],\"stop_reason\":\"end_turn\",\"stop_sequence\":null,\"usage\":{\"input_tokens\":16,\"cache_creation_input_tokens\":0,\"cache_read_input_tokens\":0,\"cache_creation\":{\"ephemeral_5m_input_tokens\":0,\"ephemeral_1h_input_tokens\":0},\"output_tokens\":47,\"service_tier\":\"standard\"}}"}

reconstruct tool calls #30

reconstruct tool calls #30

Uh oh!

Conversation

comhar commented Oct 15, 2025

Uh oh!

comhar Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

comhar Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

comhar Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

comhar Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

comhar Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

jph00 Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

comhar Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

comhar Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jph00 Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

comhar commented Oct 15, 2025

Uh oh!

jph00 commented Oct 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

comhar Oct 15, 2025 •

edited

Loading

comhar Oct 15, 2025 •

edited

Loading

comhar Oct 15, 2025 •

edited

Loading

comhar Oct 15, 2025 •

edited

Loading