Added Error Logging #164

ParamThakkar123 · 2025-09-12T13:09:14Z

Fixes #74

ParamThakkar123 · 2025-09-12T13:33:43Z

PaliC

Thanks for the PR!

This is in the right direction, however, we have a colleague whose using this code to run experiments (@shaahins please take a look), so I want to be careful about touching this code.

Generally I have two points of feedback!

For this toy agent, there is a distinction between 1) Errors in the llm output (mostly we can't pull a kernel from it) and 2) our api provider doing something weird like rate limiting. When putting feedback back into the llm we only care about the 1st error class (so Agent Error should only cover the first case) as the 2nd is not actionable by the agent. (ie. Claude can't do anything about us not paying our anthropic bill lol). It is somewhat useful to have an error class for 2 for when we run experiments, but it is not necessary, so I wouldn't include it in this PR.

The second point of feedback is there is a lot of redundancy in your code. Generally, if we can't pull a kernel out of the code (in this iteration of the agent), that's enough to say "ohh something is wrong here".

PaliC · 2025-09-12T17:19:46Z

BackendBench/backends/kernel_agent.py

+            # Agent error detection
+            if not result.get("kernel_code") or not isinstance(result.get("kernel_code"), str):
+                raise AgentError(f"Agent error: No kernel code produced for {op_name}.")
+            if "rate limit" in result.get("message", "").lower():


I think this check and the check below are redundant / shouldn't get hit

PaliC · 2025-09-12T17:20:20Z

BackendBench/backends/kernel_agent.py

+        except AgentError as e:
+            print(f"❌ {e}")
+            return "", False
+        except AgentError as e:


why the two exceptions here?

PaliC · 2025-09-12T17:20:55Z

BackendBench/backends/llm.py

        }

        try:
+            # Agent error detection before compilation
+            if not kernel_code or not isinstance(kernel_code, str):
+                raise AgentError(


similar to above, I'm not sure when the below two conditinals would get hit if there isn't kernel code

PaliC · 2025-09-12T17:21:16Z

BackendBench/backends/llm.py

                if torch.cuda.is_available():
                    torch.cuda.empty_cache()
                    torch.cuda.synchronize()

            correct_count = 0
            total_count = 0
            correctness_results = []
-            # todo: this is to protect against IMA errors, however, we should make this work / make sense with multiple workers


this is a legit todo lol

Oh. I am sorry putting that back 😅

PaliC · 2025-09-12T17:22:05Z

BackendBench/backends/llm.py

@@ -247,6 +247,10 @@ def test_kernel_correctness(

            return is_correct, feedback_info

+        except AgentError as e:
+            feedback_info["agent_error"] = str(e)


By default this should be empty

PaliC · 2025-09-12T17:25:03Z

BackendBench/llm_client.py

+                raise AgentError("Agent error: Empty response or rate limit encountered.")
+            return content
+        except anthropic.AnthropicError as e:
+            raise AgentError(f"Anthropic API error: {e}")


So there is a difference between the API not working (ie. getting rate limited) and the agent producing a not useful output (ie. something without a kernel). I think you'd want to distinguish between the two.

PaliC · 2025-09-12T17:40:24Z

Also @ParamThakkar123 run pytest to make sure things work.

ParamThakkar123 · 2025-09-12T17:41:39Z

Sure @PaliC . I am using pytest to for testing. And all I noted all your feedbacks and suggestion. Will make sure all code changes are aligned with your feedbacks and I would all of them work. Thank you so much!

Added Error Logging

eac24fe

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Sep 12, 2025

Updates

25d090a

PaliC requested changes Sep 12, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added Error Logging #164

Added Error Logging #164

ParamThakkar123 commented Sep 12, 2025

Uh oh!

ParamThakkar123 commented Sep 12, 2025

Uh oh!

PaliC left a comment

Uh oh!

PaliC Sep 12, 2025

Uh oh!

PaliC Sep 12, 2025

Uh oh!

PaliC Sep 12, 2025

Uh oh!

PaliC Sep 12, 2025

Uh oh!

ParamThakkar123 Sep 12, 2025

Uh oh!

PaliC Sep 12, 2025

Uh oh!

PaliC Sep 12, 2025

Uh oh!

PaliC commented Sep 12, 2025

Uh oh!

ParamThakkar123 commented Sep 12, 2025

Uh oh!

Uh oh!

Added Error Logging #164

Are you sure you want to change the base?

Added Error Logging #164

Conversation

ParamThakkar123 commented Sep 12, 2025

Uh oh!

ParamThakkar123 commented Sep 12, 2025

Uh oh!

PaliC left a comment

Choose a reason for hiding this comment

Uh oh!

PaliC Sep 12, 2025

Choose a reason for hiding this comment

Uh oh!

PaliC Sep 12, 2025

Choose a reason for hiding this comment

Uh oh!

PaliC Sep 12, 2025

Choose a reason for hiding this comment

Uh oh!

PaliC Sep 12, 2025

Choose a reason for hiding this comment

Uh oh!

ParamThakkar123 Sep 12, 2025

Choose a reason for hiding this comment

Uh oh!

PaliC Sep 12, 2025

Choose a reason for hiding this comment

Uh oh!

PaliC Sep 12, 2025

Choose a reason for hiding this comment

Uh oh!

PaliC commented Sep 12, 2025

Uh oh!

ParamThakkar123 commented Sep 12, 2025

Uh oh!

Uh oh!