Leveraging Python Code and LLM Collaboration for Problem Solving

Recently, while playing the Mario game, I observed that LLMs struggle with physical simulations, especially when processing non-periodic screen data conveyed through text.

To address these limitations, I considered using Python code to compute specific status recognitions (e.g., distance calculations or collision detection) and then pass refined data to the LLM. This hybrid approach could simplify problem-solving by leveraging Python’s speed and precision for mathematical computations, while the LLM focuses on strategic decisions.

I wonder if this role distribution would improve competition’s qualitative scores, or if handling all tasks—even those challenging for LLMs—solely through the LLM is more beneficial.

(1) Outsourcing tasks that the LLM struggles V.S. (2) solving everything with the LLM

And I believe direction (2) is more suitable for this contest.