limitedDistribution · Industry Research
From Language to Action: A Review of Large Language Models as Autonomous Agents and Tool Users
LLMs are utilized as decision-making agents, interpreting instructions and managing tasks. This review examines LLMs as autonomous agents and tool users.

Stargo's Stardox leverages LLMs for decision-making, transforming unstructured data into actionable insights for supply chain optimization.
Executive Summary
The pursuit of human-level artificial intelligence (AI) has significantly advanced the development of autonomous agents and Large Language Models (LLMs). LLMs are now widely utilized as decision-making agents for their ability to interpret instructions, manage sequential tasks, and adapt through feedback. This review examines recent developments in employing LLMs as autonomous agents and tool users and comprises seven research questions. We only used the papers published between 2023 and 2025 in conferences of the A* and A rank and Q1 journals. A structured analysis of the LLM agents’ architectural design principles, dividing their applications into single-agent and multi-agent systems, and strategies for integrating external tools is presented. In addition, the cognitive mechanisms of LLM, including reasoning, planning, and memory, and the impact of prompting methods and fine-tuning procedures on agent performance are also investigated. Furthermore, we evaluated current benchmarks and assessment protocols and have provided an analysis of 68 publicly available datasets to assess the performance of LLM-based agents in various tasks. In conducting this review, we have identified critical findings on verifiable reasoning of LLMs, the capacity for self-improvement, and the personalization of LLM-based agents. Finally, we have discussed ten future research directions to overcome these gaps.
Source: arxiv.org
Original Article: https://arxiv.org/html/2508.17281v1
More from the News Room
View allWe are publishing more related coverage here soon. Explore the full News Room for the latest articles.
See ROI in 12 weeks
See where enterprise data is slowing operations down.
Estimate the manual effort, delays, and leakage hidden across your current workflow before you automate it.