Articles tagged with: LLM Agents

Showing 3 results for this tag.

Intermediate·Dec 30, 2025

AudioFab: Building A General and Intelligent Audio Factory through Tool Learning

AudioFab is an open-source agent framework designed to create a unified and efficient audio-processing ecosystem by addressing the fragmentation and complex integration issues of existing audio AI tools. It offers a modular design and intelligent tool learning strategies to simplify complex audio tasks for both experts and non-experts.

LLM Agents

Tool Learning

Audio Processing

Intermediate·Dec 2, 2025

Benchmark for Planning and Control with Large Language Model Agents: Blocksworld with Model Context Protocol

This paper introduces a new benchmark for evaluating Large Language Model (LLM) agents in planning and execution tasks within industrial automation. It uses the Blocksworld problem with five complexity categories and integrates the Model Context Protocol (MCP) as a standardized tool interface, enabling systematic comparison of diverse LLM agent architectures.

LLM Agents

Benchmarks

AI Planning

Advanced·Dec 2, 2025

Evaluating Long-Context Reasoning in LLM-Based WebAgents

This paper introduces a benchmark for evaluating long context reasoning capabilities of WebAgents through sequentially dependent subtasks that require retrieval and application of information from extended interaction histories. It observes a dramatic performance degradation as context length increases and proposes an implicit RAG approach for modest improvements.

LLM Agents

Long Context

Benchmarking

Research Guy

All Tags

Research Guy

Understand New Research — Instantly

Daily AI-generated explanations of the latest arXiv papers.

Research Guy

Research Guy

All Tags

Research Guy

Research Guy

Articles tagged with: LLM Agents