Agentic AI Systems: Research References & Citations

Document Purpose: Comprehensive bibliography supporting the multi-planar abstraction from clinical/medical AI to general-purpose agentic systems.

Compiled: January 2026
Scope: 2023-2025 research synthesis across clinical dialogue, enterprise automation, multi-agent systems, and foundational AI architectures.

Primary Survey Sources

Clinical Domain Origin

Zhi, Zhao, Wu, Zhao, Zhu (2025)
"Reinventing Clinical Dialogue: Agentic Paradigms for LLM-Enabled Healthcare Communication"
Tianjin University & Chinese Academy of Sciences
- Contribution: Novel two-axis taxonomy (Knowledge Source × Agency Objective)
- Scope: 300+ papers surveyed
- Key Finding: Four paradigms emerge from intersection of implicit/explicit knowledge with cognition/execution objectives
- Framework: POMDP formalization for clinical dialogue agents

General Agentic AI Surveys

Ali, M. A. & Dornaika, F. (2025)
"Agentic AI: A Comprehensive Survey of Architectures, Applications, and Future Directions"
Artificial Intelligence Review
DOI: 10.1007/s10462-025-11422-4
- Contribution: Dual-paradigm framework (Symbolic/Classical vs. Neural/Generative)
- Methodology: PRISMA-based review of 90 studies (2018-2025)
- Key Finding: "Paradigm-market fit" - symbolic dominates safety-critical, neural dominates adaptive domains
- Recommendation: Hybrid neuro-symbolic architectures for future development
MDPI Future Internet (2025)
"The Rise of Agentic AI: A Review of Definitions, Frameworks, Architectures, Applications, Evaluation Metrics, and Challenges"
Future Internet, 17(9), 404
- Contribution: Comprehensive taxonomy including 143 primary studies
- Scope: LLM-based and non-LLM agentic systems
- Key Insight: 90%+ of reviewed papers published 2024-2025, indicating rapid field evolution
Artificial Intelligence Review (2025)
"Agentic AI systems in the age of generative models: architectures, cloud scalability, and real-world applications"
DOI: 10.1007/s10462-025-11458-6
- Contribution: Holistic framework integrating perception, memory, planning, execution, communication
- Key Innovation: Persistent memory layers, semantic routing, modular orchestration

Planning & Reasoning Research

ReAct and Derivatives

Yao, S. et al. (2024)
"ReAct: Synergizing Reasoning and Acting in Language Models"
ICLR 2024
- Pattern: Thought → Action → Observation loop
- Innovation: Interleaves reasoning traces with action execution
- Application: Foundation for most modern agentic architectures
Molinari et al. (2025)
"RP-ReAct: A Reasoner-Planner Supervising a ReAct Executor"
- Pattern: Hierarchical decoupling of strategic planning from execution
- Innovation: Reasoner-Planner Agent (RPA) + Proxy Execution Agents (PEA)
- Benefit: Prevents "contextual drift" and handles large context windows
Li et al. (2024)
"Focused ReAct: Addressing Context Drift and Action Loops"
- Problem: Loss of original goal during extended reasoning
- Solution: Goal reiteration and focused attention mechanisms

Reflexion & Self-Improvement

Shinn, N., Cassano, F., Labash, A. et al. (2023)
"Reflexion: Language Agents with Verbal Reinforcement Learning"
NeurIPS 2023
arXiv:2303.11366
- Innovation: Self-reflection without gradient updates
- Components: Actor, Evaluator, Self-Reflection module, Episodic Memory
- Result: 91% pass@1 on HumanEval (state-of-the-art at publication)
- Key Insight: "Verbal reinforcement learning" enables learning from experience
Multi-Agent Reflexion (MAR) (2024)
"MAR: Multi-Agent Reflexion Improves Reasoning Abilities in LLMs"
arXiv:2512.20845
- Extension: Diverse critic personas in reflection loop
- Framework: Society of Mind + Multi-Agent Debate (MAD)
- Benefit: Addresses single-agent confirmation bias

Memory Systems Research

Survey & Frameworks

ACM Transactions on Information Systems (2025)
"A Survey on the Memory Mechanism of Large Language Model-based Agents"
DOI: 10.1145/3748302
- Contribution: Comprehensive taxonomy of memory types and operations
- Framework: Parametric vs. Non-Parametric memory classification
- Key Insight: Memory is "the key component for agent-environment interactions"
- Repository: https://github.com/nuster1128/LLM_Agent_Memory_Survey
NeurIPS 2025 (A-Mem)
"A-Mem: Agentic Memory for LLM Agents"
OpenReview: FiM0M8gcct
- Innovation: Dynamic memory organization using Zettelkasten principles
- Features: Automatic indexing, cross-referencing, interconnected knowledge networks
- Key Advance: Memory structures adapt to task requirements

Specialized Memory Systems

Agent Workflow Memory (2024)
- Focus: Process-oriented memory for workflow agents
- Application: Enterprise automation, sequential tasks
MemRL (2026)
"MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory"
- Innovation: Runtime RL on memory without model updates
- Application: Long-horizon agent adaptation
MemEvolve (2025)
"Meta-Evolution of Agent Memory Systems"
- Approach: Evolutionary optimization of memory architectures

Multi-Agent Systems Research

Collaboration Mechanisms

Tran, K-T., Dao, D., Nguyen, M-D. et al. (2025)
"Multi-Agent Collaboration Mechanisms: A Survey of LLMs"
arXiv:2501.06322
- Framework: Characterizes collaboration by actors, types, structures, strategies, protocols
- Types: Cooperation, competition, coopetition
- Topologies: Peer-to-peer, centralized (hierarchical), distributed
- Finding: "Dominant topology in 68% of MAS implementations"
ACM ICDEAI (2025)
"A Survey of Multi-AI Agent Collaboration: Theories, Technologies and Applications"
DOI: 10.1145/3745238.3745531
- Scope: Cross-industry applications
- Focus: Manufacturing, finance, healthcare coordination
Multi-Agent Coordination Across Diverse Applications Survey (2025)
arXiv:2502.14743
- Applications: Decision-making, behavior simulation, robotics
- Key Systems: CAMEL, MetaGPT, RoCo

Specific Multi-Agent Frameworks

Hong, S. et al. (2024)
"MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework"
ICLR 2024
- Innovation: Human-like workflows in LLM-based multi-agent systems
- Application: Software development automation
AutoAgents (2024)
"AutoAgents: A Framework for Automatic Agent Generation"
IJCAI 2024
- Innovation: Adaptive agent generation based on task requirements
- Feature: Observer agent for coordination
ReConcile (2024)
"Round-Table Conference Improves Reasoning via Consensus among Diverse LLMs"
ACL 2024
- Pattern: Multi-model debate with voting

Tool Use & Action Research

Stanford & Harvard (2024)
"Adaptation of Agentic AI"
- Framework: Agent = Foundation Model + Planning + Tool Use + Memory
- Taxonomy: A1 (optimize agent) vs. A2 (optimize for output) vs. T1/T2 (optimize tools)
- Key Finding: "Tool use module connects agent to external environments"
τ-bench (2024)
"A Benchmark for Tool-Agent-User Interaction in Real-World Domains"
Yao, S. et al.
arXiv:2406.12045
- Contribution: Standardized evaluation for tool use
ChemCrow (2023)
Bran et al.
- Application: 13 expert-designed chemistry tools
- Pattern: LangChain + ReAct + domain-specific tools

Enterprise & Production Research

IBM Think (2025)
"AI Agents in 2025: Expectations vs. Reality"
- Prediction: AI orchestrators as backbone of enterprise AI
- Challenge: Strong compliance frameworks needed
- Insight: "Push and pull between multi-agent and single godlike agent"
Anthropic (2024)
"Model Context Protocol (MCP)"
- Standard: Connecting AI assistants to data sources
- Adoption: OpenAI support announced, thousands of integrations
- Status: De facto standard for agent-data interactions
World Journal of Advanced Research and Reviews (2025)
"Multi-agent systems: the future of distributed AI platforms"
- Metrics: 35% productivity gains, $2.1M annual cost reduction
- Protocols: FIPA standards in 58% of implementations

Evaluation & Benchmarks

SWE-bench (2023)
Jimenez, C. E. et al.
arXiv:2310.06770
- Focus: Real-world GitHub issue resolution
GAIA (2023)
Mialon, G. et al.
- Focus: General AI assistant benchmark
HumanEval (various)
- Focus: Code generation evaluation
- Significance: Standard for Reflexion and code agent evaluation

Safety & Governance Research

Khan, A. et al. (2024)
"Governance Models for Autonomous Multi-Agent Systems"
IEEE Access, vol. 12
- Focus: Policy frameworks for MAS deployment
CyberArk (2025)
"The Agentic AI Revolution: 5 Unexpected Security Challenges"
- Challenges: Identity management, secret sprawl, persistent access
Prompt Injection Research (2024-2025)
Multiple papers on LLM-to-LLM prompt injection in multi-agent systems
- Defense: LLM Tagging and sanitization

Foundational Concepts

LLM Agent Architecture

Lilian Weng (2023)
"LLM Powered Autonomous Agents"
Lil'Log
URL: https://lilianweng.github.io/posts/2023-06-23-agent/
- Framework: Planning + Memory + Tool Use
- Significance: Widely-cited conceptual foundation
Prompt Engineering Guide (2024-2025)
"LLM Agents"
URL: https://www.promptingguide.ai/research/llm-agents
- Resource: Practical implementation guidance

Cognitive Architectures

CoALA (2024)
"Cognitive Architectures for Language Agents"
TMLR
- Framework: Modular memory, action space, decision-making
- Contribution: Organizes field and guides future development

Domain-Specific Applications

Healthcare

daGOAT
Autonomous GvHD prevention system
- Paradigm: VWA (Verifiable Workflow Automator)
- Application: Clinical protocol execution
DrBioRight 2.0
Cancer proteomics chatbot
- Paradigm: GS (Grounded Synthesizer)
- Application: EHR-grounded query response
MedAgents / MDAgents (2024)
NeurIPS 2024
- Innovation: Adaptive LLM collaboration structures for medical decisions
- Application: Collaborative diagnosis

Finance

FinRobot (2024)
Yang, H. et al.
arXiv:2405.14767
- Application: Financial analysis automation
- Framework: Open-source AI agent platform
FinCon (2024)
"LLM multi-agent system with conceptual verbal reinforcement"
- Application: Financial decision making

Software Development

Devin (Cognition AI, 2024)
First claimed fully autonomous AI software engineer
- Capabilities: Shell, code editor, browser in sandbox
- Significance: Milestone in autonomous coding agents
SWE-Agent & Agentless (2024)
Various approaches to software engineering tasks
- Debate: Agent-based vs. simpler approaches

Research Frontiers (2025-2026)

Neuro-Symbolic Integration

Dynamic fusion of parametric/non-parametric knowledge
Metacognitive mechanisms (self-doubt, boundary awareness)
Fine-grained process supervision

Human-AI Teaming

Theory of mind for agents (modeling other agents' beliefs)
Trust-based role arbitration
Consensus mechanisms for distributed decisions

Memory Evolution

From RAG to true memory (non-parametric continual learning)
Episodic memory for long-horizon navigation
Privacy risks in LLM agent memory

Key Repositories & Resources

Resource	URL	Purpose
Agent Memory Survey	github.com/nuster1128/LLM_Agent_Memory_Survey	Memory mechanisms
Awesome Agent Papers	github.com/luo-junyu/Awesome-Agent-Papers	Curated paper list
Agent Memory Paper List	github.com/Shichun-Liu/Agent-Memory-Paper-List	Memory-focused research
Prompt Engineering Guide	promptingguide.ai	Implementation guidance
LangChain	langchain.com	Agent framework
LangGraph	langchain.com/langgraph	Graph-based orchestration

Citation Statistics

Category	Papers Reviewed	Date Range
Clinical Dialogue	300+	2023-2025
General Agentic AI	90+	2018-2025
Multi-Agent Systems	143+	2024-2025
Memory Mechanisms	100+	2023-2025
Total Unique Sources	400+	2018-2025

Document Changelog

Version	Date	Changes
1.0	2026-01-14	Initial compilation from survey analysis

This document supports the JSX visualizations (07-12) and markdown analyses (01-06) in the Agentic AI Systems research synthesis.

Primary Survey Sources​

Clinical Domain Origin​

General Agentic AI Surveys​

Planning & Reasoning Research​

ReAct and Derivatives​

Reflexion & Self-Improvement​

Memory Systems Research​

Survey & Frameworks​

Specialized Memory Systems​

Multi-Agent Systems Research​

Collaboration Mechanisms​

Specific Multi-Agent Frameworks​

Tool Use & Action Research​

Enterprise & Production Research​

Evaluation & Benchmarks​

Safety & Governance Research​

Foundational Concepts​

LLM Agent Architecture​

Cognitive Architectures​

Domain-Specific Applications​

Healthcare​

Finance​

Software Development​

Research Frontiers (2025-2026)​

Neuro-Symbolic Integration​

Human-AI Teaming​

Memory Evolution​

Key Repositories & Resources​

Citation Statistics​

Document Changelog​