The Agentic Frontier Insights from Emergence AI Blog

June 25, 2025

Achieving Self-Improvement in Agentic Systems with Skill Harvesting

Skill harvesting allows agentic systems to self-reflect, autonomously developing more specialized skills.

June 25, 2025

MathViz-E - Agent Tool Control

At Emergence, we’ve always believed that the next significant advancement in workflow automation will come from the planning, selection, and use of multiple external tools by artificial intelligence.

June 25, 2025

Self-Improving Agents

Self-improving agents have varying objectives, and the issue of aligning them with human values is critical.

June 23, 2025

SOTA on LongMemEval with RAG

LongMemEval is regarded as the premier benchmark for evaluating long-term memory, with requirements more complex than those of simpler tasks. Even so, our RAG-like methods achieve state-of-the-art results on it, which suggests that LongMemEval, while effective, may not capture every aspect of memory and that further benchmark development is needed.

June 20, 2025

State of the Art Results in Agentic Memory

Eager to apply more sophisticated agentic memory to the largest conversational benchmark, LongMemEval, we discuss the benchmark, our approach, our somewhat disappointing state-of-the-art findings, and the need for a more comprehensive benchmark for agentic memory than LongMemEval.

June 12, 2025

Agents Are Redefining Cybersecurity Resilience

Emergence AI agents are revolutionizing cybersecurity by autonomously correlating vast telemetry data, detecting threats in real time, automating compliance monitoring, and orchestrating efficient SOC operations while reducing manual workloads and enhancing decision-making. By acting as tireless digital teammates, these agents empower organizations to build a scalable, resilient, and proactive security posture fit for today’s complex threat landscape.

May 13, 2025

Benchmarking Agents-Creating-Agents: How LLM Choices Shape Performance, Scale, and Quality

An empirical study of how different Generative Foundation Model pairings impact agent creation, verification, and emergent system behaviors across 40 enterprise tasks.

May 6, 2025

Comparing LLMs for Planning and Code Generation in Data Science Agents

We benchmarked the latest LLMs from OpenAI, Anthropic, DeepSeek, and Google within our Data Insights Agent framework to identify which delivers the most accurate, fastest, and most consistent insights.

April 28, 2025

Building Agentic Systems from First Principles Inspired by Unix and Kubernetes

A first-principles architecture for agentic systems, inspired by Unix and Kubernetes. It introduces nine core abstractions, including Execution Contexts, Skills (as high-level system calls), and Dynamic Agent Instantiation, to enable runtime agent creation, recursive delegation, and asynchronous execution.
