Tag: LLM

Breaking the Chain: Simple Word Swaps Expose LLMs’ Reasoning Limits

October 16, 2024 AI LLM LTM Research Word Swap

Key Findings: Large Language Models (LLMs) exhibit significant limitations in handling sequentially dependent operations. Our simple word-swap experiment reveals that most models struggle to perform correctly beyond two consecutive word swap operations, highlighting a critical weakness in their sequential reasoning.

Charlie Mnemonic – Update 5: Introducing Chain-of-Thought and Integrated Recall System

October 16, 2024 AI Charlie Mnemonic LLM Research Technical blogs

We’re excited to announce the fifth major update to Charlie Mnemonic, your open-source AI assistant with Long-Term Memory. This release brings groundbreaking features, including Chain-of-Thought reasoning and an integrated Recall system that allows you to effortlessly search and reference past.

Discover the LTM Benchmark at NeurIPS 2024

October 09, 2024 AI LLM LTM Research

We are glad to announce that our paper “Beyond Prompts: Dynamic Conversational Benchmarking of Large Language Models” has been accepted to NeurIPS 2024, where we will have the opportunity to share our work and knowledge in relation to Long-Term Memory.

Major Charlie Mnemonic update released!

May 22, 2024 Charlie Mnemonic LLM LTM Research Technical blogs

We are announcing major updates for Charlie Mnemonic, your AI assistant with Long-Term Memory that’s getting smarter and more capable every day. We’ve been working hard to integrate new features and improve existing ones, and we are excited to share.

GoodAI LTM Benchmark v3 Released

April 24, 2024 AI LLM LTM Research Technical blogs

A Standardization Release: The main purpose of the GoodAI LTM Benchmark has always been to serve as an objective measure for our progress in the development of agents capable of continual and life-long learning.

LTM Benchmark: Improvements and new reports

March 22, 2024 LLM LTM Research Technical blogs

At GoodAI, we are committed to developing agents that are capable of continual and life-long learning. As part of our efforts, we have previously open-sourced the GoodAI LTM Benchmark, a suite of tests aimed to evaluate the Long-Term Memory (LTM).

Introducing Charlie Mnemonic: The First Personal Assistant with Long-Term Memory

March 01, 2024 AI LLM LTM Research Technical blogs

As part of our research efforts in continual learning, we are open-sourcing Charlie Mnemonic, the first personal assistant (LLM agent) equipped with Long-Term Memory (LTM).

Introducing GoodAI LTM Benchmark

February 09, 2024 AI LLM LTM Research Technical blogs

As part of our research efforts in the area of continual learning, we are open-sourcing a benchmark for testing agents’ ability to perform tasks involving the advanced use of the memory over very long conversations.

HALLM: An Agent that Observes and Acts through a Python Terminal

August 24, 2023 AI in Games LLM Research Robotics Technical blogs

At GoodAI, we are deeply committed to the advancement of safe AGI. Large language models (LLMs) undoubtedly offer significant power, but on their own, they have limitations — notably, the inability to learn new skills post-deployment. It's here that our.

Introducing our work on general-purpose LLM Agents

July 19, 2023 AI in Games LLM Technical blogs

At GoodAI, we are dedicated to pushing the boundaries of artificial intelligence. Our current focus is on the development of Large Language Model (LLM)-based agents with personalities that go beyond simple conversations, and instead exhibit LLM-driven behaviors, interacting with humans.

Breaking the Chain: Simple Word Swaps Expose LLMs’ Reasoning Limits

Charlie Mnemonic – Update 5: Introducing Chain-of-Thought and Integrated Recall System

Discover the LTM Benchmark at NeurIPS 2024

Major Charlie Mnemonic update released!

GoodAI LTM Benchmark v3 Released

LTM Benchmark: Improvements and new reports

Introducing Charlie Mnemonic: The First Personal Assistant with Long-Term Memory

Introducing GoodAI LTM Benchmark

HALLM: An Agent that Observes and Acts through a Python Terminal

Introducing our work on general-purpose LLM Agents

Join GoodAI

Keep in touch