Skip to content

Simple agents are good for 1-to-1 retrieval system. For more complex task we need multi steps reasoning loop. In a reasoning loop the agent can break down a complex task into subtasks and solve them step by step while maintaining a conversational memory.

Notifications You must be signed in to change notification settings

ksjpswaroop/LlamaIndex-Agent-with-Reasoning-Loop

 
 

Repository files navigation

Reasoning-AGENT

RAG system with Agent is a good syatem for single step question answering. It means a simple Agent can dynamically choose between tools based on current state and perform a single pass operation through pipeline tools to generate response. A complex task may require breaking down a instruction into subtaska and perfrom sequentially. When performing sub taks sequentially we also need a buffer memory to keep track of the works. This powers is incorporated into agent with a reasonong loop in Llama Index.

Intoduction

A simple Agentic-RAG is good for 1-to-1 question answering. However a complex task may requiire sequence of subtasks to be executed to arrive at the final answer. This is where reasoning loop comes into play. In a reasoning loop the agent is able to perform subtaks sequentially until final response.

A agent consist of Agent Runner and Agent Worker. These two componnt interact in a cyclic manner until final response. Agent worker perform the actual work of tool selection through reasoning via chain of thoughts and deciding next step or formulating the final output. Whereas Agent Runner is the orechastator responsible for mainintaining task state and memory buffer. It works more like a scheduler and dispatcher.

The Image below shows the Agent Reasoning Loop with Agent Component and their role.

image refernec:DeepLearning.AI

Control Flow

On a broader term the Reasoning Loop perform tasks in a logical sequence. Well what is the sequence. How every component is interacting?

To understand the workflow we need to undestand chain of thoughts. To break a complex task into subtask LlamaIndex Agent makes use of chain of thought to guide the LLM into deciding the next action. These chain of thought are introduced by template with some fixed set of prompts and some dymanic context relevant to the current step.

The diagram below shows the sequence of events in an AGENT REASONING LOOP

Frameworks

  • Agentic-RAG: Llama Index
  • App: Gradio
  • LLM: Llama3 8B
  • Embedding: nomic-embed-text
  • Local LLM: Ollama
  • Containerization: Docker

File Structure

  • agent-reasoning-loop.ipynb: Code for implementing an RAG Agent with Llama Index.
  • app.py: Code Gradio application. Tools creation functions are present in utils.py and AGENT creation funtions are present in get_agent.py

How to RUN

We need a LINUX system with atleast 8GB RAM.

  • Install libraries with make install
  • To Run the Application execute docker compose up If you want to run the notebooks
  • Download Ollama Docker Image and start Ollama server with ollama_docker on a new CLI as the server will block the CLI.
  • Now agent-reasoning-loop.ipynb Notebook can be executed. If previous images or container exist with same name it may create conflict. Delete image/conatiners if name conflict occurs.

References

Acknowledgements

  • Thanks to DeepLearning.AI and LlamaIndex for the wonderful course
  • Thanks to Meta for open source Llama3

If you find the repo helpful, please drop a ⭐

About

Simple agents are good for 1-to-1 retrieval system. For more complex task we need multi steps reasoning loop. In a reasoning loop the agent can break down a complex task into subtasks and solve them step by step while maintaining a conversational memory.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 50.7%
  • Python 38.0%
  • Dockerfile 10.2%
  • Makefile 1.1%