Add CRITIC agent integration #13108
Conversation
llama-index-integrations/agent/llama-index-agent-critic/llama_index/agent/critic/step.py (three review threads, outdated; resolved)
```python
# run reflective agent
reflective_agent = self._reflective_agent_worker.as_agent()
reflective_agent_response = reflective_agent.chat(main_agent_response.response)
```
Reviewer: The only thing the reflective agent has for context here is the response from the first agent. Shouldn't it also have some context on what the original query was, too?
Author: Yeah, that's a good point!
Author: This is the only one that I haven't resolved yet. I do pass the chat history to the `ToolInteractiveReflectionAgent`, but not to the critique agent, which gets the task of reflection with tools. Doing this would require a bit of re-jigging of the prompts for the `CritiqueAgent`, and then, of course, passing the chat history, which can be done as a prompt string. I think it would be okay to do this in a future release of this package, in order to get this package out sooner. Thoughts?
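For illustration, one way to give the reflective agent the original query would be to pass the main agent's prior history into the call. This is a minimal sketch, not the PR's code: it assumes the agent runner's `chat()` accepts a `chat_history` argument and that `task.memory` holds the original query.

```python
# Sketch only: hand the reflective agent the main agent's response
# *and* the prior chat history (which contains the original query).
reflective_agent = self._reflective_agent_worker.as_agent()
reflective_agent_response = reflective_agent.chat(
    main_agent_response.response,
    chat_history=task.memory.get(),
)
```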
...lama-index-agent-introspective/llama_index/agent/introspective/reflective/self_reflection.py (outdated; resolved)
```python
new_memory = ChatMemoryBuffer.from_defaults()

# put current history in new memory
messages = task.memory.get()
```
Reviewer: By calling `get()` here, we only get the memory that fits in the current buffer window. Then later on, for `finalize`, we set the memory to all the memory from the task. This means that we've thrown away any memory that was outside of the original buffer window, which might not be desirable.
Reviewer: I don't think there's a perfect solution to this, but it's something to be aware of.
Author: Thanks! I wonder if we should retain full memory and only use truncation when it's about to be passed to an LLM.
Reviewer: I think how the ReAct agent handles it is doing `task_memory.get() + new_memory.get()` -- but that also seems risky/not ideal. Work for a future PR perhaps :)
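To make the trade-off concrete, here is a minimal sketch of the "retain full memory, truncate only at the LLM boundary" idea floated above, assuming `ChatMemoryBuffer`'s usual `get()` (window-limited) vs. `get_all()` (full history) behavior:

```python
from llama_index.core.memory import ChatMemoryBuffer

new_memory = ChatMemoryBuffer.from_defaults()

# Copy the *full* task history so nothing outside the buffer window is lost.
for msg in task.memory.get_all():
    new_memory.put(msg)

# Truncate to the buffer window only at the point of an LLM call.
llm_input_messages = new_memory.get()
```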
```python
state = step.step_state
state["count"] += 1

messages = task.extra_state["new_memory"].get()
```
Reviewer: Does this include the initial task input? I might have missed where that gets added to memory.
Author: Yes, it should, since the introspective agent first adds the chat history from the main agent, which includes the initial task input, to the reflective agent's memory. See line 162 in 0d05f90:

```python
reflective_agent_messages = task.extra_state["main"]["memory"].get()
```
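For context, the handoff described above might look roughly like this; the `put()` loop is an assumption about how the copied messages end up in the new memory, not a quote from the PR:

```python
# Sketch: seed the reflective agent's memory with the main agent's
# full chat history, which includes the initial task input.
reflective_agent_messages = task.extra_state["main"]["memory"].get()
for msg in reflective_agent_messages:
    task.extra_state["new_memory"].put(msg)
```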
task.extra_state["new_memory"].put(critique_msg) | ||
|
||
# correct | ||
if is_done: |
Reviewer: If `self.stopping_callable` is `None`, is this just never done?
Author: Oh, in that case it will stop after `max_iterations` has been reached. That gets implemented when building the `TaskStepOutput` with the `is_last` field:

```python
return TaskStepOutput(
    output=agent_response,
    task_step=step,
    is_last=is_done | (self.max_iterations == state["count"]),
    next_steps=new_steps,
)
```
Reviewer: Hmm, interesting. I wonder if that's ideal or not. I would expect a combination of the main agent being `is_done` and the reflection agent being `is_done` to control when we return 🤔 Maybe work for a future PR.
Author: Oh interesting, yeah. At this point it always goes to reflection, and this `is_done` is about when the reflection/correction phases stop.
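For readers wondering how the two stopping mechanisms interact, here is a hedged usage sketch; the class name follows this PR, but the exact `from_defaults` parameters shown are assumptions:

```python
# Assumes `critique_agent_worker` was built elsewhere in the setup.
def stopping_callable(critique_str: str) -> bool:
    """Stop reflecting early once the critique string signals a pass."""
    return "[PASS]" in critique_str

worker = ToolInteractiveReflectionAgentWorker.from_defaults(
    critique_agent_worker=critique_agent_worker,
    critique_template="Please critique the following response:\n{input_str}",
    stopping_callable=stopping_callable,  # early stop on "[PASS]"
    max_iterations=5,                     # hard cap otherwise
)
# With stopping_callable=None, reflection only ends once
# state["count"] reaches max_iterations.
```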
...x-integrations/agent/llama-index-agent-introspective/llama_index/agent/introspective/step.py (resolved)
llama-index-integrations/agent/llama-index-agent-introspective/tests/test_agent_critic.py (resolved)
Reviewer: This looks good to me! Just a few comments/possible edge cases.

Note for myself though -- need to do a better job of thinking about what goes in an integration vs. core 😅
Author: Yeah, I had initially thought the
Author: Alrighty @logan-markewich -- as discussed offline, I've cleaned up the memory for the introspective agent, removing the chat histories of the inner agents that it delegates tasks to. The final message in the history now lines up more nicely with the final response.
Description

This PR adds a new package, `llama-index-agent-introspective`. The package introduces `IntrospectiveAgent`s, which perform tasks while utilizing the "reflection" agentic pattern. Two reflection agents that differ by their reflection mechanisms are also supplied in this package. Thus, three main classes are introduced:

- `IntrospectiveAgentWorker`
- `ToolInteractiveReflectionAgentWorker`
- `SelfReflectionAgentWorker` (adapted and ported over from add reflexion agent #13089 by @jerryjliu)

Fixes # (issue)
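A hedged end-to-end sketch of how the three workers might compose; the main-agent worker, the LLM choice, and the empty tool list are illustrative assumptions, not part of this PR:

```python
from llama_index.agent.introspective import (
    IntrospectiveAgentWorker,
    SelfReflectionAgentWorker,
)
from llama_index.agent.openai import OpenAIAgentWorker
from llama_index.llms.openai import OpenAI

llm = OpenAI(model="gpt-4")

# Reflection worker that critiques and corrects its own responses.
self_reflection_worker = SelfReflectionAgentWorker.from_defaults(llm=llm)

# Main worker that produces the initial response (illustrative choice).
main_worker = OpenAIAgentWorker.from_tools(tools=[], llm=llm)

# Introspective worker that chains main generation with reflection.
introspective_worker = IntrospectiveAgentWorker.from_defaults(
    main_agent_worker=main_worker,
    reflective_agent_worker=self_reflection_worker,
)

agent = introspective_worker.as_agent()
response = agent.chat("Correct this statement: the earth is flat.")
```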
New Package?

Did I fill in the `tool.llamahub` section in the `pyproject.toml` and provide a detailed README.md for my new integration or package?

Version Bump?

Did I bump the version in the `pyproject.toml` file of the package I am updating? (Except for the `llama-index-core` package.)

Type of Change

Please delete options that are not relevant.

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration.