Add integration test with dummy agent #1316

rbren · 2024-04-23T19:03:24Z

This will help with QA tasks: making sure the agent is able to read/write files, run commands, etc.

li-boxuan · 2024-04-25T07:54:54Z

Two cents:

Is it possible to move the DummyAgent to the test package? If something is solely for testing purposes, it should ideally not reside in the main package. Registering the agent might be a challenge, though.
I know why you think it's e2e testing but I feel like it's more like a special kind of unit testing.

rbren · 2024-04-25T15:25:21Z

Yeah I was thinking e2e isn't quite right 😄 maybe integration?

I kind of want the DummyAgent to show up in the UI, at least for devs--it helps to quickly test all the UI features without having to set everything up, wait for the LLM, pay money, etc.

li-boxuan · 2024-04-25T17:00:45Z

Is it possible to add a new test to run this agent and put the test under test/integration/? That way, this agent doesn't need a dedicated GitHub Actions yaml file, and can be run with pytest.

li-boxuan · 2024-04-27T05:07:02Z

tests/integration/test_actions.py

+from opendevin.llm.llm import LLM
+
+
+def test_actions_with_dummy_agent():


I am sorry but maybe it's a bad idea to add this as a pytest test.

If we want to keep it this way, we need to:

Add an annotation here:

@pytest.mark.skipif(os.environ.get('AGENT') != 'DummyAgent', reason='Designed to test DummyAgent only')

Add another annotation in test_agent.py that DummyAgent needs to be skipped:

@pytest.mark.skipif(os.environ.get('AGENT') == 'DummyAgent', reason='DummyAgent is special and cannot solve any real task')

Add DummyAgent to run-integration-tests.yml, so that a dedicated job runner will run the DummyAgent test. I haven't checked, but I suppose all test runners currently run this test, which is unnecessary.
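To make the gating in the two annotations above concrete, here is a minimal sketch of the two skip conditions as plain functions, so they can be checked without a test runner. The function names and the `SomeOtherAgent` value are hypothetical, chosen only for illustration. In the real tests each expression would be passed directly to `pytest.mark.skipif` as shown in the comment above.

```python
import os
from typing import Mapping

DUMMY = 'DummyAgent'


def skip_dummy_only_test(env: Mapping[str, str] = os.environ) -> bool:
    """Skip condition for the DummyAgent-only test: skip unless AGENT is DummyAgent."""
    return env.get('AGENT') != DUMMY


def skip_real_task_test(env: Mapping[str, str] = os.environ) -> bool:
    """Skip condition for the tests in test_agent.py: skip when AGENT is DummyAgent."""
    return env.get('AGENT') == DUMMY


# 'SomeOtherAgent' is a placeholder name, not a real agent in the repo.
for agent in ('DummyAgent', 'SomeOtherAgent', None):
    env = {} if agent is None else {'AGENT': agent}
    print(agent, skip_dummy_only_test(env), skip_real_task_test(env))
```

Note that when `AGENT` is unset, the DummyAgent-only test is skipped and the other tests still run, so only a runner that explicitly sets `AGENT=DummyAgent` would exercise the new test.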


neubig · 2024-04-29T17:01:09Z

Hi @rbren, I'm happy to take a look at this but:

It seems some tests are failing or at least stalled?
I also added a dummy agent in this PR:

OpenDevin/agenthub/dummy_agent/agent.py

Line 11 in 46bd836

class DummyAgent(Agent):

Feel free to get rid of my dummy agent and replace it with yours. I don't think it should break any of the unit tests I created.

rbren · 2024-04-30T00:52:12Z

@neubig I think we're good to go!

.github/workflows/dummy-agent-test.yml

li-boxuan · 2024-04-30T01:09:53Z

opendevin/sandbox/docker/exec_box.py

-        return exit_code, logs.decode('utf-8').strip()
+        logs_out = logs.decode('utf-8')
+        if logs_out.endswith('\n'):
+            logs_out = logs_out[:-1]


If you are changing strip() to this, you'd probably wanna do this in the other sandboxes too?

rbren · 2024-04-30

TBH I want to redo all the log parsing in a separate PR. I don't think we should be stripping whitespace from log output. This is here because the tests don't run consistently otherwise ☹️
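For reference, a small sketch of how the two behaviors in the exec_box.py diff differ. The function names are hypothetical wrappers around the old and new lines from the diff. The point is that `strip()` removes all leading and trailing whitespace, while the replacement removes at most one trailing newline.

```python
def strip_all(logs: bytes) -> str:
    """Old behavior: drop all leading and trailing whitespace."""
    return logs.decode('utf-8').strip()


def drop_one_trailing_newline(logs: bytes) -> str:
    """New behavior: remove at most one trailing newline and keep the rest."""
    logs_out = logs.decode('utf-8')
    if logs_out.endswith('\n'):
        logs_out = logs_out[:-1]
    return logs_out


raw = b'  indented line\n\n'
print(repr(strip_all(raw)))                  # 'indented line'
print(repr(drop_one_trailing_newline(raw)))  # '  indented line\n'
```

Leading indentation and any extra blank lines survive the new version, which matters if a test compares command output exactly.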


rbren marked this pull request as ready for review April 23, 2024 21:43

rbren changed the title ~~[WIP] Add e2e test with dummy agent~~ Add e2e test with dummy agent Apr 23, 2024

rbren marked this pull request as draft April 23, 2024 21:44

rbren changed the title ~~Add e2e test with dummy agent~~ Add integration test with dummy agent Apr 25, 2024

rbren marked this pull request as ready for review April 25, 2024 15:40

rbren added 7 commits April 26, 2024 17:19

first pass at dummy

6dff4e3

add assertion to dummy

57b45ed

add dummy workflow

ff449dc

beef up tests

cb7d7c5

try and fix huggingface issue

048e625

remove newlines

889bbd6

rename test

0e29791

rbren force-pushed the rb/dummy-test branch from 757f32f to 0e29791 Compare April 26, 2024 21:20

move to pytest

de8121c

li-boxuan reviewed Apr 27, 2024


rbren added 2 commits April 27, 2024 08:06

Revert " move to pytest"

85cd78d

This reverts commit de8121c.

fix lint

dc87794

rbren and others added 2 commits April 29, 2024 20:13

Merge branch 'main' into rb/dummy-test

51d044b

delint

6c5cd74

li-boxuan approved these changes Apr 30, 2024


rbren and others added 2 commits April 30, 2024 12:40

Update .github/workflows/dummy-agent-test.yml

acd236f

Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>

Merge branch 'main' into rb/dummy-test

45f13fa

rbren enabled auto-merge (squash) April 30, 2024 16:41

rbren merged commit 0cda5f6 into main Apr 30, 2024
21 of 22 checks passed

rbren deleted the rb/dummy-test branch April 30, 2024 16:52
