Skip to content

The New Computer Update Part II

Pre-release
Pre-release
Compare
Choose a tag to compare
@KillianLucas KillianLucas released this 11 Mar 23:27
· 295 commits to main since this release
Screen Shot 2024-03-11 at 4 25 18 PM


The New Computer Update Part II introduces our first local vision model, a suite of native Mac integrations, 5x launch speed, and dozens of other requested features.

Mac Control

The new calendar, contacts, browser, mail, and sms Computer API modules let Open Interpreter control your Mac's native applications.

For example, Open Interpreter can execute the following code to get a contact, then send a text and an email with an attachment.

LLM-first Web Browser

The browser lets Open Interpreter quickly browse the web by querying a web-enabled language model.

Point Model

We have developed a model called point which is capable of locating visual controls precisely. It was designed to run locally on consumer hardware.

We leverage existing open-source models to "semantically search the screen" for text and icons. Language models can then call on this composite model to 'point' at text or icons on the screen:

While this model is capable of understanding simple interfaces with discrete icons, we intend to explore more general solutions in the next few weeks.

And more:

  • 5X launch speed
  • Experimental Docker support
  • Export conversation to a Jupyter notebook using %jupyter
  • Experimental one-click installers for Mac, Linux, and Windows
  • Profiles preview (a feature similar to custom GPTs)

New Contributors