Self-Operating Computer

Using the same inputs and outputs as a human operator, this framework enables multimodal AI models to view the screen and decide on a series of mouse and keyboard actions to reach an objective.
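The loop described above (view the screen, decide on an action, carry it out, repeat until the objective is reached) can be sketched roughly as follows. This is only an illustration, not the framework's actual code: the names Action, run_objective, fake_capture and scripted_model are hypothetical, and the screen capture and model are replaced with stand-ins so the example runs without any external library.

```python
from dataclasses import dataclass, field


@dataclass
class Action:
    """Hypothetical action record: kind is "click", "type", or "done"."""
    kind: str
    payload: dict = field(default_factory=dict)


def run_objective(objective, capture_screen, choose_action, perform, max_steps=10):
    """Observe the screen, ask the model for one action, perform it, repeat.

    Stops when the model returns a "done" action or max_steps is reached.
    Returns the list of actions that were performed.
    """
    history = []
    for _ in range(max_steps):
        screenshot = capture_screen()                           # observe
        action = choose_action(objective, screenshot, history)  # decide
        if action.kind == "done":
            break
        perform(action)                                         # act
        history.append(action)
    return history


# Stand-ins (assumptions for illustration only): a real setup would take an
# actual screenshot and send it to a multimodal model.
def fake_capture():
    return b"fake-screenshot-bytes"


def scripted_model(objective, screenshot, history):
    script = [
        Action("click", {"x": 100, "y": 200}),
        Action("type", {"text": "hello"}),
        Action("done"),
    ]
    return script[len(history)]


performed = []  # records actions instead of moving a real mouse or keyboard
history = run_objective("say hello", fake_capture, scripted_model, performed.append)
print([a.kind for a in history])  # ['click', 'type']
```

Passing the capture, decision, and execution steps in as functions keeps the loop independent of any particular model or operating system, which is one plausible way to support several of each.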

Integration

It is currently integrated with GPT-4o, o1, Gemini Pro Vision, Claude 3 and LLaVa.

Compatibility

Designed to work across operating systems and with various multimodal models.

This project is compatible with macOS, Windows, and Linux (with an X server installed).

Join the Discussion and Contribute on GitHub

We encourage contributions and discussion via the Self-Operating Computer GitHub page.

Our team is unable to provide custom support at this time.

An open-source framework to enable multimodal models to operate a computer.

Ask a computer to do anything

The all-encompassing AI solution you've been waiting for

AI is transforming our world, reshaping the way we work and live. We envision a future where one powerful AI agent streamlines your digital life, seamlessly integrating all your needs into a single, intelligent solution. Our Personal Assistant embodies this vision, offering you unparalleled convenience and efficiency in tackling everyday tasks, without the hassle of juggling multiple tools.

Personal Assistant

What can I help you with today?


Email Management

Effortlessly conquer your inbox. Stay organized, prioritize messages, and respond rapidly, all with the power of AI at your fingertips.

Personal Assistant

What can I help you with today?

Order me a large pizza to One Vanderbilt.

What kind of pizza would you like?


Everyday Tasks

Streamline your daily routine. From scheduling appointments and ordering food to online shopping and bill payments, let the power of AI optimize your everyday tasks for a smoother, more efficient lifestyle.

Personal Assistant

What are we working on today?


Research

Enhance your research capabilities. Dive into a wealth of knowledge, retrieve accurate information, and uncover valuable insights, all through the brilliance of AI-driven search and thought.