OpenAI launches ChatGPT agent to handle complex tasks: How it works
ETtech July 18, 2025 05:40 PM
Synopsis

OpenAI introduces a new AI agent for ChatGPT. This agent uses a virtual computer to manage complex tasks. It can browse websites and analyze data. It also creates presentations and spreadsheets. ChatGPT Agent is available for Pro, Plus, and Team users. The chatbot seeks permission before acting. Users can interrupt or stop tasks.

OpenAI launches ChatGPT Agent
OpenAI launched an artificial intelligence agent for its popular chatbot ChatGPT on Thursday that can complete complex tasks.

AI agents are software programs that can do tasks autonomously, on par with human capabilities.

The company said that OpenAI's agent will bring together “three strengths of earlier breakthroughs: Operator’s⁠ ability to interact with websites, deep research’s⁠ skill in synthesising information, and ChatGPT’s intelligence and conversational fluency.”

ChatGPT agent is available for Pro, Plus, and Team users. It can shift from reasoning to action to handle workflows as per instructions. The chatbot will ask for permission before acting, and users can interrupt, take over the browser, or stop tasks.

What are AI agents?

US software maker IBM offers the following definition on their website: “An artificial intelligence (AI) agent refers to a system or program that is capable of autonomously performing tasks on behalf of a user or another system by designing its workflow and utilising available tools.”

In simpler terms, AI agents can take actions independently, without human intervention, and determine the best steps to achieve goals set by a user.

They are hailed as the "next big thing" by major tech players like Google, OpenAI, and Anthropic are expected to be a major focus and trend this year. According to market research firm Roots Analysis, the global AI agent market is set to grow dramatically, from $5.29 billion in 2024 to $216.8 billion by 2035, with a compound annual growth rate (CAGR) of 40.15%.

How does ChatGPT agent work?

With agentic capabilities, ChatGPT agent can handle requests like “look at my calendar and brief me on upcoming client meetings based on recent news,” “plan and buy ingredients to make Japanese breakfast for four,” and “analyse three competitors and create a slide deck”, OpenAI mentioned in a statement.

ChatGPT agent employs a visual browser to interact with the web through a graphical-user interface, a text-based browser for simpler reasoning-based web queries, a terminal, and direct API access. It can also connect with apps like Gmail and Github to find information relevant to prompts and use them in its responses. It can take over the browser for deeper analysis of the information present on a website and execute tasks.

The model can choose to open a page using the text browser or visual browser, download a file from the web, manipulate it by running a command in the terminal, and then view the output back in the visual browser.
© Copyright @2025 LIDEA. All Rights Reserved.