Generative AI is transforming our interactions with the digital world, particularly through the rise of AI-driven chatbots such as Microsoft Copilot and OpenAI’s ChatGPT. These innovations are reshaping the landscape of online searches, prompting experts to suggest that tools like ChatGPT could pose a significant challenge to Google’s dominance in this arena. While Google asserts it possesses “all the ingredients” necessary to maintain its leadership in AI, Opera has made a strategic move to solidify its position within this evolving landscape.
Opera has introduced Browser Operator, an AI agent designed to enhance the browsing experience by automating routine tasks. This launch marks a pivotal moment, as it is the first instance of a browser harnessing AI capabilities to streamline essential online activities, thereby freeing up valuable time for users. Notably, Browser Operator operates natively within the browser, ensuring that users remain logged into their accounts while safeguarding their private credentials from unauthorized third-party access. Opera emphasizes that this approach offers a more reliable and efficient solution compared to server-based alternatives.
“Browser Operator does this in an efficient and intuitive experience, where you are always in control,” Opera stated, highlighting the user-centric design of the tool.
How does Opera’s Browser Operator work?
Browser Operator takes on mundane tasks—such as purchasing socks online—allowing users to concentrate on more pressing matters. According to Opera, “Browser Operator understands your written instructions in natural language using your browser client’s local resources, and our AI Composer Engine processes this information to complete tasks in the browser.” The AI tool is designed to pause and seek user input whenever it encounters sensitive actions, such as filling out forms or confirming orders, ensuring that users maintain control throughout the process.
Opera has drawn upon its extensive experience in web development to differentiate itself in the AI space, stating, “We know how a webpage is built and what it’s meant to look like – so we brought AI into our own playing field and created Browser Operator.” While concerns regarding privacy and security have been prevalent among users, Opera assures that Browser Operator prioritizes user confidentiality. The company has clarified that the AI tool does not rely on screenshots or video captures to perform its functions.
We developed Browser Operator so that no information – no keystrokes, no screenshots, etc – will be sent to our server when you use it. When Browser Operator requests you to intervene, or when you take over, no information is being processed by the AI, and it will only start reading the website again when you click “Resume.”
Opera
Opera reiterates that the AI operates natively, ensuring that user credentials remain confined to their devices. “It uses the DOM Tree and browser layout data to get context – meaning that it uses a textual representation of the webpage,” the company explained. This method is touted as a faster and safer alternative, as Browser Operator does not need to interpret screen pixels. Furthermore, the tool can analyze a web page in its entirety, streamlining the task completion process.
While challenges such as cookie acceptance prompts and verification dialogs might seem like obstacles for the AI agent, Opera asserts that these do not hinder Browser Operator’s functionality, as the AI interprets webpage data similarly to how browsers do. Interestingly, Microsoft has launched a comparable feature called Windows Recall. However, rather than assisting users in navigating the internet while multitasking, Windows 11’s feature captures periodic snapshots of the computer screen, allowing users to search for previously viewed content using natural language.
Despite the potential privacy and security concerns that have surrounded Microsoft’s offering, Opera appears to have a solid understanding of Browser Operator’s privacy and security protocols. As the tool is currently in preview mode, it may not yet be accessible to all users. Nevertheless, Opera intends to roll out the feature widely as part of its AI feature drop program. Users can explore Browser Operator by accessing it from Opera’s sidebar or the Command Line, and it is available for download from Opera’s dedicated page.