Google’s Project Jarvis: A Glimpse into AI-Powered Browsing Automation

28/10/24

By: BR Hariyani

The new AI agent may soon handle browsing tasks, from purchasing products to booking flights, through automated actions in Chrome.

Introduction

Google is reportedly developing Project Jarvis, an AI system designed to perform tasks on a user’s behalf within a web browser. Expected to be powered by an upcoming version of Google’s Gemini AI, this “computer-using agent” is said to carry out web-based tasks, such as booking flights or making purchases, by analyzing on-screen elements and performing actions like clicks and text entry. If it works as described, Jarvis could be a significant advance in AI-driven productivity.

Automation with AI in Chrome

For now, Jarvis is reportedly built to work only in Google Chrome, where it automates tasks by taking screenshots, interpreting the content it sees on screen, and then clicking or typing in response. This approach could take repetitive online chores off users’ hands. The system is said to take a few seconds to process each action, and Google aims to refine it before a wider release.
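To make the screenshot-driven approach concrete, here is a minimal sketch of an observe, decide, act loop in Python. It is an illustration only and not Google's implementation: `Action`, `run_agent`, `FakeBrowser`, and `make_policy` are hypothetical names, the "screenshot" is a small dictionary rather than an image, and a hard-coded rule stands in for the AI model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    kind: str          # "type", "click", or "done"
    target: str = ""   # label of the on-screen element (hypothetical)
    text: str = ""     # text to enter for "type" actions


def run_agent(observe: Callable[[], dict],
              decide: Callable[[dict, List[Action]], Action],
              act: Callable[[Action], None],
              max_steps: int = 10) -> List[Action]:
    """Repeat observe -> decide -> act until the policy says "done"."""
    history: List[Action] = []
    for _ in range(max_steps):
        screen = observe()                # capture the current screen state
        action = decide(screen, history)  # a real agent would query a model here
        if action.kind == "done":
            break
        act(action)                       # perform the click or text entry
        history.append(action)
    return history


class FakeBrowser:
    """Stand-in for a real browser: one search box and one button."""

    def __init__(self) -> None:
        self.query = ""
        self.submitted = False

    def screenshot(self) -> dict:
        # Assumption: a dict replaces the image a real agent would analyze.
        return {"query": self.query, "submitted": self.submitted}

    def apply(self, action: Action) -> None:
        if action.kind == "type" and action.target == "search_box":
            self.query = action.text
        elif action.kind == "click" and action.target == "search_button":
            self.submitted = True


def make_policy(goal_text: str) -> Callable[[dict, List[Action]], Action]:
    """Rule-based placeholder for the model that picks the next action."""

    def decide(screen: dict, history: List[Action]) -> Action:
        if screen["submitted"]:
            return Action("done")
        if screen["query"] != goal_text:
            return Action("type", "search_box", goal_text)
        return Action("click", "search_button")

    return decide


if __name__ == "__main__":
    browser = FakeBrowser()
    steps = run_agent(browser.screenshot,
                      make_policy("flights to Delhi"),
                      browser.apply)
    for step in steps:
        print(step)
```

Because the loop looks at the screen again after every action, each step costs one full round of analysis, which is consistent with the reported delay of a few seconds per action.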

Part of a Larger Trend in AI Agents

Google is not alone in exploring task-automating AI. Microsoft, Apple, and other tech giants are also building AI agents that work from what appears on a user’s screen. This trend reflects the industry’s growing emphasis on simplifying user experiences through AI. Microsoft’s Copilot Vision, for instance, lets users ask Copilot about the webpage they are viewing, while Apple is rumored to have similar plans.

Potential Release Timeline

Jarvis could be previewed as early as December 2024, but Google’s release plan is not final. The company intends to conduct extensive testing, possibly releasing it to select users first to identify and resolve any issues.

Conclusion

Project Jarvis, if successful, could redefine browser-based AI interactions, positioning Google at the forefront of the emerging “computer-using agents” trend.

All images used in the articles published by Kushal Bharat Tech News are the property of Verge. We use these images under proper authorization and with full respect to the original copyright holders. Unauthorized use or reproduction of these images is strictly prohibited. For any inquiries or permissions related to the images, please contact Verge directly.
