AI processing moves to devices for speed and privacy

Tech developers are shifting artificial intelligence from distant cloud data centers to personal devices like phones and laptops to achieve faster processing, better privacy, and lower costs. This on-device AI enables tasks that require quick responses and keeps sensitive data local. Experts predict significant advancements in the coming years as hardware and models improve.

The reliance on cloud-based AI, such as Anthropic's Claude, involves sending prompts to remote data centers, which can introduce delays of seconds—unacceptable for urgent tasks like alerting a user to an obstacle in their path. Privacy is another concern, as sensitive information like health or financial data travels through multiple untrusted systems. To address these issues, companies are increasingly processing AI on devices themselves, eliminating the need for internet connectivity and reducing costs by avoiding payments to data center operators.

This shift has been underway for years. As early as 2017, iPhones used on-device AI for face recognition via a neural engine. Modern implementations, like Apple's Apple Intelligence with about 3 billion parameters, handle specific tasks such as summarizing messages or visual recognition from screenshots. Google's Pixel phones employ the Gemini Nano model on the Tensor G5 chip to power features like Magic Cue, which pulls relevant information from emails and messages without manual searching.

Experts highlight the challenges and benefits. Mahadev Satyanarayanan, a Carnegie Mellon professor, likens ideal on-device computing to the human brain, noting that while nature evolved it over a billion years, humans aim to achieve similar efficiency in five to ten years through advanced hardware and specialized models. Vinesh Sukumar, head of generative AI at Qualcomm, points out system differences for compact devices like smartwatches, often requiring offloading to the cloud—but with safeguards like user permission and secure handling to protect data.

Apple's Private Cloud Compute exemplifies privacy measures: it processes offloaded data only on company servers, sends minimal information, and stores none. For developers, on-device AI cuts ongoing costs; Charlie Chapman of the Dark Noise app uses it to mix sounds without cloud fees, allowing scalability without financial risk.

Looking ahead, on-device AI excels in object classification within 100 milliseconds but still offloads for detection, segmentation, activity recognition, and tracking. Satyanarayanan anticipates exciting progress in five years, enabling features like trip alerts via computer vision or contextual reminders about conversations.

Related Articles

Illustration depicting Apple Siri integrating Google's Gemini AI, with Apple Park backdrop and fading ChatGPT logo.
Image generated by AI

Apple chooses Google's Gemini to power next Siri upgrade

Reported by AI Image generated by AI

Apple has selected Google's Gemini AI models to enhance its Siri virtual assistant in a forthcoming update. The decision, announced in a joint statement, marks a shift from previous integrations with OpenAI's ChatGPT. This multi-year partnership aims to deliver more capable AI experiences while upholding Apple's privacy standards.

Researchers from Purdue University and the Georgia Institute of Technology have proposed a new computer architecture for AI models inspired by the human brain. This approach aims to address the energy-intensive 'memory wall' problem in current systems. The study, published in Frontiers in Science, highlights potential for more efficient AI in everyday devices.

Reported by AI

As CES 2026 approaches, experts anticipate significant advancements in smart home technology driven by artificial intelligence. Predictions highlight more conversational voice assistants, noninvasive presence sensing, and automated routines that reduce human input. These developments aim to address longstanding issues like compatibility and privacy in connected homes.

Artificial intelligence (AI) has emerged at the center of modern warfare, playing an operational support role in the recent U.S.-Israeli strike on Iran. Anthropic's Claude and Palantir's Gotham were used for intelligence assessments and target identification. Experts predict further expansion of AI in military applications.

Reported by AI

Experts foresee 2026 as the pivotal year for world models, AI systems designed to comprehend the physical world more deeply than large language models. These models aim to ground AI in reality, enabling advancements in robotics and autonomous vehicles. Industry leaders like Yann LeCun and Fei-Fei Li highlight their potential to revolutionize spatial intelligence.

Google has begun rolling out its Gemini AI tool to the Chrome browser on iPhones and iPads in the US, allowing users to access AI features directly within the app. This update eliminates the need to switch to the Google app for AI assistance. The integration follows a similar rollout to desktop users in September.

Reported by AI

In 2025, cyber threats in the Philippines stuck to traditional methods like phishing and ransomware, without new forms emerging. However, artificial intelligence amplified the volume and scale of these attacks, leading to an 'industrialization of cybercrime'. Reports from various cybersecurity firms highlight increases in speed, scale, and frequency of incidents.

 

 

 

This website uses cookies

We use cookies for analytics to improve our site. Read our privacy policy for more information.
Decline