AI processing moves to devices for speed and privacy

Tech developers are shifting artificial intelligence from distant cloud data centers to personal devices like phones and laptops to achieve faster processing, better privacy, and lower costs. This on-device AI enables tasks that require quick responses and keeps sensitive data local. Experts predict significant advancements in the coming years as hardware and models improve.

The reliance on cloud-based AI, such as Anthropic's Claude, involves sending prompts to remote data centers, which can introduce delays of seconds—unacceptable for urgent tasks like alerting a user to an obstacle in their path. Privacy is another concern, as sensitive information like health or financial data travels through multiple untrusted systems. To address these issues, companies are increasingly processing AI on devices themselves, eliminating the need for internet connectivity and reducing costs by avoiding payments to data center operators.

This shift has been underway for years. As early as 2017, iPhones used on-device AI for face recognition via a neural engine. Modern implementations, like Apple's Apple Intelligence with about 3 billion parameters, handle specific tasks such as summarizing messages or visual recognition from screenshots. Google's Pixel phones employ the Gemini Nano model on the Tensor G5 chip to power features like Magic Cue, which pulls relevant information from emails and messages without manual searching.

Experts highlight the challenges and benefits. Mahadev Satyanarayanan, a Carnegie Mellon professor, likens ideal on-device computing to the human brain, noting that while nature evolved it over a billion years, humans aim to achieve similar efficiency in five to ten years through advanced hardware and specialized models. Vinesh Sukumar, head of generative AI at Qualcomm, points out system differences for compact devices like smartwatches, often requiring offloading to the cloud—but with safeguards like user permission and secure handling to protect data.

Apple's Private Cloud Compute exemplifies privacy measures: it processes offloaded data only on company servers, sends minimal information, and stores none. For developers, on-device AI cuts ongoing costs; Charlie Chapman of the Dark Noise app uses it to mix sounds without cloud fees, allowing scalability without financial risk.

Looking ahead, on-device AI excels in object classification within 100 milliseconds but still offloads for detection, segmentation, activity recognition, and tracking. Satyanarayanan anticipates exciting progress in five years, enabling features like trip alerts via computer vision or contextual reminders about conversations.

相关文章

Illustration depicting Apple Siri integrating Google's Gemini AI, with Apple Park backdrop and fading ChatGPT logo.
AI 生成的图像

Apple chooses Google's Gemini to power next Siri upgrade

由 AI 报道 AI 生成的图像

Apple has selected Google's Gemini AI models to enhance its Siri virtual assistant in a forthcoming update. The decision, announced in a joint statement, marks a shift from previous integrations with OpenAI's ChatGPT. This multi-year partnership aims to deliver more capable AI experiences while upholding Apple's privacy standards.

Researchers from Purdue University and the Georgia Institute of Technology have proposed a new computer architecture for AI models inspired by the human brain. This approach aims to address the energy-intensive 'memory wall' problem in current systems. The study, published in Frontiers in Science, highlights potential for more efficient AI in everyday devices.

由 AI 报道

Experts argue that physical AI, involving robots and autonomous machines interacting with the real world, may provide a direct path to artificial general intelligence. Elon Musk's comments on Tesla's Optimus robots highlight this potential, amid growing investments in related technologies. The year 2026 is seen as a key inflection point for the field.

Google has launched Personal Intelligence, a new feature for its Gemini AI that integrates data from Gmail, Photos, Search, and YouTube to deliver more tailored responses. Available initially to paid subscribers in the US, the opt-in tool emphasizes user privacy controls and avoids direct training on personal data. The rollout begins in beta, with plans for broader access in the future.

由 AI 报道

Following IBM's recent findings on AI accelerating vulnerability exploits, a TechRadar report warns that hackers are turning to accessible AI solutions for faster attacks, often trading off quality or cost. Businesses must adapt defenses to these evolving threats.

AI coding agents from companies like OpenAI, Anthropic, and Google enable extended work on software projects, including writing apps and fixing bugs under human oversight. These tools rely on large language models but face challenges like limited context processing and high computational costs. Understanding their mechanics helps developers decide when to deploy them effectively.

由 AI 报道

At its Unpacked event on Wednesday, Samsung introduced the Galaxy S26 models and Galaxy Buds 4 Pro, with artificial intelligence taking center stage. New tools include an Ask AI feature in the browser and enhancements to Circle to Search for identifying purchasable items from images. The company also announced AI photo editing and various Galaxy AI updates.

 

 

 

此网站使用 cookie

我们使用 cookie 进行分析以改进我们的网站。阅读我们的 隐私政策 以获取更多信息。
拒绝