SenseTime bets on multimodal AI to regain its edge

Chinese AI pioneer SenseTime is leveraging its computer vision roots to lead the next phase of AI, shifting towards multimodal systems and embodied intelligence in the physical world. Co-founder and chief scientist Lin Dahua stated that this approach mirrors Google's, starting with vision capabilities as the core and adding language to build true multimodal systems.

SenseTime, a Hong Kong-listed company long regarded as one of the world's leading facial recognition providers, is seeking a new role in the generative AI era that began with ChatGPT's launch three years ago. In an interview with the Post on Wednesday, co-founder and chief scientist Lin Dahua explained that the company's longstanding expertise in vision-based AI positions it strongly to lead in embodied intelligence, robotics, and AI agents operating in real-world environments, amid growing debates on the limits of large language models (LLMs).

"Our strategic approach is somewhat similar to Google’s in the United States, which primarily focuses on multimodal AI including the latest Nano Banana Pro. They also start with vision capabilities as the core, then add language abilities to create real multimodal systems," said Lin, who is also an associate professor of information engineering at the Chinese University of Hong Kong.

Extending the comparison to Google—which has deep capabilities across the AI stack, including its own TPU chips for training models—Lin noted that SenseTime's decision as early as 2018 to build large-scale data centres laid a solid foundation for its ambitions. As of August, the company's total computing power stood at about 25,000 petaflops, up 8.7 per cent since the start of the year, after surging 92 per cent over the whole of 2024.

This pivot signals SenseTime's shift from hype to more hardware-focused investments, aiming to regain its edge in multimodal, real-world AI.

Articles connexes

Chinese minister announces China's AI sector exceeding $165 billion at National People's Congress, with futuristic AI graphics on display.
Image générée par IA

China's AI sector tops $165 billion in 2025, minister says

Rapporté par l'IA Image générée par IA

The output of China's core artificial intelligence industry exceeded 1.2 trillion yuan ($165 billion) in 2025, with more than 6,200 companies operating in the field, said Li Lecheng, head of the Ministry of Industry and Information Technology. The remarks came after the opening meeting of the fourth session of the 14th National People's Congress in Beijing on Thursday.

Les experts prévoient 2026 comme l’année charnière pour les modèles du monde, systèmes d’IA conçus pour appréhender le monde physique plus profondément que les grands modèles de langage. Ces modèles visent à ancrer l’IA dans la réalité, favorisant des avancées en robotique et véhicules autonomes. Des leaders de l’industrie comme Yann LeCun et Fei-Fei Li soulignent leur potentiel à révolutionner l’intelligence spatiale.

Rapporté par l'IA

Hangzhou-based startup DeepSeek has not announced plans for its next major AI model release, but its technical papers suggest potential advances. The papers highlight how AI infrastructure innovations could drive efficiency and scale up model performance.

Dans les bureaux de Google à New York, des prototypes de lunettes intelligentes ont démontré des fonctionnalités avancées comme la traduction en temps réel et l’intégration d’applications. Ces appareils, mêlant assistance IA et technologie wearable, sont prêts à être lancés en 2026 par de grandes entreprises. Cette tendance signale un virage vers des compagnons de réalité augmentée du quotidien.

Rapporté par l'IA

A new report shows major Chinese tech firms dominating the consumer AI market. ByteDance-owned Doubao remains the top consumer AI app in the country, with DeepSeek's namesake chatbot in second place.

Abhishek Singh, PDG de la mission IndiaAI, a présenté une stratégie ciblée pour le développement de l'IA en Inde, mettant l'accent sur des modèles pratiques à l'échelle de la population plutôt que sur la course mondiale à l'intelligence artificielle générale. Dans une interview, il a mis en avant le potentiel de l'Inde comme capitale mondiale de l'inférence et les préparatifs pour le prochain sommet AI Impact à New Delhi. L'approche privilégie des solutions d'IA souveraines adaptées aux défis indiens dans des secteurs comme la santé et l'agriculture.

Rapporté par l'IA

Ahead of CES 2026 in Las Vegas, major Korean tech firms including LG Electronics, Hyundai Motor Group, and Samsung Electronics unveiled AI-centric products and visions. They presented strategies like 'AI in Action' and 'Physical AI,' showcasing advances in robotics, laptops, memory, and more across daily life and industry. The events emphasized AI extending beyond screens into real-world applications.

 

 

 

Ce site utilise des cookies

Nous utilisons des cookies pour l'analyse afin d'améliorer notre site. Lisez notre politique de confidentialité pour plus d'informations.
Refuser