SenseTime bets on multimodal AI to regain its edge

Chinese AI pioneer SenseTime is leveraging its computer vision roots to lead the next phase of AI, shifting towards multimodal systems and embodied intelligence in the physical world. Co-founder and chief scientist Lin Dahua stated that this approach mirrors Google's, starting with vision capabilities as the core and adding language to build true multimodal systems.

SenseTime, a Hong Kong-listed company long regarded as one of the world's leading facial recognition providers, is seeking a new role in the generative AI era that began with ChatGPT's launch three years ago. In an interview with the Post on Wednesday, co-founder and chief scientist Lin Dahua explained that the company's longstanding expertise in vision-based AI positions it strongly to lead in embodied intelligence, robotics, and AI agents operating in real-world environments, amid growing debates on the limits of large language models (LLMs).

"Our strategic approach is somewhat similar to Google’s in the United States, which primarily focuses on multimodal AI including the latest Nano Banana Pro. They also start with vision capabilities as the core, then add language abilities to create real multimodal systems," said Lin, who is also an associate professor of information engineering at the Chinese University of Hong Kong.

Extending the comparison to Google—which has deep capabilities across the AI stack, including its own TPU chips for training models—Lin noted that SenseTime's decision as early as 2018 to build large-scale data centres laid a solid foundation for its ambitions. As of August, the company's total computing power stood at about 25,000 petaflops, up 8.7 per cent since the start of the year, after surging 92 per cent over the whole of 2024.

This pivot signals SenseTime's shift from hype to more hardware-focused investments, aiming to regain its edge in multimodal, real-world AI.

Verwandte Artikel

Chinese minister announces China's AI sector exceeding $165 billion at National People's Congress, with futuristic AI graphics on display.
Bild generiert von KI

China's AI sector tops $165 billion in 2025, minister says

Von KI berichtet Bild generiert von KI

The output of China's core artificial intelligence industry exceeded 1.2 trillion yuan ($165 billion) in 2025, with more than 6,200 companies operating in the field, said Li Lecheng, head of the Ministry of Industry and Information Technology. The remarks came after the opening meeting of the fourth session of the 14th National People's Congress in Beijing on Thursday.

Experts foresee 2026 as the pivotal year for world models, AI systems designed to comprehend the physical world more deeply than large language models. These models aim to ground AI in reality, enabling advancements in robotics and autonomous vehicles. Industry leaders like Yann LeCun and Fei-Fei Li highlight their potential to revolutionize spatial intelligence.

Von KI berichtet

Hangzhou-based startup DeepSeek has not announced plans for its next major AI model release, but its technical papers suggest potential advances. The papers highlight how AI infrastructure innovations could drive efficiency and scale up model performance.

At Google's New York offices, prototypes of smart glasses demonstrated advanced features like real-time translation and app integration. These devices, blending AI assistance with wearable tech, are set to launch in 2026 from major companies. The trend signals a shift toward everyday augmented reality companions.

Von KI berichtet

A new report shows major Chinese tech firms dominating the consumer AI market. ByteDance-owned Doubao remains the top consumer AI app in the country, with DeepSeek's namesake chatbot in second place.

Abhishek Singh, CEO der IndiaAI-Mission, hat eine fokussierte Strategie für die KI-Entwicklung Indiens umrissen, die praktische Modelle im Bevölkerungsmassstab betont statt des globalen Rennens um künstliche allgemeine Intelligenz. In einem Interview hob er Indiens Potenzial als weltweites Inferenz-Zentrum hervor sowie Vorbereitungen für den bevorstehenden AI Impact Summit in Neu-Delhi. Der Ansatz priorisiert souveräne KI-Lösungen, die auf indische Herausforderungen in Sektoren wie Gesundheitswesen und Landwirtschaft zugeschnitten sind.

Von KI berichtet

Ahead of CES 2026 in Las Vegas, major Korean tech firms including LG Electronics, Hyundai Motor Group, and Samsung Electronics unveiled AI-centric products and visions. They presented strategies like 'AI in Action' and 'Physical AI,' showcasing advances in robotics, laptops, memory, and more across daily life and industry. The events emphasized AI extending beyond screens into real-world applications.

Mittwoch, 11. März 2026, 00:15 Uhr

Experts suggest physical AI could lead to AGI

Mittwoch, 11. Februar 2026, 21:41 Uhr

China’s Zhipu AI launches GLM-5 model to challenge rivals

Sonntag, 08. Februar 2026, 02:24 Uhr

Tesla prioritizes AI and robotics investments in China

Donnerstag, 05. Februar 2026, 21:29 Uhr

China leads humanoid robot market while Tesla's Optimus trails

Dienstag, 03. Februar 2026, 09:31 Uhr

Senior OpenAI staff leave amid ChatGPT focus

Montag, 02. Februar 2026, 16:37 Uhr

Smart living enters mainstream in China

Sonntag, 18. Januar 2026, 01:24 Uhr

AI companies gear up for ads as manipulation threats emerge

Sonntag, 11. Januar 2026, 06:41 Uhr

Alibaba scientist sees less than 20% chance for China to exceed US in AI

Dienstag, 30. Dezember 2025, 06:28 Uhr

Shenzhen targets AI in every household as US tech rivalry heats up

Samstag, 27. Dezember 2025, 22:15 Uhr

GPT-5 gives way to Qwen in AI developments

 

 

 

Diese Website verwendet Cookies

Wir verwenden Cookies für Analysen, um unsere Website zu verbessern. Lesen Sie unsere Datenschutzrichtlinie für weitere Informationen.
Ablehnen