SenseTime bets on multimodal AI to regain its edge

Chinese AI pioneer SenseTime is leveraging its computer vision roots to lead the next phase of AI, shifting towards multimodal systems and embodied intelligence in the physical world. Co-founder and chief scientist Lin Dahua stated that this approach mirrors Google's, starting with vision capabilities as the core and adding language to build true multimodal systems.

SenseTime, a Hong Kong-listed company long regarded as one of the world's leading facial recognition providers, is seeking a new role in the generative AI era that began with ChatGPT's launch three years ago. In an interview with the Post on Wednesday, co-founder and chief scientist Lin Dahua explained that the company's longstanding expertise in vision-based AI positions it strongly to lead in embodied intelligence, robotics, and AI agents operating in real-world environments, amid growing debates on the limits of large language models (LLMs).

"Our strategic approach is somewhat similar to Google’s in the United States, which primarily focuses on multimodal AI including the latest Nano Banana Pro. They also start with vision capabilities as the core, then add language abilities to create real multimodal systems," said Lin, who is also an associate professor of information engineering at the Chinese University of Hong Kong.

Extending the comparison to Google—which has deep capabilities across the AI stack, including its own TPU chips for training models—Lin noted that SenseTime's decision as early as 2018 to build large-scale data centres laid a solid foundation for its ambitions. As of August, the company's total computing power stood at about 25,000 petaflops, up 8.7 per cent since the start of the year, after surging 92 per cent over the whole of 2024.

This pivot signals SenseTime's shift from hype to more hardware-focused investments, aiming to regain its edge in multimodal, real-world AI.

Artikel Terkait

Chinese minister announces China's AI sector exceeding $165 billion at National People's Congress, with futuristic AI graphics on display.
Gambar dihasilkan oleh AI

China's AI sector tops $165 billion in 2025, minister says

Dilaporkan oleh AI Gambar dihasilkan oleh AI

The output of China's core artificial intelligence industry exceeded 1.2 trillion yuan ($165 billion) in 2025, with more than 6,200 companies operating in the field, said Li Lecheng, head of the Ministry of Industry and Information Technology. The remarks came after the opening meeting of the fourth session of the 14th National People's Congress in Beijing on Thursday.

Para ahli memprediksi 2026 sebagai tahun penting bagi model dunia, sistem AI yang dirancang untuk memahami dunia fisik lebih dalam daripada model bahasa besar. Model ini bertujuan untuk membumikan AI dalam realitas, memungkinkan kemajuan dalam robotika dan kendaraan otonom. Pemimpin industri seperti Yann LeCun dan Fei-Fei Li menyoroti potensinya untuk merevolusi kecerdasan spasial.

Dilaporkan oleh AI

Hangzhou-based startup DeepSeek has not announced plans for its next major AI model release, but its technical papers suggest potential advances. The papers highlight how AI infrastructure innovations could drive efficiency and scale up model performance.

Di kantor Google di New York, prototipe kacamata pintar menampilkan fitur canggih seperti terjemahan real-time dan integrasi aplikasi. Perangkat ini, yang memadukan bantuan AI dengan teknologi wearable, siap diluncurkan pada 2026 oleh perusahaan besar. Tren ini menandakan pergeseran menuju pendamping realitas tertambah sehari-hari.

Dilaporkan oleh AI

A new report shows major Chinese tech firms dominating the consumer AI market. ByteDance-owned Doubao remains the top consumer AI app in the country, with DeepSeek's namesake chatbot in second place.

Abhishek Singh, CEO of the IndiaAI Mission, has outlined a focused strategy for India's AI development, emphasizing practical, population-scale models over the global race for artificial general intelligence. In an interview, he highlighted India's potential as the world's inference capital and preparations for the upcoming AI Impact Summit in New Delhi. The approach prioritizes sovereign AI solutions tailored to Indian challenges in sectors like healthcare and agriculture.

Dilaporkan oleh AI

Ahead of CES 2026 in Las Vegas, major Korean tech firms including LG Electronics, Hyundai Motor Group, and Samsung Electronics unveiled AI-centric products and visions. They presented strategies like 'AI in Action' and 'Physical AI,' showcasing advances in robotics, laptops, memory, and more across daily life and industry. The events emphasized AI extending beyond screens into real-world applications.

 

 

 

Situs web ini menggunakan cookie

Kami menggunakan cookie untuk analisis guna meningkatkan situs kami. Baca kebijakan privasi kami untuk informasi lebih lanjut.
Tolak