SenseTime bets on multimodal AI to regain its edge

Wednesday, December 10, 2025

Reported by AI

Chinese AI pioneer SenseTime is leveraging its computer vision roots to lead the next phase of AI, shifting towards multimodal systems and embodied intelligence in the physical world. Co-founder and chief scientist Lin Dahua stated that this approach mirrors Google's, starting with vision capabilities as the core and adding language to build true multimodal systems.

SenseTime, a Hong Kong-listed company long regarded as one of the world's leading facial recognition providers, is seeking a new role in the generative AI era that began with ChatGPT's launch three years ago. In an interview with the Post on Wednesday, co-founder and chief scientist Lin Dahua explained that the company's longstanding expertise in vision-based AI positions it strongly to lead in embodied intelligence, robotics, and AI agents operating in real-world environments, amid growing debates on the limits of large language models (LLMs).

"Our strategic approach is somewhat similar to Google’s in the United States, which primarily focuses on multimodal AI including the latest Nano Banana Pro. They also start with vision capabilities as the core, then add language abilities to create real multimodal systems," said Lin, who is also an associate professor of information engineering at the Chinese University of Hong Kong.

Extending the comparison to Google—which has deep capabilities across the AI stack, including its own TPU chips for training models—Lin noted that SenseTime's decision as early as 2018 to build large-scale data centres laid a solid foundation for its ambitions. As of August, the company's total computing power stood at about 25,000 petaflops, up 8.7 per cent since the start of the year, after surging 92 per cent over the whole of 2024.

This pivot signals SenseTime's shift from hype to more hardware-focused investments, aiming to regain its edge in multimodal, real-world AI.

Chinese minister announces China's AI sector exceeding $165 billion at National People's Congress, with futuristic AI graphics on display.

China's AI sector tops $165 billion in 2025, minister says

Thursday, March 5, 2026 · Reported by AI · Image generated by AI

The output of China's core artificial intelligence industry exceeded 1.2 trillion yuan ($165 billion) in 2025, with more than 6,200 companies operating in the field, said Li Lecheng, head of the Ministry of Industry and Information Technology. The remarks came after the opening meeting of the fourth session of the 14th National People's Congress in Beijing on Thursday.

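2026 predicted to be the year of world models in AI

Experts predict that 2026 will be a breakthrough year for world models, AI systems designed to understand the physical world more deeply than large language models do. These models aim to ground AI in reality and enable advances in robotics and autonomous vehicles. Industry leaders such as Yann LeCun and Fei-Fei Li have stressed their potential to revolutionize spatial intelligence.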

DeepSeek stays silent on next AI model release as papers show frontier innovation

Tuesday, January 13, 2026 · Reported by AI

Hangzhou-based startup DeepSeek has not announced plans for its next major AI model release, but its technical papers suggest potential advances. The papers highlight how AI infrastructure innovations could drive efficiency and scale up model performance.

Asia

China's AI firms surpass 6000 in 2025

Technology

OpenAI's GPT Image 1.5 advances conversational photo editing but raises ethical concerns

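Tesla

China's humanoid robots draw attention in 2026 amid Tesla's Optimus progress

Smart glasses evolve in 2026 with AI and displays

At Google's New York office, smart glasses prototypes demonstrated advanced features such as real-time translation and app integration. Combining AI assistance with wearable technology, these devices are slated for release by major companies in 2026. The trend points to a shift toward everyday augmented reality companions.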

ByteDance and other Chinese tech firms dominate local consumer AI market: report

Wednesday, December 24, 2025 · Reported by AI

A new report shows major Chinese tech firms dominating the consumer AI market. ByteDance-owned Doubao remains the top consumer AI app in the country, with DeepSeek's namesake chatbot in second place.

IndiaAI chief outlines pragmatic roadmap ahead of AI summit

Abhishek Singh, CEO of the IndiaAI Mission, has outlined a focused strategy for India's AI development, emphasizing practical, population-scale models over the global race for artificial general intelligence. In an interview, he highlighted India's potential as the world's inference capital and preparations for the upcoming AI Impact Summit in New Delhi. The approach prioritizes sovereign AI solutions tailored to Indian challenges in sectors like healthcare and agriculture.

Korean firms highlight AI innovations at CES 2026

Monday, January 5, 2026 · Reported by AI

Ahead of CES 2026 in Las Vegas, major Korean tech firms including LG Electronics, Hyundai Motor Group, and Samsung Electronics unveiled AI-centric products and visions. They presented strategies like 'AI in Action' and 'Physical AI,' showcasing advances in robotics, laptops, memory, and more across daily life and industry. The events emphasized AI extending beyond screens into real-world applications.

2026/03/11 00:15

Related articles

Experts say physical AI could pave the way to AGI

China’s Zhipu AI launches GLM-5 model to challenge rivals

Tesla prioritizes AI and robotics investment in China

China leads the humanoid robot market as Tesla's Optimus lags behind

Senior OpenAI executive departs amid focus on ChatGPT

Smart living enters mainstream in China

AI companies gear up for ads as manipulation threats emerge

Alibaba scientist sees less than 20% chance for China to exceed US in AI

Shenzhen targets AI in every household as US tech rivalry heats up

GPT-5 gives way to Qwen in AI development
