Can Machines Simulate Human Perception? Mininglamp Technology’s Multimodal Team Wins “Best Paper Nomination” at ACM Multimedia Global Conference

2024-11-07

The 2024 ACM Multimedia (ACMMM) conference, held in Melbourne, Australia, from October 28 to November 1, witnessed an outstanding achievement by Mininglamp Technology’s Multimodal team and their collaborators from Peking University. Their research paper, titled “Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding,” garnered a prestigious Best Paper nomination. This accomplishment stands as a testament to their innovative approach and significant contribution to the field of multi-modal AI.

Mininglamp Technology’s team, led by founder, chairman, and CEO Wu Minghui, along with Zhao Chenxu, head of the Multimodal Large Model department, and Su Anyang, head of the Mingjing Algorithm department, were invited to attend the conference in Melbourne.

The ACM Multimedia conference is a premier venue for researchers and practitioners in the field of multimedia and artificial intelligence. This year’s event saw a total of 4,385 submissions, with 1,149 papers accepted for presentation. Among those, 174 were selected for oral presentations, with only 26 receiving Best Paper nominations.

Wu Minghui, founder, chairman, CEO, and CTO of Mininglamp Technology Group, presented their latest research findings at the ACMMM Oral Session.

What is the ACMMM Conference?

The ACMMM Conference is a top international academic conference in the field of multimedia, sponsored by the Association for Computing Machinery (ACM). It is also a Class A international academic conference recommended by the China Computer Federation (CCF-A). This year marks the 32nd conference since its inception in 1993.

The conference covers various aspects of multimedia computing, such as multimedia content analysis, multimedia retrieval, multimedia security, human-computer interaction, and computer vision.

ACMMM 2024 Conference Announces Best Paper Nomination Award

Mininglamp Technology’s Multimodal Team Achieves “Best Paper Nomination” at the ACMMM Global Conference

Current AI research on video content understanding focuses mainly on objective aspects and lacks effective methods for measuring, let alone simulating, human subjective responses. To address this limitation, Mininglamp Technology’s latest research integrates non-standard modalities such as EEG and eye-movement data to build a novel multimodal language model paradigm, a significant step forward in machine understanding and simulation of human subjective responses.

Mininglamp Technology’s Multimodal Team Paper Earns ACMMM 2024 Best Paper Nomination

Title: Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding

Authors: Minghui Wu, Chenxu Zhao, Anyang Su, Donglin Di, Tianyu Fu, Da An, Min He, Ya Gao, Meng Ma, Kun Yan, Ping Wang

Abstract: Understanding of video creativity and content often varies among individuals, with differences in focal points and cognitive levels across different ages, experiences, and genders. There is currently a lack of research in this area, and most existing benchmarks suffer from several drawbacks: 1) a limited number of modalities and answers with restrictive length; 2) the content and scenarios within the videos are excessively monotonous, transmitting allegories and emotions that are overly simplistic. To bridge the gap to real-world applications, we introduce a large-scale Video Subjective Multi-modal Evaluation dataset, namely Video-SME. Specifically, we collected real changes in Electroencephalographic (EEG) and eye-tracking regions from different demographics while they viewed identical video content. Utilizing this multi-modal dataset, we developed tasks and protocols to analyze and evaluate the extent of cognitive understanding of video content among different users. Along with the dataset, we designed a Hypergraph Multi-modal Large Language Model (HMLLM) to explore the associations among different demographics, video elements, EEG and eye-tracking indicators. HMLLM could bridge semantic gaps across rich modalities and integrate information beyond different modalities to perform logical reasoning. Extensive experimental evaluations on Video-SME and other additional video-based generative performance benchmarks demonstrate the effectiveness of our method.
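For readers curious how such heterogeneous records might be organized before any modeling, the following minimal Python sketch shows one plausible way to store a Video-SME-style viewing record and to connect demographic groups and video segments through hypergraph hyperedges, in the spirit of the associations the abstract describes. Everything here (the SMESample class, the build_incidence function, the scalar EEG and gaze fields, the label formats) is an invented assumption for illustration, not the authors’ released code or data format.

# Hypothetical sketch only: all names and fields below are assumptions for
# illustration and do not reproduce the paper's data schema or the HMLLM model.
from dataclasses import dataclass
import numpy as np

@dataclass
class SMESample:
    """One viewing record: a demographic group watching one video segment."""
    demographic: str        # e.g. "female_25_34" (assumed label format)
    segment_id: str         # e.g. "ad01_seg03" (assumed label format)
    eeg_engagement: float   # scalar summary of an EEG indicator (assumed)
    gaze_intensity: float   # scalar summary of eye-tracking fixations (assumed)

def build_incidence(samples: list[SMESample]) -> tuple[np.ndarray, list[str]]:
    """Build a node-by-hyperedge incidence matrix H: each sample becomes one
    hyperedge that jointly connects its demographic node and its segment node."""
    nodes = sorted({s.demographic for s in samples} | {s.segment_id for s in samples})
    index = {name: i for i, name in enumerate(nodes)}
    H = np.zeros((len(nodes), len(samples)))
    for e, s in enumerate(samples):
        H[index[s.demographic], e] = 1.0
        H[index[s.segment_id], e] = 1.0
    return H, nodes

samples = [
    SMESample("female_25_34", "ad01_seg03", eeg_engagement=0.72, gaze_intensity=0.61),
    SMESample("male_18_24",   "ad01_seg03", eeg_engagement=0.35, gaze_intensity=0.48),
]
H, nodes = build_incidence(samples)
print(nodes)  # node order: demographic groups and video segments
print(H)      # each column marks the nodes joined by one viewing record

Unlike an ordinary graph edge, a hyperedge can join any number of nodes at once, which makes the structure a natural fit for tying a demographic group, a video segment, and its physiological indicators into a single relation; the actual HMLLM goes well beyond this toy incidence matrix.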

What are human subjective feelings? What is the significance of measuring how different groups subjectively respond to advertising videos?

When people watch advertising videos, their level of comprehension of the visual elements, their emotional highs and lows, and the intensity of their gaze are all subjective feelings, and these vary with gender, age, occupation, and identity.

If machines can simulate how different groups of people subjectively respond to advertising videos, they can effectively measure an advertisement’s content and creativity, guiding the production of advertisement films and reducing advertising costs.

The following video demonstrates an analysis of a classic advertisement film, along both subjective and objective dimensions, using the paper’s method (HMLLM):

The following video demonstrates, using the paper’s method (HMLLM), the differing subjective responses of a general audience and a specific audience to the same advertising video:

Enabling machines to learn, understand, and simulate human subjective feelings could be the beginning of giving machines subjective consciousness. The new Video-SME benchmark proposed by Mininglamp Technology is expected to become a new starting point for the field, marking a shift in machines’ understanding of videos from the objective to the subjective dimension.

As a brand-new paradigm, Mininglamp Technology’s multimodal large model HMLLM aims to offer researchers in the field valuable experience and inspiration for handling non-standard modalities, advancing large models toward a future of human-machine collaboration.

This research project is supported by the Ministry of Science and Technology of China’s “New Generation Artificial Intelligence (2030)” major project.
