Published: 2024-04-29
Author: Guangming Laboratory
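Fei Ma (马飞), Researcher
Head of the Media Intelligence Team
Education:
2017-2022 Tsinghua University, Information and Communication Engineering, Ph.D.
2013-2017 University of Electronic Science and Technology of China, Communication Engineering, Bachelor's degree (rank in major: 1/363)
Work experience:
2024-present Researcher, Guangming Laboratory
2022-2024 Senior Engineer, Huawei
Research areas:
Research focuses on multimodal content understanding and generation, specifically: (1) digital humans and the generation of interactions among humans, objects, and scenes; (2) image and video editing and generation; (3) multimodal large models and their applications in emotional intelligence.
Email: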
mafei@gml.ac.cn
Representative work:
AIGC short drama 《嫦娥奔月》 (Chang'e Flies to the Moon), https://mp.weixin.qq.com/s/-MjvqUjeotfluCVocuqHqw
Publications:
1. H. Xue, X. Luo, Z. Hu, X. Zhang, X. Xiang, Y. Dai, J. Liu, Z. Zhang, M. Li, J. Yang, F. Ma #, Z. Wu, C. Yang, Z. Dai, F. Yu. Human Motion Video Generation: A Survey. Authorea Preprints, 2024. (corresponding author)
2. F. Ma #, Y. Yuan, Y. Xie, H. Ren, I. Liu, Y. He, F. Ren, F. Yu, S. Ni. Generative Technology for Human Emotion Recognition: A Scoping Review. Information Fusion, 102753, 2024. (CAS Q1 Top, CAAI A, impact factor: 14.7, first author)
3. Y. Xie, T. Feng, X. Zhang, X. Luo, Z. Guo, W. Yu, H. Chang, F. Ma #, F. Yu. PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis. AAAI 2025. (CCF A, corresponding author)
4. X. Xiang, Z. Dai, H. Xue, D. Wang, M. Li, Y. Yue, F. Ma #, W. Yu, H. Chang, F. Yu. ReMask-Animate: Refined Character Image Animation Using Mask-Guided Adapters. AAAI 2025. (CCF A, corresponding author)
5. L. Wang, S. Shi, F. Ma #, F. Yu, P. Li, Y. He. Subgraph Invariant Learning towards Large-scale Graph Node Classification. AAAI 2025. (CCF A)
6. X. Luo, X. Zhang, Y. Xie, X. Tong, W. Yu, H. Chang, F. Ma #, F. Yu. CodeSwap: Symmetrically Face Swapping Based on Prior Codebook. ACM MM 2024. (CCF A, corresponding author)
7. L. Xiong, X. Cheng, J. Tan, X. Wu, X. Li, L. Zhu, F. Ma #, M. Li, H. Xu, Z. Hu. SegTalker: Segmentation-based Talking Face Generation with Mask-guided Local Editing. ACM MM 2024. (CCF A)
8. C. Wang, H. Yu, X. Li, F. Ma #, X. Wang, T. Taleb, V. Leung. Dependency-Aware Microservice Deployment for Edge Computing: A Deep Reinforcement Learning Approach with Network Representation. IEEE Transactions on Mobile Computing, 2024. (CCF A)
9. T. Feng, Y. Xie, X. Guan, J. Song, Z. Liu, F. Ma #, F. Yu. UniSync: A Unified Framework for Audio-Visual Synchronization. IEEE ICME 2025. (corresponding author)
10. X. Luo, J. Cheng, Y. Xie, X. Zhang, T. Feng, Z. Liu, F. Ma #, F. Yu. Object Isolated Attention for Consistent Story Visualization. IEEE ICME 2025. (corresponding author)
11. X. Zhang, S. Huang, X. Luo, Y. Xie, W. Yu, H. Chang, F. Ma #, F. Yu. MuseFace: Text-driven Face Editing via Diffusion-based Mask Generation Approach. IEEE ICME 2025. (corresponding author)
12. H. Hou, P. Zeng, F. Ma #, F. Yu. VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Models. COLING 2025.
13. Y. He, F. Yu, F. Ma #, M. Li, G. Zhou. DEP-SLAM: A Dynamic Environment Perception SLAM System with Large Language Models. ICASSP 2025.
14. Z. Zhong, Y. He, P. Li, F. Yu, F. Ma #. A Language-Driven Navigation Strategy Integrating Semantic Maps and Large Language Models. IROS 2024.
15. Y. Liu, H. Hou, F. Ma #, S. Ni, F. Yu. MLLM-TA: Leveraging Multimodal Large Language Models for Precise Temporal Video Grounding. IEEE Signal Processing Letters, 2024. (corresponding author)
16. W. Ge, Y. Nie, F. Ma #, K. Tang, F. Yu, H. Cai, P. Li. Training-Free Language-Guided Video Summarization via Multi-Grained Saliency Scoring. CVM 2025.
17. Y. Xie, J. Wang, T. Feng, F. Ma #, Y. Li. CCIS-Diff: A Generative Model with Stable Diffusion Prior for Controlled Colonoscopy Image Synthesis. IEEE ISBI 2025.
18. I. Liu, F. Liu, Q. Zhong, F. Ma #, S. Ni. Your blush gives you away: detecting hidden mental states with remote photoplethysmography and thermal imaging. PeerJ Computer Science, 10, e1912, 2024. (SCI)
Patents:
More than 30 Chinese invention patent applications accepted or granted.