马飞
  • 发布时间:2024-04-29
  • 作者:光明实验室
  • 浏览:7132次

马飞 研究员

媒体智能团队负责人

学习经历:

2017-2022 清华大学 信息与通信工程 博士

2013-2017 电子科技大学 通信工程 学士(专业排名:1/363


工作经历:

2024- 光明实验室研究员

2022-2024 华为高级工程师


研究领域:

聚焦多模态内容理解与生成研究,具体包括1)数字人、人与物及场景的交互生成2图像及视频的编辑与生成3)多模态大模型及其在情感智能上的应用等方向。


邮箱:

mafei@gml.ac.cn


代表性成果:

AIGC短剧《嫦娥奔月》, https://mp.weixin.qq.com/s/-MjvqUjeotfluCVocuqHqw


发表论文论著:

1. H. Xue, X. Luo, Z. Hu, X. Zhang, X. Xiang, Y. Dai, J. Liu, Z. Zhang, M. Li, J. Yang, F. Ma #, Z. Wu, C. Yang, Z. Dai, F. Yu. Human Motion Video Generation: A survey. Authorea Preprints, 2024. (通讯).

2. F. Ma #, Y. Yuan, Y. Xie, H. Ren, I. Liu, Y. He, F. Ren, F. Yu, S. Ni. Generative Technology for Human Emotion Recognition: A Scoping Review. Information Fusion, 102753, 2024. (中科院一区Top,CAAI A,影响因子:14.7,一作)

3. Y. Xie, T. Feng, X. Zhang, X. Luo, Z. Guo, W. Yu, H. Chang, F. Ma #, F. Yu. PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis. AAAI 2025. (CCF A,通讯)

4. X. Xiang, Z. Dai, H. Xue, D. Wang, M. Li, Y. Yue, F. Ma #, W. Yu, H. Chang, F. Yu. ReMask-Animate: Refined Character Image Animation Using Mask-Guided Adapters. AAAI 2025. (CCF A,通讯)

5. L. Wang, S. Shi, F. Ma #, F. Yu, P. Li, Y. He. Subgraph Invariant Learning towards Large-scale Graph Node Classification. AAAI 2025. (CCF A)

6. X. Luo, X. Zhang, Y. Xie, X. Tong, W. Yu, H. Chang, F. Ma #, F. Yu. CodeSwap: Symmetrically Face Swapping Based on Prior Codebook. ACM MM 2024. (CCF A,通讯)

7. L. Xiong, X. Cheng, J. Tan, X. Wu, X. Li, L. Zhu, F. Ma #, M. Li, H. Xu, Z. Hu. SegTalker: Segmentation-based Talking Face Generation with Mask-guided Local Editing. ACM MM 2024. (CCF A)

8. C. Wang, H. Yu, X. Li, F. Ma #, X. Wang, T. Taleb, V. Leung. Dependency-Aware Microservice Deployment for Edge Computing: A Deep Reinforcement Learning Approach with Network Representation. IEEE Transactions on Mobile Computing, 2024. (CCF A)

9. T. Feng, Y. Xie, X. Guan, J. Song, Z. Liu, F. Ma #, F. Yu. UniSync: A Unified Framework for Audio-Visual Synchronization. IEEE ICME 2025. (通讯)

10. X. Luo, J. Cheng, Y. Xie, X. Zhang, T. Feng, Z. Liu, F. Ma #, F. Yu. Object Isolated Attention for Consistent Story Visualization. IEEE ICME 2025. (通讯)

10. X. Zhang, S. Huang, X. Luo, Y. Xie, W. Yu, H. Chang, F. Ma #, F. Yu. MuseFace: Text-driven Face Editing via Diffusion-based Mask Generation Approach. IEEE ICME 2025. (通讯)

11. H. Hou, P. Zeng, F. Ma #, F. Yu. VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Models. COLING 2025.

12. Y. He, F. Yu, F. Ma #, M. Li, G. Zhou. DEP-SLAM: A Dynamic Environment Perception SLAM System with Large Language Models. ICASSP 2025.

13. Z. Zhong, Y. He, P. Li, F. Yu, Fei Ma #. A Language-Driven Navigation Strategy Integrating Semantic Maps and Large Language Models. IROS 2024.

14. Y. Liu, H. Hou, F. Ma #, S. Ni, F. Yu. MLLM-TA: Leveraging Multimodal Large Language Models for Precise Temporal Video Grounding. IEEE Signal Processing Letters, 2024. (通讯)

15. W. Ge, Y. Nie, F. Ma #, K. Tang, F. Yu, H. Cai, P. Li. Training-Free Language-Guided Video Summarization via Multi-Grained Saliency Scoring. CVM 2025.

16. Y. Xie, J. Wang, T. Feng, F. Ma #, Y. Li. CCIS-Diff: A Generative Model with Stable Diffusion Prior for Controlled Colonoscopy Image Synthesis. IEEE ISBI 2025.

17. I. Liu, F. Liu, Q. Zhong, F. Ma #, S. Ni. Your blush gives you away: detecting hidden mental states with remote photoplethysmography and thermal imaging. PeerJ Computer Science, 10, e1912, 2024. (SCI)


申请专利:

受理或授权中国发明专利30+项。