SHENGENG TANG   唐申庚

Lecturer at Hefei University of Technology, Ph.D.

School of Computer Science and Information Engineering (SCSIE)

Hefei University of Technology (HFUT)

Email: tangsg@hfut.edu.cn

Link: Teaching Homepage, Google Scholar, CSDN, ZHIHU, GitHub


Biography Reaserches Publications Experience Services Link

Biography

I earned my PhD degree (2022.12) from Hefei University of Technology (HFUT), under the supervision of Prof. Richang Hong (洪日昌) and Prof. Dan Guo (郭丹). Before that, I received the B.E. degree from Hunan Normal University (HUNNU) in 2017. My research interests include multimedia computing and computer vision. Specifically, I focus on Sign Language Translation (SLT) and Sign Language Production (SLP).

If you are interested in visual understanding and cross-media learning, please visit our homepage of the Visual Understanding Team.

Researches  

Connectionist Temporal Modeling of Video and Language: A Joint Model for Translation and Sign Labeling
Dan Guo, Shengeng Tang, and Meng Wang
International Joint Conference on Artificial Intelligence (IJCAI), 2019
[Link] [Paper] [BibTex] [Slides] [Poster]
Graph-Based Multimodal Sequential Embedding for Sign Language Translation
Shengeng Tang, Dan Guo, Richang Hong, and Meng Wang
IEEE Transactions on Multimedia (TMM), 2022
[Link] [Paper] [BibTex]
Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production
Shengeng Tang, Richang Hong, Dan Guo, and Meng Wang
ACM International Conference on Multimedia (ACM MM), 2022
[Link] [Paper] [BibTex] [Poster] [Video]
Emotion-Prior Awareness Network for Emotional Video Captioning
Peipei Song, Dan Guo, Xun Yang, Shengeng Tang, Erkun Yang, and Meng Wang
ACM International Conference on Multimedia (ACM MM), 2023
[Link] [Paper] [BibTex]
Gloss-driven Conditional Diffusion Models for Sign Language Production
Shengeng Tang, Feng Xue, Jingjing Wu, Shuo Wang, and Richang Hong
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2024
[Link] [Paper] [BibTex]

Publications

Conference papers:

  1. Dan Guo, Shengeng Tang, and Meng Wang, "Connectionist Temporal Modeling of Video and Language: a Joint Model for Translation and Sign Labeling", International Joint Conference on Artificial Intelligence (IJCAI), 2019: 751-757. [Link][PDF][BibTeX]
  2. Shengeng Tang, Richang Hong, Dan Guo, and Meng Wang, "Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production", ACM International Conference on Multimedia (ACM MM), 2022: 5630-5638. [Link][PDF][BibTeX]
  3. Peipei Song, Dan Guo, Xun Yang, Shengeng Tang, Erkun Yang, and Meng Wang, "Emotion-Prior Awareness Network for Emotional Video Captioning", ACM International Conference on Multimedia (ACM MM), 2023: 589-600. [Link][PDF][BibTeX]
  4. Jingjing Wu, Yunkai Zhang, Xi Zhou, Shengeng Tang, and Yanyan Wei, "Comprehensive Survey on Person Identification: Queries, Methods, and Datasets", International Conference on Multimedia Retrieval Workshop on Multimedia Object Re-Identification (ICMR-MORE), 2024: 1-6. [Link][PDF][BibTeX]

Journal papers:

  1. Shengeng Tang, Dan Guo, Richang Hong, and Meng Wang, "Graph-Based Multimodal Sequential Embedding for Sign Language Translation", IEEE Transactions on Multimedia (TMM), 2022, 24: 4433-4445. [Link][PDF][BibTeX]
  2. Peipei Song, Dan Guo, Xun Yang, Shengeng Tang, and Meng Wang, "Emotional Video Captioning with Vision-based Emotion Interpretation Network", IEEE Transactions on Image Processing (TIP), 2024, 33: 1122-1135. [Link][PDF][BibTeX]
  3. Shengeng Tang, Feng Xue, Jingjing Wu, Shuo Wang, and Richang Hong, "Gloss-driven Conditional Diffusion Models for Sign Language Production", ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2024. [Link][PDF][BibTeX]
  4. Jingjing Wu, Richang Hong, and Shengeng Tang, "Intermediary-Generated Bridge Network for RGB-D Cross-modal Re-identification", ACM Transactions on Intelligent Systems and Technology (TIST), 2024. [Link][PDF][BibTeX]
  5. 郭丹, 唐申庚, 洪日昌, 汪萌, "手语识别、翻译与生成综述", 计算机科学, 2021, 48(3): 60-70. [Link][PDF][BibTeX]
  6. 唐申庚, 修雪玉, 郭丹, 洪日昌, "基于智能生成技术的手语数字人发展现状与趋势", 人工智能, 2023, 4: 20-31. [Link][PDF][BibTeX]

Monographs:

  1. Dan Guo, Shengeng Tang, Richang Hong, and Meng Wang, "Sign Language Recognition", Multimedia for Accessible Human Computer Interfaces. Springer, Cham, 2021: 23-59. [Link][PDF][BibTeX]

Patents:

  1. 唐申庚; 姚骏; 王旭; 修雪玉; 董晓虎; 谭惟尹; 郭丹; 一种基于多模态语义交互增强的手语生成系统及方法, 2024-11-12, 中国, ZL202410630950.6. (授权)
  2. 郭丹; 唐申庚; 刘祥龙; 洪日昌; 汪萌; 一种基于图卷积的多模态融合手语识别系统及方法, 2023-3-14, 中国, ZL202010049714.7. (授权)
  3. 郭丹; 唐申庚; 刘祥龙; 汪萌; 一种基于多层次语义解析的手语翻译系统及方法, 2023-3-28, 中国, ZL202010103960.6. (授权)
  4. 郭丹; 谷纪豪; 唐申庚; 肖同欢; 曹晨曦; 宋万强; 一种基于深度智能交互的室外视障辅助方法, 2024-2-20, 中国, ZL202210371804.7. (授权)
  5. 郭丹; 曹晨曦; 肖同欢; 唐申庚; 谷纪豪; 黄滨; 一种基于语义分割的择优式方向偏移预警系统和方法, 2024-2-27, 中国, ZL202210374860.6. (授权)
  6. 郭丹; 刘泽宽; 郭义臣; 唐申庚; 武梓龙; 文则涵; 陈颖男; 一种基于深度学习的WiFi手语翻译系统及方法, 2022-7-8, 中国, CN202210805408.0. (实审)
  7. 唐申庚; 肖同欢; 郭丹; 谷纪豪; 曹晨曦; 宋万强; 黄滨; 一种基于图像目标检测和视觉深度估计的碰撞预警方法, 2023-2-27, 中国, CN202310188292.5. (实审)
  8. 唐申庚; 宋万强; 郭丹; 黄滨; 谷纪豪; 肖同欢; 曹晨曦; 一种基于带权无向图的视障人士路线规划方法, 2023-3-6, 中国, CN202310228006.3. (实审)
  9. 宋培培; 杨勋; 徐军军; 唐申庚; 王硕; 一种基于模态间互补性挖掘的多模态情感分析方法, 2024-4-12, 中国, CN202410442083.3. (实审)

Software copyright:

  1. 郭丹; 唐申庚; 陈颖男; 武梓龙; 文则涵; 刘泽宽; 基于关键点估计的人体姿态卡通化系统 V1.0, 2022SR0771364, 原始取得, 全部权利, 2022-06-16.
  2. 唐申庚; 黄滨; 郭丹; 谷纪豪; 盲人避障出行辅助系统 V1.0, 2023SR0517944, 原始取得, 全部权利, 2023-05-05.
  3. 唐申庚; 修雪玉; 郭丹; 董晓虎; 姚骏; 谢伟豪; 跨语言手语翻译系统 V1.0, 2023SR1107827, 原始取得, 全部权利, 2023-09-20.
  4. 唐申庚; 周家豪; 程乐超; 郭丹; 多源数据关联查询与推荐系统 V1.0, 2024SR1773469, 原始取得, 全部权利, 2024-11-13.

Experience  

Professional Services  



© Shengeng Tang 2024