Knowledge Service Platform of the Production Engineering Division, Chinese Mechanical Engineering Society

Conference Proceedings


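Proceedings title: MultiMedia Modeling
Conference name: 29th International Conference on MultiMedia Modeling (MMM 2023)
Chinese title (translated): 29th International Conference on MultiMedia Modeling, Volume 1
Conference dates: January 9-12, 2023
Conference location: Bergen, Norway
Publication year: 2023
Holding number: 349210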


Title | Authors | Year
MMM-GCN: Multi-Level Multi-Modal Graph Convolution Network for Video-Based Person Identification | Ziyan Liao; Dening Di; Jingsong Hao; Jiang Zhang; Shulei Zhu; Jun Yin | 2023
Feature Enhancement and Reconstruction for Small Object Detection | Chong-Jian Zhang; Song-Lu Chen; Qi Liu; Zhi-Yong Huang; Feng Chen; Xu-Cheng Yin | 2023
Toward More Accurate Heterogeneous Iris Recognition with Transformers and Capsules | Zhiyong Zhou; Yuanning Liu; Xiaodong Zhu; Shuai Liu; Shaoqiang Zhang; Zhen Liu | 2023
MCANet: Multiscale Cross-Modality Attention Network for Multispectral Pedestrian Detection | Xiaotian Wang; Letian Zhao; Wei Wu; Xi Jin | 2023
Overall-Distinctive GCN for Social Relation Recognition on Videos | Yibo Hu; Chenyu Cao; Fangtao Li; Chenghao Yan; Jinsheng Qi; Bin Wu | 2023
Weakly-Supervised Temporal Action Localization with Regional Similarity Consistency | Haoran Ren; Hao Ren; Hong Lu; Cheng Jin | 2023
A Spatio-Temporal Identity Verification Method for Person-Action Instance Search in Movies | Yanrui Niu; Jingyao Yang; Chao Liang; Baojin Huang; Zhongyuan Wang | 2023
Binary Neural Network for Video Action Recognition | Hongfeng Han; Zhiwu Lu; Ji-Rong Wen | 2023
STN: Stochastic Triplet Neighboring Approach to Self-supervised Denoising from Limited Noisy Images | Bowen Wan; Daming Shi; Yukun Liu | 2023
Fusion-Based Low-Light Image Enhancement | Haodian Wang; Yang Wang; Yang Cao; Zheng-Jun Zha | 2023
Towards Interactive Facial Image Inpainting by Text or Exemplar Image | Ailin Li; Lei Zhao; Zhiwen Zuo; Zhizhong Wang; Wei Xing; Dongming Lu | 2023
Dual-Feature Aggregation Network for No-Reference Image Quality Assessment | Yihua Chen; Zhiyuan Chen; Mengzhu Yu; Zhenjun Tang | 2023
Single Cross-domain Semantic Guidance Network for Multimodal Unsupervised Image Translation | Jiaying Lan; Lianglun Cheng; Guoheng Huang; Chi-Man Pun; Xiaochen Yuan; Shangyu Lai; HongRui Liu; Wing-Kuen Ling | 2023
Towards Captioning an Image Collection from a Combined Scene Graph Representation Approach | Itthisak Phueaksri; Marc A. Kastner; Yasutomo Kawanishi; Takahiro Komamizu; Ichiro Ide | 2023
Health-Oriented Multimodal Food Question Answering | Jianghai Wang; Menghao Hu; Yaguang Song; Xiaoshan Yang | 2023
MM-Locate-News: Multimodal Focus Location Estimation in News | Golsa Tahmasebzadeh; Eric Muller-Budack; Sherzod Hakimov; Ralph Ewerth | 2023
C-GZS: Controllable Person Image Synthesis Based on Group-Supervised Zero-Shot Learning | Jiyun Li; Yuan Gao; Chen Qian; Jiachen Lu; Zhongqin Chen | 2023
DiffMotion: Speech-Driven Gesture Synthesis Using Denoising Diffusion Model | Fan Zhang; Naye Ji; Fuxing Gao; Yongping Li | 2023
TG-Dance: TransGAN-Based Intelligent Dance Generation with Music | Dongjin Huang; Yue Zhang; Zhenyan Li; Jinhua Liu | 2023
Visual Question Generation Under Multi-granularity Cross-Modal Interaction | Zi Chai; Xiaojun Wan; Soyeon Caren Han; Josiah Poon | 2023