Yaosi Hu, Chong Luo, Zhenzhong Chen. A benchmark for controllable text-image-to-video generation[J]. IEEE Transactions on Multimedia (TMM), 2024, 26: 1706-1719.
Yaosi Hu, Chong Luo, Zhenzhong Chen. Make it move: Controllable image-to-video generation with text descriptions[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022: 18198-18207.
Yaosi Hu, Dacheng Yin, Yuwang Wang, Zhenzhong Chen, Chong Luo. Decomposing style, content, and motion for videos[J]. Journal of Visual Communication and Image Representation (JVCIR), 2022, 89:103686.
Yaosi Hu, Zhenzhong Chen, Zheng-Jun Zha, Feng Wu. Hierarchical global-local temporal modeling for video captioning[C]// Proceedings of the 27th ACM International Conference on Multimedia (ACM MM), 2019: 774-783.
Yaosi Hu, Yingxue Zhang, Zizheng Liu, Zhenzhong Chen, Shan Liu. Subjective Study of Perceptual Quality for Micro-Video Applications[C]// IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), 2020: 229-232.
Leitian Tao, Li Mi, Nannan Li, Xianhang Cheng, Yaosi Hu, Zhenzhong Chen. Predicate correlation learning for scene graph generation[J]. IEEE Transactions on Image Processing (TIP), 2022, 31: 4173-4185.
J. Gutierrez et al. Subjective evaluation of visual quality and simulator sickness of short 360° videos: ITU-T Rec. P.919[J]. IEEE Transactions on Multimedia (TMM), 2022, 24: 3087-3100.
Yiran Tao, Yaosi Hu, Zhenzhong Chen. Memory-guided representation matching for unsupervised video anomaly detection[J]. Journal of Visual Communication and Image Representation (JVCIR), 2024, 101: 104185.
Wanping Ouyang, Yaosi Hu, Yangjun Ou, Zhenzhong Chen. Multiple visual relationship forecasting and arrangement in videos[J]. Neurocomputing, 2023, 541: 126274.
Huiying Shi, Yaosi Hu, Yingxue Zhang, Zhenzhong Chen. A lightweight no-reference video quality assessment method[C]// IEEE International Conference on Visual Communications and Image Processing (VCIP), 2023: 1-5.
Mengying Liu, Jose Joskowicz, Rafael Sotelo, Yaosi Hu, Zhenzhong Chen, Lei Yang. Subjective quality assessment of one-to-one video-telephony services[C]// IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB), 2022: 1-6.
Wei Wu, Yingxue Zhang, Yaosi Hu, Zhenzhong Chen, Shan Liu. Video quality assessment based on quality aggregation networks[C]// IEEE International Conference on Visual Communications and Image Processing (VCIP), 2022: 1-5.
Yiran Tao, Yaosi Hu, Zhenzhong Chen. Learn to look around: Deep reinforcement learning agent for video saliency prediction[C]// IEEE International Conference on Visual Communications and Image Processing (VCIP), 2021: 1-5.
Cong Zou, Xuchen Wang, Yaosi Hu, Zhenzhong Chen, Shan Liu. MAPS: Joint multimodal attention and POS sequence generation for video captioning[C]// IEEE International Conference on Visual Communications and Image Processing (VCIP), 2021: 1-5.
Ran Wei, Li Mi, Yaosi Hu, Zhenzhong Chen. Exploiting the local temporal information for video captioning[J]. Journal of Visual Communication and Image Representation (JVCIR), 2020, 67: 102751.
Jiayi Xie, Yaochen Zhu, Zhibin Zhang, Jian Peng, Jing Yi, Yaosi Hu, Hongyi Liu, Zhenzhong Chen. A multimodal variational encoder-decoder framework for microvideo popularity prediction[C]// Proceedings of The Web Conference (WWW), 2020: 2542-2548.
Di Liu, Yaosi Hu, Kao Zhang, Zhenzhong Chen. Two-stream refinement network for RGB-D saliency detection[C]// IEEE International Conference on Image Processing (ICIP), 2019: 3925-3929.