2024-3-11 03:35 /
今日工作总结
1. 了解视频格式基础知识,压制原理和常见动画画面瑕疵。链接更新在了AI动画技术指南中。
2. 阅读论文
(1) AudioCLIP: Extending CLIP to Image, Text and Audio
(2) Learning Transferable Visual Models From Natural Language Supervision
(3) InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
看论文细节和代码实现
3. 收集数据集
1. 了解视频格式基础知识,压制原理和常见动画画面瑕疵。链接更新在了AI动画技术指南中。
2. 阅读论文
(1) AudioCLIP: Extending CLIP to Image, Text and Audio
(2) Learning Transferable Visual Models From Natural Language Supervision
(3) InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
看论文细节和代码实现
3. 收集数据集