StoryMovie: A Dataset for Semantic Alignment of Visual Stories with Movie Scripts and Subtitles
I’m happy to share our latest research on visual storytelling. This work tackles a critical limitation in current storytelling models: […]
Entity Re-identification in Visual Storytelling via Contrastive Reinforcement Learning
I’m happy to share my latest research on visual storytelling. This work addresses a challenge in LLMs: teaching models to […]
I’m happy to share my latest research on visual storytelling. This work addresses a critical challenge in AI systems: generating
GroundCap: A Visually Grounded Image Captioning Dataset
I’m excited to share my latest research on visually grounded image captioning. This work introduces GroundCap, a novel dataset that […]
Story Generation from Visual Inputs: A Comprehensive Survey of Techniques and Challenges
I’m happy to share my survey on visual story generation. This work provides an extensive analysis of methodologies, datasets, and […]
Transfer Learning for Video Classification: Video Swin Transformer Research
I’m happy to share my recent research on video classification. This work explores how well the Video Swin Transformer (VST) […]