The 2025 CivicCon Awards recognized 15 award winners in 10 categories for their work to make our community more beautiful, ...
Abstract: Visual Dialog is a typical AI-agent task on images, in which the agent interprets information from heterogeneous modalities and provides the correct answer. In this area, most approaches are ...