HVLM: Exploring Human-Like Visual Cognition and Language-Memory Network for Visual Dialog

Visual dialog, a visual-language task, enables an AI agent to engage in conversation with humans grounded in a given image. To generate appropriate answers for a series of questions in the dialog, the agent is required to understand the comprehensive visual content of an image and the fine-grained t...

Author:

Sun, Kaili [author]

Guo, Chi

Zhang, Huyin

Li, Yuan

Format:

E-article

Language:

English

Published:

2022

Keywords:

Dual-perspective reasoning

Simple spectral graph convolution network

Visual Dialog

Visual-language understanding

Parent work:

Contained in: Selective oxidation of 1,2-propanediol to lactic acid catalyzed by nanosized Mg(OH)2-supported bimetallic Au–Pd catalysts - Feng, Yonghai ELSEVIER, 2014, an international journal, Amsterdam [u.a.]

Parent work:

volume:59 ; year:2022 ; number:5 ; pages:0

DOI / URN:

10.1016/j.ipm.2022.103008

Catalog ID:

ELV058802215
