Learning Human-Object Interactions by Graph Parsing Neural Networks

Qi, Siyuan; Wang, Wenguan; Jia, Baoxiong; Shen, Jianbing; Zhu, Song-Chun

Computer Science > Computer Vision and Pattern Recognition

arXiv:1808.07962 (cs)

[Submitted on 23 Aug 2018]

Title:Learning Human-Object Interactions by Graph Parsing Neural Networks

Authors:Siyuan Qi, Wenguan Wang, Baoxiong Jia, Jianbing Shen, Song-Chun Zhu

View PDF

Abstract:This paper addresses the task of detecting and recognizing human-object interactions (HOI) in images and videos. We introduce the Graph Parsing Neural Network (GPNN), a framework that incorporates structural knowledge while being differentiable end-to-end. For a given scene, GPNN infers a parse graph that includes i) the HOI graph structure represented by an adjacency matrix, and ii) the node labels. Within a message passing inference framework, GPNN iteratively computes the adjacency matrices and node labels. We extensively evaluate our model on three HOI detection benchmarks on images and videos: HICO-DET, V-COCO, and CAD-120 datasets. Our approach significantly outperforms state-of-art methods, verifying that GPNN is scalable to large datasets and applies to spatial-temporal settings. The code is available at this https URL.

Comments:	This paper is published in ECCV 2018
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1808.07962 [cs.CV]
	(or arXiv:1808.07962v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1808.07962

Submission history

From: Siyuan Qi [view email]
[v1] Thu, 23 Aug 2018 23:04:22 UTC (3,081 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2018-08

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Siyuan Qi
Wenguan Wang
Baoxiong Jia
Jianbing Shen
Song-Chun Zhu

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Human-Object Interactions by Graph Parsing Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Human-Object Interactions by Graph Parsing Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators