Visual Genome is a dataset, a knowledge base, an ongoing effort to connect structured image concepts to language.
108,077 Images
5.4 Million Region Descriptions
1.7 Million Visual Question Answers
3.8 Million Object Instances
2.8 Million Attributes
2.3 Million Relationships
Everything Mapped to Wordnet Synsets
Read our paper.
© Stanford University    Sponsors    Creative Commons   Stanford University