OpenAlex API documentation


Topics assigned to works
Concepts are abstract ideas that works are about. OpenAlex indexes about 65k concepts.
You can get a concept from the OpenAlex API like this:
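A minimal Python sketch of such a request. It assumes the API is served at https://api.openalex.org and that a single concept is addressed as /concepts/<ID>. The ID C71924100 is used purely for illustration, and the helper names concept_url and fetch_concept are hypothetical.

```python
import json
import urllib.request

# Assumption: base URL of the OpenAlex API.
API_BASE = "https://api.openalex.org"


def concept_url(concept_id: str) -> str:
    """Build the URL for a single concept (assumed path: /concepts/<ID>)."""
    return f"{API_BASE}/concepts/{concept_id}"


def fetch_concept(concept_id: str) -> dict:
    """Fetch one concept record and decode the JSON response body."""
    with urllib.request.urlopen(concept_url(concept_id), timeout=30) as response:
        return json.load(response)


# Example call (requires network access), with an illustrative ID:
# concept = fetch_concept("C71924100")
# print(concept.get("display_name"))
```

Keeping URL construction separate from the network call makes the request easy to inspect or reuse with another HTTP client.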
The canonical external ID for OpenAlex concepts is the Wikidata ID. Every concept has one, because all OpenAlex concepts are also Wikidata concepts.
Concepts are hierarchical, like a tree. There are 19 root-level concepts, and six layers of descendants branching out from them, containing about 65 thousand concepts all told. This concept tree is a modified version of the one created by MAG. You can view all the concepts and their position in the tree as a spreadsheet here. About 85% of works are tagged with at least one concept (here's the breakdown of concept counts per work).

How concepts are assigned

Each work is tagged with multiple concepts, based on the title, abstract, and the title of its host venue. The tagging is done using an automated classifier that was trained on MAG’s corpus; you can read more about the development and operation of this classifier in Automated concept tagging for OpenAlex, an open index of scholarly articles. You can implement the classifier yourself using our models and code.
A score is available for each concept in a work, showing the classifier's confidence in choosing that concept. However, when assigning a lower-level child concept, we also assign all of its parent concepts, all the way up to the root. Parent concepts added this way may not have been selected by the classifier itself, so some concept assignment scores will be 0.0. The tagger adds concepts to works written in different languages, but it is optimized for English.
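As a rough illustration of how the 0.0 scores might be handled, the sketch below separates concepts with a positive score from those with a score of 0.0. It assumes each entry in a work's concept list is a dictionary with display_name, level, and score keys. The sample record is invented for illustration and is not real API output.

```python
def split_by_score(concepts):
    """Split concept entries into (positive-score, zero-score) lists.

    A missing score is treated as 0.0, which is an assumption of this sketch.
    """
    scored = [c for c in concepts if c.get("score", 0.0) > 0.0]
    zero = [c for c in concepts if c.get("score", 0.0) == 0.0]
    return scored, zero


# Hypothetical concept list for one work (invented values).
sample_concepts = [
    {"display_name": "Machine learning", "level": 1, "score": 0.71},
    {"display_name": "Computer science", "level": 0, "score": 0.0},
    {"display_name": "Artificial intelligence", "level": 1, "score": 0.42},
]

scored, zero = split_by_score(sample_concepts)

# Rank the positively scored concepts by classifier confidence, highest first.
ranked = sorted(scored, key=lambda c: c["score"], reverse=True)
print([c["display_name"] for c in ranked])
print([c["display_name"] for c in zero])
```

A score of 0.0 is consistent with a concept having been added as a parent of a lower-level concept, but the score alone does not prove it, so treat the second list as candidates rather than a definitive set of propagated ancestors.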
Concepts are linked to works via the concepts property. They’re also linked to authors, venues, and institutions via the x_concepts property, and to other concepts via the ancestors and related_concepts properties.
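The ancestors property can be explored with a short sketch like the one below. It assumes a concept record carries an ancestors list whose entries have display_name and level keys. That shape is an assumption made for illustration, and the sample record is invented rather than taken from the API.

```python
from collections import defaultdict


def ancestors_by_level(concept):
    """Group a concept's ancestor names by their level in the hierarchy.

    Returns a plain dict mapping level -> sorted list of display names.
    """
    grouped = defaultdict(list)
    for ancestor in concept.get("ancestors", []):
        grouped[ancestor["level"]].append(ancestor["display_name"])
    return {level: sorted(names) for level, names in sorted(grouped.items())}


# Hypothetical concept record (invented values, assumed field names).
sample_concept = {
    "display_name": "Deep learning",
    "level": 2,
    "ancestors": [
        {"display_name": "Machine learning", "level": 1},
        {"display_name": "Computer science", "level": 0},
        {"display_name": "Artificial intelligence", "level": 1},
    ],
}

levels = ancestors_by_level(sample_concept)
for level, names in levels.items():
    print(level, names)
```

Grouping by level rather than building a single path allows for a concept that has more than one ancestor at the same level.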

What's next

Learn more about what you can do with concepts: