There's a lot of useful data inside a work. When you use the API to get a single work or lists of works, this is what's returned.
abstract_inverted_index
Object: The abstract of the work, as an inverted index, which encodes information about the abstract's words and their positions within the text. Like Microsoft Academic Graph, OpenAlex doesn't include plaintext abstracts due to legal constraints.
Newer works are more likely to have an abstract inverted index. For example, over 60% of works published in 2022 have abstract data, compared to about 45% of works published before 2000.
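Because the abstract arrives as an inverted index rather than plaintext, you may want to reconstruct the readable text yourself. Here is a minimal sketch using Python and the requests library (not an official client; the work ID is just the sample ID used elsewhere in these docs):

```python
import requests

# A minimal sketch: rebuild a plaintext abstract from the inverted index of one work.
work = requests.get("https://api.openalex.org/works/W2741809807").json()

inverted = work.get("abstract_inverted_index")
if inverted:
    # Map each word position back to its word, then join in positional order.
    positions = {pos: word for word, poss in inverted.items() for pos in poss}
    abstract = " ".join(positions[i] for i in sorted(positions))
    print(abstract[:300])
```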
alternate_host_venues
(deprecated) The host_venue
and alternate_host_venues
properties have been deprecated in favor of primary_location
and locations
. The attributes host_venue
and alternate_host_venues
are no longer available in the Work object, and trying to access them in filters or group-bys will return an error.
authorships
List: List of Authorship
objects, each representing an author and their institution. Limited to the first 100 authors to maintain API performance.
For more information, see the Authorship object page.
apc_list
Object: Information about this work's APC (article processing charge). The object contains:
value
: Integer
currency
: String
provenance
: String — the source of this data. Currently the only value is “doaj” (DOAJ)
value_usd
: Integer — the APC converted into USD
This value is the APC list price–the price as listed by the journal’s publisher. That’s not always the price actually paid, because publishers may offer various discounts to authors. Unfortunately we don’t always know this discounted price, but when we do you can find it in apc_paid
.
Currently our only source for this data is DOAJ, and so doaj
is the only value for apc_list.provenance
, but we’ll add other sources over time.
We currently don’t have information on the list price for hybrid journals (toll-access journals that also provide an open-access option), but we will add this at some point. We do have apc_paid
information for hybrid OA works occasionally.
You can use this attribute to find works published in Diamond open access journals by looking at works where apc_list.value
is zero. See open_access.oa_status
for more info.
apc_paid
Object: Information about the paid APC (article processing charge) for this work. The object contains:
value
: Integer
currency
: String
provenance
: String — currently either openapc
or doaj
, but more will be added; see below for details.
value_usd
: Integer — the APC converted into USD
You can find the listed APC price (when we know it) for a given work using apc_list
. However, authors don’t always pay the listed price; often they get a discounted price from publishers. So it’s useful to know the APC actually paid by authors, as distinct from the list price. This is our effort to provide this.
Our best source for the actually paid price is the OpenAPC project. Where available, we use that data, and so apc_paid.provenance
is openapc
. Where OpenAPC data is unavailable (and unfortunately this is common) we make our best guess by assuming the author paid the APC list price, and apc_paid.provenance will be set to wherever we got the list price from.
best_oa_location
Object: A Location
object with the best available open access location for this work.
We score open locations to determine which is best using these factors:
Must have is_oa: true
type: "publisher" is better than "repository".
version: "publishedVersion" is better than "acceptedVersion", which is better than "submittedVersion".
pdf_url: A location with a direct PDF link is better than one without.
repository rankings: Some major repositories like PubMed Central and arXiv are ranked above others.
biblio
Object: Old-timey bibliographic info for this work. This is mostly useful only in citation/reference contexts. These are all strings because sometimes you'll get fun values like "Spring" and "Inside cover."
volume
(String)
issue
(String)
first_page
(String)
last_page
(String)
citation_normalized_percentile
Object: The percentile of this work's citation count normalized by work type, publication year, and subfield. This field represents the same information as the FWCI expressed as a percentile. Learn more in the reference article: Field Weighted Citation Impact (FWCI).
cited_by_api_url
String: A URL that uses the cites
filter to display a list of works that cite this work. This is a way to expand cited_by_count
into an actual list of works.
cited_by_count
Integer: The number of citations to this work. These are the times that other works have cited this work: Other works ➞ This work.
concepts
List: List of dehydrated Concept
objects.
Each Concept
object in the list also has one additional property:
score
(Float): The strength of the connection between the work and this concept (higher is stronger). This number is produced by AWS Sagemaker, in the last layer of the machine learning model that assigns concepts.
Concepts with a score of at least 0.3 are assigned to the work. However, ancestors of an assigned concept are also added to the work, even if the ancestor scores are below 0.3.
Because ancestor concepts are assigned to works, you may see concepts in works with very low scores, even some zero scores.
corresponding_author_ids
List: OpenAlex IDs of any authors for which authorships.is_corresponding is true
.
corresponding_institution_ids
List: OpenAlex IDs of any institutions found within an authorship
for which authorships.is_corresponding is true
.
countries_distinct_count
Integer: Number of distinct country_codes
among the authorships
for this work.
counts_by_year
List: Works.cited_by_count
for each of the last ten years, binned by year. To put it another way: each year, you can see how many times this work was cited.
Citations more than ten years old aren't included. Years with zero citations are omitted, so you will need to add those back in if you need them.
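If you need a complete year-by-year series, you can pad the missing zero-citation years yourself. A minimal Python sketch (the function name and the example window are illustrative, not part of the API):

```python
# Fill in missing zero-citation years, given the counts_by_year list from a Work object.
def fill_counts_by_year(counts_by_year, start_year, end_year):
    by_year = {row["year"]: row["cited_by_count"] for row in counts_by_year}
    return [
        {"year": year, "cited_by_count": by_year.get(year, 0)}
        for year in range(start_year, end_year + 1)
    ]

# Example: a work cited only in 2021 and 2023, viewed over a 2020-2023 window.
print(fill_counts_by_year(
    [{"year": 2021, "cited_by_count": 3}, {"year": 2023, "cited_by_count": 1}],
    2020, 2023,
))
```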
created_date
String: The date this Work
object was created in the OpenAlex dataset, expressed as an ISO 8601 date string.
display_name
String: Exactly the same as Work.title
. It's useful for Work
s to include a display_name
property, since all the other entities have one.
doi
String: The DOI for the work. This is the Canonical External ID for works.
Occasionally, a work has more than one DOI--for example, there might be one DOI for a preprint version hosted on bioRxiv, and another DOI for the published version. However, this field always has just one DOI, the DOI for the published work.
fulltext_origin
String: If a work's full text is searchable in OpenAlex (has_fulltext
is true
), this tells you how we got the text. This will be one of:
pdf
: We used Grobid to get the text from an open-access PDF.
ngrams
: Full text search is enabled using N-grams obtained from the Internet Archive.
This attribute is only available for works with has_fulltext:true
.
fwci
Float: The Field-weighted Citation Impact (FWCI), calculated for a work as the ratio of citations received to citations expected in the year of publication and the three following years. Learn more in the reference article: Field Weighted Citation Impact (FWCI).
grants
List: List of grant objects, which include the Funder
and the award ID, if available. Our grants data comes from Crossref, and is currently fairly limited.
has_fulltext
Boolean: Set to true
if the work's full text is searchable in OpenAlex. This does not necessarily mean that the full text is available to you, dear reader; rather, it means that we have indexed the full text and can use it to help power searches. If you are trying to find the full text for yourself, try looking in open_access.oa_url
.
We get access to the full text in one of two ways: either using an open-access PDF, or using N-grams obtained from the Internet Archive. You can learn where a work's full text came from at fulltext_origin
.
host_venue
(deprecated) The host_venue
and alternate_host_venues
properties have been deprecated in favor of primary_location
and locations
. The attributes host_venue
and alternate_host_venues
are no longer available in the Work object, and trying to access them in filters or group-bys will return an error.
id
String: The OpenAlex ID for this work.
ids
Object: All the external identifiers that we know about for this work. IDs are expressed as URIs whenever possible. Possible ID types:
mag
(Integer: the Microsoft Academic Graph ID)
openalex
(String: The OpenAlex ID. Same as Work.id
)
pmid
(String: The Pubmed Identifier)
pmcid
(String: the Pubmed Central identifier)
Most works are missing one or more ID types (either because we don't know the ID, or because it was never assigned). Keys for null
IDs are not displayed.
indexed_in
List: The sources this work is indexed in. Possible values: arxiv
, crossref
, doaj
, pubmed
.
institutions_distinct_count
Integer: Number of distinct institutions
among the authorships
for this work.
is_paratext
Boolean: True if we think this work is paratext.
In our context, paratext is stuff that's in a scholarly venue (like a journal) but is about the venue rather than a scholarly work properly speaking. Some examples and nonexamples:
yep it's paratext: front cover, back cover, table of contents, editorial board listing, issue information, masthead.
no, not paratext: research paper, dataset, letters to the editor, figures
Turns out there is a lot of paratext in registries like Crossref. That's not a bad thing... but we've found that it's good to have a way to filter it out.
We determine is_paratext
algorithmically using title heuristics.
is_retracted
Boolean: True if we know this work has been retracted.
We identify works that have been retracted using the public Retraction Watch database, a public resource made possible by a partnership between Crossref and The Center for Scientific Integrity.
keywords
List of objects: Short phrases identified based on works' Topics. For background on how Keywords are identified, see the Keywords page at OpenAlex help pages.
The score for each keyword represents the similarity score of that keyword to the title and abstract text of the work.
We provide up to 5 keywords per work, for all keywords with scores above a certain threshold.
language
String: The language of the work in ISO 639-1 format. The language is automatically detected using the information we have about the work. We use the langdetect software library on the words in the work's abstract, or the title if we do not have the abstract. The source code for this procedure is here. Keep in mind that this method is not perfect, and that in some cases the language of the title or abstract could be different from the body of the work.
A few things to keep in mind about this:
We don't always assign a language if we do not have enough words available to accurately guess.
We report the language of the metadata, not the full text. For example, if a work is in French, but the title and abstract are in English, we report the language as English.
In some cases, abstracts are in two different languages. Unfortunately, when this happens, what we report will not be accurate.
license
String: The license applied to this work at this host. Most toll-access works don't have an explicit license (they're under "all rights reserved" copyright), so this field generally has content only if is_oa
is true
.
locations
List: A list of Location
objects describing all unique places where this work lives.
locations_count
Integer: Number of locations
for this work.
mesh
List: List of MeSH tag objects. Only works found in PubMed have MeSH tags; for all other works, this is an empty list.
open_access
Object: Information about the access status of this work, as an OpenAccess
object.
primary_location
Object: A Location
object with the primary location of this work.
The primary_location
is where you can find the best (closest to the version of record) copy of this work. For a peer-reviewed journal article, this would be a full text published version, hosted by the publisher at the article's DOI URL.
primary_topic
Object
The top ranked Topic
for this work. This is the same as the first item in Work.topics
.
publication_date
String: The day when this work was published, formatted as an ISO 8601 date.
Where different publication dates exist, we usually select the earliest available date of electronic publication.
This date applies to the version found at Work.url
. The other versions, found in Work.locations
, may have been published at different (earlier) dates.
publication_year
Integer: The year this work was published.
This year applies to the version found at Work.url
. The other versions, found in Work.locations
, may have been published in different (earlier) years.
referenced_works
List: OpenAlex IDs for works that this work cites. These are citations that go from this work out to another work: This work ➞ Other works.
related_works
List: OpenAlex IDs for works related to this work. Related works are computed algorithmically; the algorithm finds recent papers with the most concepts in common with the current paper.
sustainable_development_goals
List: List of objects
The United Nations' 17 Sustainable Development Goals are a collection of goals at the heart of a global "shared blueprint for peace and prosperity for people and the planet." We use a machine learning model to tag works with their relevance to these goals based on our OpenAlex SDG Classifier, an mBERT machine learning model developed by the Aurora Universities Network. The score
represents the model's predicted probability of the work's relevance for a particular goal.
We display all of the SDGs with a prediction score higher than 0.4.
topics
List: List of objects
The top ranked Topics
for this work. We provide up to 3 topics per work.
title
String: The title of this work.
This is exactly the same as Work.display_name
. We include both attributes with the same information because we want all entities to have a display_name
, but there's a longstanding tradition of calling this the "title," so we figured you'll be expecting works to have it as a property.
type
String: The type of the work.
You can see all of the different types along with their counts in the OpenAlex API here: https://api.openalex.org/works?group_by=type
.
Most works are type article
. This includes what was formerly (and currently in type_crossref
) labeled as journal-article
, proceedings-article
, and posted-content
. We consider all of these to be article
type works, and the distinctions between them to be more about where they are published or hosted:
Journal articles will have a primary_location.source.type
of journal
Conference proceedings will have a primary_location.source.type
of conference
Preprints or "posted content" will have a primary_location.version
of submittedVersion
(Note that distinguishing between journals and conferences is a hard problem, one we often get wrong. We are working on improving this, but we also point out that the two have a lot of overlap in terms of their roles as hosts of research publications.)
Works that are hosted primarily on a preprint server, or that are identified specifically as preprints in the metadata we receive, are assigned the type preprint
rather than article
.
Works that represent stuff that is about the venue (such as a journal)—rather than a scholarly work properly speaking—have type paratext
. These include things like front-covers, back-covers, tables of contents, and the journal itself (e.g., https://openalex.org/W4232230324
).
We also have types for letter
, editorial
, erratum
(corrections), libguides
, supplementary-materials
, and review
(currently, articles that come from journals that exclusively publish review articles). Coverage is low on these but will improve.
Other work types follow the Crossref "type" controlled vocabulary—see type_crossref
.
type_crossref
String: Legacy type information, using Crossref's "type" controlled vocabulary.
These are the work types that we used to use, before switching to our current system (see type
).
You can see all possible values of Crossref's "type" controlled vocabulary via the Crossref api here: https://api.crossref.org/types
.
Where possible, we just pass along Crossref's type
value for each work. When that's impossible (eg the work isn't in Crossref), we do our best to figure out the type
ourselves.
updated_date
String: The last time anything in this Work
object changed, expressed as an ISO 8601 date string (in UTC). This date is updated for any change at all, including increases in various counts.
OpenAccess
objectThe OpenAccess
object describes access options for a given work. It's only found as part of the Work
object.
any_repository_has_fulltext
Boolean: True
if any of this work's locations
has location.is_oa=true
and location.source.type=repository
.
Use case: researchers want to track Green OA, using a definition of "any repository hosts this." OpenAlex's definition (as used in oa_status
) doesn't support this, because as soon as there's a publisher-hosted copy (bronze, hybrid, or gold), oa_status is set to that publisher-hosted status.
So there's a lot of repository-hosted content that the oa_status
can't tell you about. Our State of OA paper calls this "shadowed Green." This feature makes it possible to track shadowed Green.
is_oa
Boolean: True
if this work is Open Access (OA).
There are many ways to define OA. OpenAlex uses a broad definition: having a URL where you can read the fulltext of this work without needing to pay money or log in. You can use the locations
and oa_status
fields to narrow your results further, accommodating any definition of OA you like.
oa_status
String: The Open Access (OA) status of this work. Possible values are:
gold
: Published in a fully OA journal.
green
: Toll-access on the publisher landing page, but there is a free copy in an OA repository.
hybrid
: Free under an open license in a toll-access journal.
bronze
: Free to read on the publisher landing page, but without any identifiable license.
closed
: All other articles.
oa_url
String: The best Open Access (OA) URL for this work.
Although there are many ways to define OA, in this context an OA URL is one where you can read the fulltext of this work without needing to pay money or log in. The "best" such URL is the one closest to the version of record.
This URL might be a direct link to a PDF, or it might be to a landing page that links to the free PDF.
Query the OpenAlex dataset using the magic of The Internet
If you open these examples in a web browser, they will look much better if you have a browser plug-in such as JSONVue installed.
You can use the institutions endpoint to learn about universities and research centers. OpenAlex has a powerful search feature that searches across 108,000 institutions.
Let's use it to search for Stanford University:
Find Stanford University
https://api.openalex.org/institutions?search=stanford
Our first result looks correct (yeah!):
We can use the ID https://openalex.org/I97018004
in that result to find out more.
Show works where at least one author is associated with Stanford University
https://api.openalex.org/works?filter=institutions.id:https://openalex.org/I97018004
This is just one of the 50+ ways that you can filter works!
Right now the list shows records for all years. Let's narrow it down to works published between 2010 and 2020, and sort from newest to oldest.
Show works with publication years 2010 to 2020, associated with Stanford University https://api.openalex.org/works?filter=institutions.id:https://openalex.org/I97018004,publication_year:2010-2020&sort=publication_date:desc
Finally, you can group the results by publication year to get the final answer: the number of articles produced by Stanford, by year, from 2010 to 2020. There are more than 30 ways to group records in OpenAlex, including by publisher, journal, and open access status.
That gives a result like this:
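As a rough illustration, here is how the whole walkthrough might look as a single Python script using the requests library (an assumption; any HTTP client works):

```python
import requests

# A sketch of the full query from this walkthrough: Stanford-affiliated works,
# published 2010-2020, grouped by publication year.
resp = requests.get(
    "https://api.openalex.org/works",
    params={
        "filter": "institutions.id:https://openalex.org/I97018004,"
                  "publication_year:2010-2020",
        "group_by": "publication_year",
    },
)
for group in sorted(resp.json()["group_by"], key=lambda g: g["key"]):
    print(group["key"], group["count"])
```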
Jump into an area of OpenAlex that interests you:
And check out our tutorials page for some hands-on examples!
OpenAlex is:
Big — We have about twice the coverage of the other services, and have significantly better coverage of non-English works and works from the Global South.
Easy — Our service is fast, modern, and well-documented.
Open — Our complete dataset is free under the CC0 license, which allows for transparency and reuse.
Priem, J., Piwowar, H., & Orr, R. (2022). OpenAlex: A fully-open index of scholarly works, authors, venues, institutions, and concepts. ArXiv. https://arxiv.org/abs/2205.01833
Learn more about the OpenAlex entities:
The Location
object describes the location of a given work. It's only found as part of the Work
object.
There are three places in the Work
object where you can find locations:
is_accepted
is_oa
Boolean: True
if an Open Access (OA) version of this work is available at this location.
is_published
landing_page_url
String: The landing page URL for this location.
The concept of a source is meant to capture a certain social relationship between the host organization and a version of a work. When an organization puts the work on the internet, there is an understanding that they have, at some level, endorsed the work. This level varies, and can be very different depending on the source!
pdf_url
String: A URL where you can find this location as a PDF.
publishedVersion
: The document’s version of record. This is the most authoritative version.
acceptedVersion
: The document after having completed peer review and being officially accepted for publication. It will lack publisher formatting, but the content should be interchangeable with that of the publishedVersion
.
submittedVersion
: The document as submitted to the publisher by the authors, but before peer review. Its content may differ significantly from that of the accepted article.
It's easy to get a work from the API with: /works/<entity_id>
Here's an example:
You can look up works using external IDs such as a DOI:
You can use the full ID or a shorter Uniform Resource Name (URN) format like so:
Available external IDs for works are:
You must make sure that the ID(s) you supply are valid and correct. If an ID you request is incorrect, you will get no result. If you request an illegal ID—such as one containing a ,
or &
, the query will fail and you will get a 403 error.
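A hedged sketch of a single-work lookup in Python, handling the failure cases described above (requests is an assumption; the DOI is the sample one used elsewhere in these docs, and the "no result" status may differ from the 404 shown here):

```python
import requests

resp = requests.get(
    "https://api.openalex.org/works/https://doi.org/10.7717/peerj.4375"
)
if resp.status_code == 404:
    print("No work found for that ID")
elif resp.status_code == 403:
    print("Malformed ID (for example, one containing ',' or '&')")
else:
    resp.raise_for_status()
    print(resp.json()["display_name"])
```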
affiliations
List: List of objects
author
author_position
String: A summarized description of this author's position in the work's author list. Possible values are first
, middle
, and last
.
It's not strictly necessary, because author order is already implicitly recorded by the list order of Authorship
objects; however it's useful in some contexts to have this as a categorical value.
countries
List: The country or countries for this author.
We determine the countries using a combination of matched institutions and parsing of the raw affiliation strings, so we can have this information for some authors even if we do not have a specific institutional affiliation.
institutions
is_corresponding
Boolean: If true
, this is a corresponding author for this work.
This is a new feature, and the information may be missing for many works. We are working on this, and coverage will improve soon.
raw_affiliation_strings
List: This author's affiliation as it originally came to us (on a webpage or in an API), as a list of raw unformatted strings. If there is only one affiliation, it will be a list of length one.
raw_author_name
String: This author's name as it originally came to us (on a webpage or in an API), as a raw unformatted string.
Let's use the OpenAlex API to get journal articles and books published by authors at Stanford University. We'll limit our search to articles published between 2010 and 2020. Since OpenAlex is free and openly available, these examples work without any login or account creation.
The works endpoint contains over 240 million articles, books, and theses. We can filter to show works associated with Stanford.
There you have it! This same technique can be applied to hundreds of questions around scholarly data. The data you received is under a CC0 license, so not only did you access it easily, you can share it freely!
OpenAlex is a fully open catalog of the global research system. It's named after the ancient Library of Alexandria and made by the nonprofit OurResearch.
This is the technical documentation for OpenAlex, including the API and the data snapshot. Here, you can learn how to set up your code to access OpenAlex's data. If you want to explore the data as a human, you may be more interested in the OpenAlex web interface.
The OpenAlex dataset describes scholarly entities and how those entities are connected to each other. Types of entities include works, authors, sources, institutions, topics, publishers, and funders.
Together, these make a huge web (or more technically, heterogeneous directed graph) of hundreds of millions of entities and billions of connections between them all.
Learn more at our general help center article:
We offer a fast, modern REST API to get OpenAlex data programmatically. It's free and requires no authentication. The limit for API calls is 100,000 requests per user per day. For best performance, add your email to all API requests, like mailto=example@domain.com.
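For example, a polite request in Python might look like this (requests is an assumption; the email address is the placeholder from the text, so substitute your own):

```python
import requests

# Add your email to the request so OpenAlex can contact you if needed.
resp = requests.get(
    "https://api.openalex.org/works",
    params={"search": "dna", "mailto": "example@domain.com"},
)
print(resp.json()["meta"]["count"], "works matched")
```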
There is also a complete database snapshot available to download.
The API has a limit of 100,000 calls per day, and the snapshot is updated monthly. If you need a higher limit, or more frequent updates, please look into OpenAlex Premium.
The web interface for OpenAlex, built directly on top of the API, is the quickest and easiest way to explore the data.
OpenAlex offers an open replacement for industry-standard scientific knowledge bases like Elsevier's Scopus and Clarivate's Web of Science. Compared to these paywalled services, OpenAlex offers significant advantages in terms of inclusivity, affordability, and availability.
Many people and organizations have already found great value using OpenAlex. Have a look at the testimonials to hear what they've said!
For tech support and bug reports, please visit our . You can also join the , and follow us on and .
If you use OpenAlex in research, please cite :
The OpenAlex dataset describes scholarly entities and how those entities are connected to each other. Together, these make a huge web (or more technically, heterogeneous directed graph) of hundreds of millions of entities and billions of connections between them all.
Works: Scholarly documents like journal articles, books, datasets, and theses
Authors: People who create works
Sources: Where works are hosted (such as journals, conferences, and repositories)
Institutions: Universities and other organizations to which authors claim affiliations
Topics: Topics assigned to works
Publishers: Companies and organizations that distribute works
Funders: Organizations that fund research
Geo: Where things are in the world
Locations are meant to capture the way that a work exists in different versions. So, for example, a work may have a version that has been peer-reviewed and published in a journal (the version of record). This would be one of the work's locations. It may have another version available on a preprint server like arXiv—this version having been posted before it was accepted for publication. This would be another one of the work's locations.
Below is an example of a work in OpenAlex () that has multiple locations with different properties. The version of record, published in a peer-reviewed journal, is listed first, and is not open-access. The second location is a university repository, where one can find an open-access copy of the published version of the work. Other locations are listed below.
Locations are meant to cover anywhere that a given work can be found. This can include journals, proceedings, institutional repositories, and subject-area repositories like arXiv and bioRxiv. If you are only interested in a certain one of these (like journals), you can use a filter to specify the locations.source.type.
primary_location: The best (closest to the version of record) copy of this work.
best_oa_location: The best available open access location of this work.
locations: A list of all of the locations where this work lives. This will include the two locations above if available, and can also include other locations.
Boolean: true if this location's version is either acceptedVersion or publishedVersion; otherwise false.
There are many ways to define OA. OpenAlex uses a broad definition: having a URL where you can read the fulltext of this work without needing to pay money or log in.
Boolean: true if this location's version is publishedVersion; otherwise false.
String: The location's publishing license. This can be a license such as cc0 or cc-by, a publisher-specific license, or null which means we are not able to determine a license for this location.
Object: Information about the source of this location, as a dehydrated Source object.
String: The version of the work. Possible values are:
Get the work with the OpenAlex ID W2741809807:
That will return a Work object, describing everything OpenAlex knows about the work with that ID.
You can make up to 50 of these queries at once by requesting a list of entities and filtering on IDs ().
Get the work with this DOI: https://doi.org/10.7717/peerj.4375
:
Get the work with PubMed ID: https://pubmed.ncbi.nlm.nih.gov/14907713
:
You can use select
to limit the fields that are returned in a work object. More details are .
Display only the id
and display_name
for a work object
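A small Python sketch of that select query (requests is an assumption; the work ID is the sample used elsewhere in these docs):

```python
import requests

# Return only the id and display_name fields for a single work.
resp = requests.get(
    "https://api.openalex.org/works/W2741809807",
    params={"select": "id,display_name"},
)
print(resp.json())
```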
The Authorship object represents a single author and her institutional affiliations in the context of a given work. It is only found as part of a Work
object, in the authorships property.
Each institutional affiliation that this author has claimed will be listed here: the raw affiliation string that we found, along with the OpenAlex ID or IDs that we matched it to.
This information will be redundant with institutions below, but is useful if you need to know about what we used to match institutions.
Object: An author of this work, as a dehydrated Author object.
Note that, sometimes, we assign ORCID using author disambiguation, so the ORCID we associate with an author was not necessarily included with this work.
List: The institutional affiliations this author claimed in the context of this work, as dehydrated Institution objects.
Get all of the works in OpenAlex
You can page through works and change the default number of results returned with the page
and per-page
parameters:
Get a second page of results with 50 results per page
You can sort results with the sort
parameter:
Sort works by publication year
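A quick Python sketch combining the paging and sorting parameters described above (the client library is an assumption):

```python
import requests

# Get the second page of results, 50 per page, newest publication year first.
resp = requests.get(
    "https://api.openalex.org/works",
    params={"page": 2, "per-page": 50, "sort": "publication_year:desc"},
)
print(len(resp.json()["results"]), "results on page 2")
```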
Continue on to learn how you can filter and search lists of works.
You can use sample
to get a random batch of works. Read more about sampling and how to add a seed
value .
Get 20 random works
You can use select
to limit the fields that are returned in a list of works. More details are .
Display only the id
and display_name
within works results
DOI
doi
Microsoft Academic Graph (MAG)
mag
PubMed ID (PMID)
pmid
PubMed Central ID (PMCID)
pmcid
Journal articles, books, datasets, and theses
Works are scholarly documents like journal articles, books, datasets, and theses. OpenAlex indexes over 240M works, with about 50,000 added daily. You can access a work in the OpenAlex API like this:
Get a list of OpenAlex works:
https://api.openalex.org/works
That will return a list of Work
objects, describing everything OpenAlex knows about each work. We collect new works from many sources, including Crossref, PubMed, and institutional and discipline-specific repositories (e.g., arXiv). Many older works come from the now-defunct Microsoft Academic Graph (MAG).
Works are linked to other works via the referenced_works
(outgoing citations), cited_by_api_url
(incoming citations), and related_works
properties.
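A short Python sketch of following those links for one work (requests is an assumption; the work ID is the sample used elsewhere in these docs):

```python
import requests

# Outgoing references come from referenced_works; incoming citations from cited_by_api_url.
work = requests.get("https://api.openalex.org/works/W2741809807").json()

outgoing_ids = work["referenced_works"]                       # OpenAlex IDs
incoming = requests.get(work["cited_by_api_url"]).json()["results"]

print(len(outgoing_ids), "references;", len(incoming), "citing works on page one")
```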
Learn more about what you can do with works:
People who create works
Authors are people who create works. You can get an author from the API like this:
Get a list of OpenAlex authors:
https://api.openalex.org/authors
The Canonical External ID for authors is ORCID; only a small percentage of authors have one, but the percentage is higher for more recent works.
Our information about authors comes from MAG, Crossref, PubMed, ORCID, and publisher websites, among other sources. To learn more about how we combine this information to get OpenAlex Authors, see Author Disambiguation.
Authors are linked to works via the works.authorships
property.
Learn more about what you can do with authors:
N-grams are groups of sequential words that occur in the text of a Work.
Note that while n-grams are derived from the fulltext of a Work, the presence of n-grams for a given Work doesn't imply that the fulltext is available to you, the reader. It only means the fulltext was available to Internet Archive for indexing. Work.open_access
is the place to go for information on public fulltext availability.
The n-gram API endpoint is not currently in service. The n-grams are still used on our backend to help power fulltext search. If you have any questions about this, please submit a support ticket.
You can see which works we have full-text for using the has_fulltext
filter. This does not necessarily mean that the full text is available to you, dear reader; rather, it means that we have indexed the full text and can use it to help power searches. If you are trying to find the full text for yourself, try looking in open_access.oa_url
.
We get access to the full text in one of two ways: either using an open-access PDF, or using N-grams obtained from the Internet Archive. You can learn where a work's full text came from at fulltext_origin
.
About 57 million works have n-grams coverage through Internet Archive. OurResearch is the first organization to host this data in a highly usable way, and we are proud to integrate it into OpenAlex!
Curious about n-grams used in search? Browse them all via the API. Highly-cited works and less recent works are more likely to have n-grams.
It's easy to filter works with the filter
parameter:
In this example the filter is publication_year
and the value is 2020.
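In Python, that example filter might be sent like this (a sketch, assuming the requests library):

```python
import requests

# Filter works to publication_year 2020 and read the total count from meta.
resp = requests.get(
    "https://api.openalex.org/works",
    params={"filter": "publication_year:2020"},
)
print(resp.json()["meta"]["count"], "works published in 2020")
```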
/works
attribute filters
/works
convenience filters
abstract.search
Text search using abstracts
Value: a search string
authors_count
Number of authors for a work
Value: an Integer
authorships.institutions.continent
(alias: institutions.continent
)Returns: works where at least one of the author's institutions is in the chosen continent.
authorships.institutions.is_global_south
(alias: institutions.is_global_south
)Value: a Boolean (true
or false
)
best_open_version
Value: a String with one of the following values:
any
: This means that best_oa_location.version
= submittedVersion
, acceptedVersion
, or publishedVersion
acceptedOrPublished
: This means that best_oa_location.version
can be acceptedVersion
or publishedVersion
published
: This means that best_oa_location.version
= publishedVersion
cited_by
cites
concepts_count
Value: an Integer
default.search
Text search across titles, abstracts, and full text of works
Value: a search string
display_name.search
(alias: title.search
)Text search across titles for works
Value: a search string
from_created_date
Value: a date, formatted as yyyy-mm-dd
from_publication_date
Value: a date, formatted as yyyy-mm-dd
Filtering by publication date is not a reliable way to retrieve recently updated and created works, due to the way publishers assign publication dates. Use from_created_date
or from_updated_date
to get the latest changes in OpenAlex.
from_updated_date
fulltext.search
Value: a search string
We combined some n-grams before storing them in our search database, so querying for an exact phrase using quotes does not always work well.
has_abstract
Works that have an abstract available
Value: a Boolean (true
or false
)
Returns: works that have or lack an abstract, depending on the given value.
has_doi
Value: a Boolean (true
or false
)
has_oa_accepted_or_published_version
Value: a Boolean (true
or false
)
has_oa_submitted_version
Value: a Boolean (true
or false
)
has_orcid
Value: a Boolean (true
or false
)
has_pmcid
Value: a Boolean (true
or false
)
has_pmid
Value: a Boolean (true
or false
)
has_ngrams
(DEPRECATED) Works that have n-grams available to enable full-text search in OpenAlex.
Value: a Boolean (true
or false
)
has_references
Value: a Boolean (true
or false
)
journal
locations.source.host_institution_lineage
locations.source.publisher_lineage
mag_only
Value: a Boolean (true
or false
)
Returns: works which came from MAG (Microsoft Academic Graph), and no other data sources.
primary_location.source.has_issn
Value: a Boolean (true
or false
)
primary_location.source.publisher_lineage
raw_affiliation_strings.search
This filter used to be named raw_affiliation_string.search
, but it is now raw_affiliation_strings.search
(i.e., plural, with an 's').
Value: a search string
related_to
repository
You can also use this as a group_by
to learn things about repositories:
title_and_abstract.search
Text search across titles and abstracts for works
Value: a search string
to_created_date
Value: a date, formatted as yyyy-mm-dd
to_publication_date
Value: a date, formatted as yyyy-mm-dd
to_updated_date
version
Value: a String with value publishedVersion
, submittedVersion
, acceptedVersion
, or null
Searching without a middle initial returns names with and without middle initials. So a search for "John Smith" will also return "John W. Smith".
When searching for authors, there is no difference when using the search
parameter or the filter display_name.search
, since display_name is the only field searched when finding authors.
You can autocomplete authors to create a very fast type-ahead style search function:
This returns a list of authors with their last known affiliated institution as the hint:
The following fields can be searched within works:
Rather than searching for the names of entities related to works—such as authors, institutions, and sources—you need to search by a more unique identifier for that entity, like the OpenAlex ID. This means that there is a two-step process:
Why can't you do this in just one step? Well, if you use the search term, "NYU," you might end up missing the ones that use the full name "New York University," rather than the initials. Sure, you could try to think of all possible variants and search for all of them, but you might miss some, and you risk putting in search terms that let in works that you're not interested in. Figuring out which works are actually associated with the "NYU" you're interested in shouldn't be your responsibility—that's our job! We've done that work for you, so all the relevant works should be associated with one unique ID.
You can autocomplete works to create a very fast type-ahead style search function:
This returns a list of works titles with the author of each work set as the hint:
affiliations
List: List of objects, representing the affiliations this author has claimed in their publications. Each object in the list has two properties:
years
: a list of the years in which this author claimed an affiliation with this institution
cited_by_count
counts_by_year
Works or citations more than ten years old aren't included. Years with zero works and zero citations have been removed, so you will need to add those in if you need them.
created_date
display_name
String: The name of the author as a single string.
display_name_alternatives
List: Other ways that we've found this author's name displayed.
id
String: The OpenAlex ID for this author.
ids
Object: All the external identifiers that we know about for this author. IDs are expressed as URIs whenever possible. Possible ID types:
twitter
(String: this author's Twitter handle)
wikipedia
(String: this author's Wikipedia page)
Most authors are missing one or more ID types (either because we don't know the ID, or because it was never assigned). Keys for null IDs are not displayed.
last_known_institution
(deprecated)
last_known_institutions
orcid
Compared to other Canonical IDs, ORCID coverage is relatively low in OpenAlex, because ORCID adoption in the wild has been slow compared with DOI, for example. This is particularly an issue when dealing with older works and authors.
summary_stats
Object: Citation metrics for this author
While the 2-year mean citedness is normally a journal-level metric, it can be calculated for any set of papers, so we include it for authors.
updated_date
works_api_url
String: A URL that will get you a list of all this author's works.
We express this as an API URL (instead of just listing the works themselves) because sometimes an author's publication list is too long to reasonably fit into a single author object.
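A sketch of walking that URL from Python using cursor paging (cursor paging and the per-page maximum of 200 are documented elsewhere in the API guide; requests is an assumption, and the author ID is the sample used later in these docs):

```python
import requests

# Page through an author's full works list via works_api_url.
author = requests.get("https://api.openalex.org/authors/A5023888391").json()

cursor, works = "*", []
while cursor:
    page = requests.get(
        author["works_api_url"],
        params={"per-page": 200, "cursor": cursor},
    ).json()
    works.extend(page["results"])
    cursor = page["meta"].get("next_cursor")

print(len(works), "works retrieved for", author["display_name"])
```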
works_count
x_concepts
score
(Float): The strength of association between this author and the listed concept, from 0-100.
Author object
To see the full list of authors, go to the individual record for the work, which is never truncated.
This affects filtering as well. So if you filter works using an author ID or ROR, you will not receive works where that author is listed further than 100 places down on the list of authors. We plan to change this in the future, so that filtering works as expected.
You can filter authors with the filter
parameter:
/authors
attribute filters
Want to filter by last_known_institution.display_name
? This is a two-step process:
Find the institution.id
by searching institutions by display_name
.
Filter authors by last_known_institution.id
.
/authors
convenience filters
default.search
Value: a search string
display_name.search
Value: a search string
has_orcid
Value: a Boolean (true
or false
)
last_known_institution.continent
Returns: authors where the last known institution is in the chosen continent.
last_known_institution.is_global_south
Value: a Boolean (true
or false
)
You can filter sources with the filter
parameter:
/sources
attribute filters
Want to filter by host_organization.display_name
? This is a two-step process:
Find the host organization's ID by searching by display_name
in Publishers or Institutions, depending on which type you are looking for.
Filter sources by host_organization.id
.
/sources
convenience filters
continent
Returns: sources that are associated with the chosen continent.
default.search
Value: a search string
display_name.search
Value: a search string
has_issn
Value: a Boolean (true
or false
)
is_global_south
Value: a Boolean (true
or false
)
Journals and repositories that host works
Sources are where works are hosted. OpenAlex indexes about 249,000 sources. There are several types, including journals, conferences, preprint repositories, and institutional repositories.
Our information about sources comes from Crossref, the ISSN Network, and MAG. These datasets are joined automatically where possible, but there’s also a lot of manual combining involved. We do not curate journals, so any journal that is available in the data sources should make its way into OpenAlex.
Learn more about what you can do with sources:
N-grams list the words and phrases that occur in the full text of a Work
. We obtain them from Internet Archive's publicly (and generously) available General Index and use them to enable fulltext searches on the Works that have them, through both the fulltext.search
filter, and as an element of the more holistic search
parameter.
Get works where the publication year is 2020
It's best to read about filters before trying these out. It will show you how to combine filters and build an AND, OR, or negation query.
You can filter using these attributes of the Work object (click each one to view their documentation on the Work
object page):
The host_venue
and alternate_host_venues
properties have been deprecated in favor of and . The attributes host_venue
and alternate_host_venues
are no longer available in the Work object, and trying to access them in filters or group-bys will return an error.
(alias: author.id
) — Authors for a work (OpenAlex ID)
(alias: author.orcid
) — Authors for a work (ORCID)
(alias: institutions.country_code
)
(alias: institutions.id
) — Institutions affiliated with the authors of a work (OpenAlex ID)
(alias: institutions.ror
) — Institutions affiliated with the authors of a work (ROR ID)
(alias: is_corresponding
) — This filter marks whether or not we have corresponding author information for a given work
— The Open Access license for a work
(alias: concept.id
) — The concepts associated with a work
— Corresponding authors for a work (OpenAlex ID)
— The DOI (Digital Object Identifier) of a work
— Award IDs for grants
— Funding organizations linked to grants for a work
(alias: pmid
)
(alias: openalex
) — The OpenAlex ID for a work
(alias: mag
)
(alias: is_oa
) — Whether a work is Open Access
(alias: oa_status
) — The Open Access status for a work (e.g., gold, green, hybrid, etc.)
Want to filter by the display_name
of an associated entity (author, institution, source, etc.)?
These filters aren't attributes of the Work object, but they're handy for solving some common use cases:
Returns: works whose abstract includes the given string. See the search documentation for details on the search algorithm used.
Get works with abstracts that mention "artificial intelligence":
Returns: works with the chosen number of objects (authors). You can use the inequality filter to select a range, such as authors_count:>5
.
Get works that have exactly one author
Value: a String with a valid
Get works where at least one author's institution in each work is located in Europe
Returns: works where at least one of the author's institutions is in the Global South ().
Get works where at least one author's institution is in the Global South
Returns: works that meet the above criteria for best_oa_location.version.
Get works whose best_oa_location
is a submitted, accepted, or published version: ``
Value: the OpenAlex ID for a given work
Returns: works found in the given work's referenced_works section. You can think of this as outgoing citations.
Get works cited by :
Value: the OpenAlex ID for a given work
Returns: works that cite the given work. This is works that have the given OpenAlex ID in their referenced_works section. You can think of this as incoming citations.
Get works that cite : ``
The number of results returned by this filter may be slightly higher than the work's cited_by_count due to a timing lag in updating that field.
Returns: works with the chosen number of concepts.
Get works with at least three concepts assigned
This works the same as using the search parameter for Works.
Returns: works whose display_name (title) includes the given string; see the search documentation for details.
Get works with titles that mention the word "wombat":
For most cases, you should use the search parameter instead of this filter, because it uses a better search algorithm and searches over abstracts as well as titles.
Returns: works with created_date greater than or equal to the given date.
This field requires an OpenAlex Premium subscription to use.
Get works created on or after January 12th, 2023 (does not work without valid API key):
Returns: works with publication_date greater than or equal to the given date.
Get works published on or after March 14th, 2001:
Value: a date, formatted as an date or date-time string (for example: "2020-05-17", "2020-05-17T15:30", or "2020-01-02T00:22:35.180390").
Returns: works with updated_date greater than or equal to the given date.
This field requires an OpenAlex Premium subscription to use.
Get works updated on or after January 12th, 2023 (does not work without valid API key):
Learn more about using this filter to get the freshest data possible with our .
Returns: works whose fulltext includes the given string. Fulltext search is available for a subset of works, obtained either from PDFs or from N-grams supplied by the Internet Archive; see fulltext_origin for more details.
Get works with fulltext that mention "climate change":
Get the works that have abstracts:
Returns: works that have or lack a DOI, depending on the given value. It's especially useful for .
Get the works that have no DOI assigned: ``
Returns: works that have at least one location with is_oa = true and a version of acceptedVersion or publishedVersion. For works that undergo peer review, like journal articles, this means there is a peer-reviewed OA copy somewhere. For some items, like books, a published version doesn't imply peer review, so they aren't quite synonymous.
Get works with an OA accepted or published copy
Returns: works that have at least one location with is_oa = true and a version of submittedVersion. This is useful for finding works with preprints deposited somewhere.
Get works with an OA submitted copy: ``
Returns: if true, works where at least one author has an ORCID ID. If false, works where no authors have an ORCID ID. This is based on the orcid field within authorships. Note that, sometimes, we assign ORCID using author disambiguation, so this does not necessarily mean that the work itself has ORCID information.
Get the works where at least one author has an ORCID ID:
Returns: works that have or lack a PubMed Central identifier (pmcid), depending on the given value.
Get the works that have a pmcid
:
``
Returns: works that have or lack a PubMed identifier (pmid), depending on the given value.
Get the works that have a pmid
:
``
This filter has been deprecated. See instead: has_fulltext.
Returns: works for which n-grams are available or unavailable, depending on the given value. N-grams power fulltext searches through the fulltext.search filter and the search parameter.
Get the works that have n-grams:
Returns: works that have or lack references (referenced_works), depending on the given value.
Get the works that have references:
Value: the OpenAlex ID for a given source, where the source is a journal
Returns: works where the chosen journal is the source of the work.
Value: the OpenAlex ID for an institution
Returns: works where the given institution ID is in the source's host_organization_lineage
Get the works that have https://openalex.org/I205783295
in their host_organization_lineage
:
Value: the OpenAlex ID for a publisher
Returns: works where the given publisher ID is in the source's publisher lineage
Get the works that have https://openalex.org/P4310320547
in their publisher_lineage
:
MAG was a project by Microsoft Research to catalog all of the scholarly content on the internet. After it was discontinued in 2021, OpenAlex built upon the data MAG had accumulated, connecting and expanding it using additional data sources. The methods that MAG used to identify and aggregate scholarly content were quite different from most of our other sources, and so the content inherited from MAG, especially works that we did not connect with data from other sources, can look different from other works. While it's great to have these MAG-only works available, you may not always want to include them in your results or analyses. This filter allows you to include or exclude any works that came from MAG and only MAG.
Get all MAG-only works:
Returns: works where the primary location's source has at least one ISSN assigned.
Get the works that have an ISSN within the primary location:
Value: the OpenAlex ID for a publisher
Returns: works where the given publisher ID is in the publisher lineage of the primary location's source
Get the works that have https://openalex.org/P4310320547
in their publisher_lineage
:
Returns: works that have at least one raw affiliation string which includes the given string. See the search documentation for details on the search algorithm used.
Get works with the words Department of Political Science, University of Amsterdam somewhere in at least one author's raw_affiliation_strings
:
Value: the OpenAlex ID for a given work
Returns: works found in the given work's related_works section.
Get works related to :
Value: the OpenAlex ID for a given source, where the source is a repository
Returns: works where the chosen repository exists within the work's locations.
You can use this to find works where authors are associated with your university, but the work is not part of the university's repository.
Get works that are available in the University of Michigan Deep Blue repository (OpenAlex ID: https://openalex.org/S4306400393
)
Get works where at least one author is associated with the University of Michigan, but the works are not found in the University of Michigan Deep Blue repository
Learn which repositories have the most open access works
Returns: works whose display_name (title) or abstract includes the given string; see the search documentation for details.
Get works with title or abstract mentioning "gum disease":
Returns: works with created_date less than or equal to the given date.
This field requires an OpenAlex Premium subscription to use.
Get works created on or before January 12th, 2023 (does not work without a valid API key):
Returns: works with publication_date less than or equal to the given date.
Get works published on or before March 14th, 2001:
Value: a date, formatted as an date or date-time string (for example: "2020-05-17", "2020-05-17T15:30", or "2020-01-02T00:22:35.180390").
Returns: works with updated_date less than or equal to the given date.
This field requires an OpenAlex Premium subscription to use.
Get works updated before or on January 12th, 2023 (does not work without valid API key):
Returns: works where the chosen version exists within the work's locations. If null
, it returns works where no version is found in any of the locations.
Get works where a published version is available in at least one of the locations:
Get the author with the OpenAlex ID A5023888391:
That will return an Author object, describing everything OpenAlex knows about the author with that ID:
You can make up to 50 of these queries at once by requesting a list of entities and filtering on IDs.
Get the author with this ORCID: https://orcid.org/0000-0002-1298-3089
:
You can use the full ID or a shorter Uniform Resource Name (URN) format like so:
You can use select
to limit the fields that are returned in an author object. More details are .
Display only the id
and display_name
and orcid for an author object
Get counts of works by Open Access status:
It's best to read about getting lists of entities before trying these out. It will show you how results are formatted, the number of results returned, and how to sort results.
The host_venue
and alternate_host_venues
properties have been deprecated in favor of and . The attributes host_venue
and alternate_host_venues
are no longer available in the Work object, and trying to access them in filters or group-bys will return an error.
(alias author.id
)
(alias author.orcid
)
(alias institutions.country_code
)
(alias institutions.continent
)
(alias institutions.id
)
(alias institutions.ror
)
(alias institutions.type
)
(alias: is_corresponding
): this marks whether or not we have corresponding author information for a given work
(DEPRECATED)
(alias is_oa
)
(alias oa_status
)
The best way to search for authors is to use the search
query parameter, which searches the display_name and the display_name_alternatives fields. Example:
Get authors with the name "Carl Sagan":
Names with diacritics are flexible as well. So a search for David Tarrago can return David Tarragó, and a search for David Tarragó can return David Tarrago. When searching with a diacritic, diacritic versions of the names are prioritized in order to honor the original form of the author's name. Read more about our handling of diacritics .
You can read more about search in the API Guide. It will show you how relevance score is calculated, how words are stemmed to improve search results, and how to do complex boolean searches.
You can also use search as a filter, by appending .search
to the end of the property you are filtering for:
Get authors with the name "john smith" in the display_name:
You can also use the filter default.search
, which works the same as using the search parameter.
Autocomplete authors with "ronald sw" in the display name:
Read more about .
The best way to search for works is to use the search
query parameter, which searches across titles, abstracts, and fulltext. Example:
Get works with search term "dna" in the title, abstract, or fulltext:
Fulltext search is available for a subset of works, see for more details.
You can read more about search . It will show you how relevance score is calculated, how words are stemmed to improve search results, and how to do complex boolean searches.
You can use search as a filter, allowing you to fine-tune the fields you're searching over. To do this, you append .search
to the end of the property you are filtering for:
Get works with "cubist" in the title:
You can also use the filter default.search
, which works the same as using the search parameter.
These searches make use of stemming and stop-word removal. You can disable this for searches on titles and abstracts. Learn how to do this .
Find the ID of the related entity. For example, if you're interested in works associated with NYU, you could search the /institutions
endpoint for that name: . Looking at the first result, you'll see that the OpenAlex ID for NYU is I57206974
.
Use a filter with the /works
endpoint to get all of the works: .
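Put together, the two steps might look like this in Python (a sketch assuming the requests library, not an official client):

```python
import requests

# Step 1: find the institution's OpenAlex ID by name.
inst = requests.get(
    "https://api.openalex.org/institutions", params={"search": "NYU"}
).json()["results"][0]

# Step 2: filter works on that ID.
works = requests.get(
    "https://api.openalex.org/works",
    params={"filter": f"institutions.id:{inst['id']}"},
).json()

print(inst["display_name"], "->", works["meta"]["count"], "works")
```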
Autocomplete works with "tigers" in the title:
Read more about .
When you use the API to get a single author or lists of authors, this is what's returned.
institution
: a dehydrated Institution object
Integer: The total number of works that cite a work this author has created.
List: works_count and cited_by_count for each of the last ten years, binned by year. To put it another way: each year, you can see how many works this author published, and how many times they got cited.
String: The date this Author
object was created in the OpenAlex dataset, expressed as an ISO 8601 date string.
openalex
(String: this author's OpenAlex ID. Same as Author.id)
orcid
(String: this author's ORCID. Same as Author.orcid)
scopus
(String: this author's Scopus author ID)
This field has been deprecated. Its replacement is last_known_institutions.
List: List of Institution objects. This author's last known institutional affiliations. In this context "last known" means that we took all the author's works, sorted them by publication date, and selected the most recent one. If there is only one affiliated institution for this author for the work, this will be a list of length 1; if there are multiple affiliations, they will all be included in the list.
Each item in the list is a dehydrated Institution object, and you can find more documentation on the Institution page.
String: The ORCID ID for this author. ORCID is a global and unique ID for authors. This is the Canonical External ID for authors.
2yr_mean_citedness
Float: The 2-year mean citedness for this author. Also known as impact factor. We use the year prior to the current year for the citations (the numerator) and the two years prior to that for the citation-receiving publications (the denominator).
h_index
Integer: The h-index for this author.
i10_index
Integer: The i-10 index for this author.
String: The last time anything in this author object changed, expressed as an ISO 8601 date string. This date is updated for any change at all, including increases in various counts.
Integer: The number of Works this author has created.
This is updated a couple of times per day. So the count may be slightly different than what you see when you view this author's list of works.
x_concepts
will be deprecated and removed soon. We will be replacing this functionality with Topics instead.
List: The concepts most frequently applied to works created by this author. Each is represented as a dehydrated Concept object, with one additional attribute:
The DehydratedAuthor
is a stripped-down Author object, with most of its properties removed to save weight. Its only remaining properties are:
When retrieving a list of works in the API, the authorships
list within each work will be cut off at 100 authorships objects in order to keep things running well. When this happens the boolean value is_authors_truncated
will be available and set to true
. This affects a small portion of OpenAlex, as there are around 35,000 works with more than 100 authors. This limitation does not apply when you retrieve a single work directly.
Example list of works with truncated authors
Work with all 249 authors available
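For example, a minimal sketch (using the requests library, not an official client) of spotting truncated author lists in a list response and re-fetching those works individually might look like this:

```python
import requests

# Sketch: when a work in a list response has is_authors_truncated set to
# true, re-fetch that single work to get the full authorships list.
resp = requests.get("https://api.openalex.org/works", params={"per-page": 25})
resp.raise_for_status()
for work in resp.json()["results"]:
    if work.get("is_authors_truncated"):
        # The API accepts the full OpenAlex ID URL in the path.
        full = requests.get(f"https://api.openalex.org/works/{work['id']}").json()
        print(work["id"], "has", len(full["authorships"]), "authors")
```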
Get authors that have an ORCID
It's best to read about filters before trying these out. It will show you how to combine filters and build an AND, OR, or negation query.
You can filter using these attributes of the Author
entity object (click each one to view their documentation on the Author object page):
(alias: openalex
)
(the author's scopus ID, as an integer)
(accepts float, null, !null, can use range queries such as < >)
(accepts integer, null, !null, can use range queries)
(accepts integer, null, !null, can use range queries)
(alias: concepts.id
or concept.id
) -- will be deprecated soon
To learn more about why we do it this way, see here.
These filters aren't attributes of the Author object, but they're included to address some common use cases:
This works the same as using the search parameter for Authors.
Returns: Authors whose display_name contains the given string; see the search page for details.
Get authors named "tupolev":
Returns: authors that have or lack an ORCID, depending on the given value.
Get the authors that have an ORCID:
Value: a String with a valid continent filter
Get authors where the last known institution is located in Africa
Returns: authors whose last known institution is located in the Global South.
Get authors where the last known institution is located in the Global South
Get counts of authors by the last known institution continent:
It's best to read about group by before trying these out. It will show you how results are formatted, the number of results returned, and how to sort results.
Get all authors in OpenAlex
By default we return 25 results per page. You can change this default and page through authors with the per-page
and page
parameters:
Get the second page of authors results, with 50 results returned per page
You also can sort results with the sort
parameter:
Sort authors by cited by count, descending
Continue on to learn how you can filter and search lists of authors.
You can use sample
to get a random batch of authors. Read more about sampling and how to add a seed
value here.
Get 25 random authors
You can use select
to limit the fields that are returned in a list of authors. More details are here.
Display only the id
and display_name
and orcid
within authors results
Get sources that have an ISSN
It's best to read about filters before trying these out. It will show you how to combine filters and build an AND, OR, or negation query
You can filter using these attributes of the Source
entity object (click each one to view their documentation on the Source object page):
(alias: host_organization.id
)
— Use this with a publisher ID to find sources from that publisher and all of its children.
(alias: openalex
)
— Requires exact match. Use the host_organization_lineage filter instead if you want to find sources from a publisher and all of its children.
(accepts float, null, !null, can use range queries such as < >)
(accepts integer, null, !null, can use range queries)
(accepts integer, null, !null, can use range queries)
(alias: concepts.id
or concept.id
) -- will be deprecated soon
To learn more about why we do it this way, see here.
These filters aren't attributes of the Source object, but they're included to address some common use cases:
Value: a String with a valid continent filter
Get sources that are associated with Asia
This works the same as using the search parameter for Sources.
Returns: sources with a display_name containing the given string; see the search page for details.
Get sources with names containing "Neurology":
In most cases, you should use the search parameter instead of this filter because it uses a better search algorithm.
Returns: sources that have or lack an ISSN, depending on the given value.
Get sources without ISSNs:
Returns: sources that are associated with the Global South.
Get sources that are located in the Global South
We have created a page in our help docs to give you all the information you need about our author disambiguation, including information about author IDs, how we disambiguate authors, and how you can curate your author profile. Go to the author disambiguation help page to find out what you need to know!
Get the source with the OpenAlex ID S137773608
:
That will return a Source object, describing everything OpenAlex knows about the source with that ID:
You can make up to 50 of these queries at once by requesting a list of entities and filtering on IDs using OR syntax.
Get the source with ISSN: 2041-1723
:
You can use select
to limit the fields that are returned in a source object. More details are here.
Display only the id
and display_name
for a source object
Get a list of OpenAlex sources:
The Canonical External ID for sources is ISSN-L, which is a special "main" ISSN assigned to every source (sources tend to have multiple ISSNs). About 90% of sources in OpenAlex have an ISSN-L or ISSN.
Several sources may host the same work. OpenAlex reports both the primary host source (generally wherever the version of record lives), and alternate host sources (like preprint repositories).
Sources are linked to works via the primary_location and locations properties.
Check out the tutorial, a Jupyter notebook showing how to use Python and the API to learn about all of the sources in a country.
ORCID
orcid
Scopus
scopus
twitter
Wikipedia
wikipedia
fulltext via n-grams
ISSN
issn
Fatcat
fatcat
Microsoft Academic Graph (MAG)
mag
Wikidata
wikidata
It's easy to get an institution from the API with: /institutions/<entity_id>
. Here's an example:
Get the institution with the OpenAlex ID I27837315
:
https://api.openalex.org/institutions/I27837315
That will return an Institution
object, describing everything OpenAlex knows about the institution with that ID:
You can make up to 50 of these queries at once by requesting a list of entities and filtering on IDs using OR syntax.
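For instance, a rough sketch of such a batched lookup might look like this (the second ID below is just a placeholder; the pipe character is the OR separator in filter values):

```python
import requests

# Sketch: look up several institutions in one request by OR-ing their
# OpenAlex IDs together in a filter value.
ids = ["I27837315", "I0000000000"]  # second ID is a placeholder, not real
resp = requests.get(
    "https://api.openalex.org/institutions",
    params={"filter": "openalex:" + "|".join(ids)},
)
for inst in resp.json()["results"]:
    print(inst["id"], inst["display_name"])
```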
You can look up institutions using external IDs such as a ROR ID:
Get the institution with ROR ID https://ror.org/00cvxb145
:
https://api.openalex.org/institutions/ror:https://ror.org/00cvxb145
Available external IDs for institutions are:
ROR
ror
Microsoft Academic Graph (MAG)
mag
Wikidata
wikidata
You can use select
to limit the fields that are returned in an institution object. More details are here.
Display only the id
and display_name
for an institution object
https://api.openalex.org/institutions/I27837315?select=id,display_name
These are the fields in an institution object. When you use the API to get a single institution or lists of institutions, this is what's returned.
associated_institutions
List: Institutions
related to this one. Each associated institution is represented as a dehydrated Institution object, with one extra property:
relationship
(String): The type of relationship between this institution and the listed institution. Possible values: parent
, child
, and related
.
Institution associations and the relationship vocabulary come from ROR's relationships
.
cited_by_count
Integer: The total number of Works
that cite a work created by an author affiliated with this institution. Or less formally: the number of citations this institution has collected.
country_code
String: The country where this institution is located, represented as an ISO two-letter country code.
counts_by_year
List: works_count
and cited_by_count
for each of the last ten years, binned by year. To put it another way: each year, you can see how many new works this institution put out, and how many times any work affiliated with this institution got cited.
Years with zero citations and zero works have been removed so you will need to add those in if you need them.
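If you need a complete ten-year series, a small helper along these lines (a sketch, not part of the API) can pad the missing years back in:

```python
from datetime import date

# Sketch: counts_by_year omits years with zero works and zero citations,
# so pad the last ten years back in when you need a complete series.
def padded_counts(counts_by_year, years_back=10):
    this_year = date.today().year
    by_year = {row["year"]: row for row in counts_by_year}
    return [
        by_year.get(y, {"year": y, "works_count": 0, "cited_by_count": 0})
        for y in range(this_year - years_back, this_year)
    ]
```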
created_date
String: The date this Institution
object was created in the OpenAlex dataset, expressed as an ISO 8601 date string.
display_name
String: The primary name of the institution.
display_name_acronyms
List: Acronyms or initialisms that people sometimes use instead of the full display_name
.
display_name_alternatives
List: Other names people may use for this institution.
geo
Object: A bunch of stuff we know about the location of this institution:
city
(String): The city where this institution lives.
geonames_city_id
(String): The city where this institution lives, as a GeoNames database ID.
region
(String): The sub-national region (state, province) where this institution lives.
country_code
(String): The country where this institution lives, represented as an ISO two-letter country code.
country
(String): The country where this institution lives.
latitude
(Float): Does what it says.
longitude
(Float): Does what it says.
homepage_url
String: The URL for institution's primary homepage.
id
String: The OpenAlex ID for this institution.
ids
Object: All the external identifiers that we know about for this institution. IDs are expressed as URIs whenever possible. Possible ID types:
mag
(Integer: this institution's Microsoft Academic Graph ID)
openalex
(String: this institution's OpenAlex ID. Same as Institution.id
)
ror
(String: this institution's ROR ID. Same as Institution.ror
)
wikipedia
(String: this institution's Wikipedia page URL)
wikidata
(String: this institution's Wikidata ID)
Many institutions are missing one or more ID types (either because we don't know the ID, or because it was never assigned). Keys for null IDs are not displayed.
image_thumbnail_url
String: Same as image_url
, but it's a smaller image.
is_super_system
Boolean: True if this institution is a "super system". This includes large university systems such as the University of California System (https://openalex.org/I2803209242
), as well as some governments and multinational companies.
We have this special flag for these institutions so that we can exclude them from other institutions' lineage
, which we do because these super systems are not generally relevant in group-by results when you're looking at ranked lists of institutions.
The list of institution IDs marked as super systems can be found in this file.
image_url
String: URL where you can get an image representing this institution. Usually this is hosted on Wikipedia, and usually it's a seal or logo.
international
Object: The institution's display name in different languages. Derived from the wikipedia page for the institution in the given language.
display_name
(Object)
key
(String): language code in wikidata language code format. Full list of languages is here.
value
(String): display_name
in the given language
lineage
List: OpenAlex IDs of institutions. The list will include this institution's ID, as well as any parent institutions. If this institution has no parent institutions, this list will only contain its own ID.
This information comes from ROR's relationships
, specifically the Parent/Child relationships.
Super systems are excluded from the lineage. See is_super_system
above.
repositories
List: Repositories (Sources
with type: repository
) that have this institution as their host_organization
roles
List: List of role objects, which include the role
(one of institution
, funder
, or publisher
), the id
(OpenAlex ID), and the works_count
.
In many cases, a single organization does not fit neatly into one role. For example, Yale University is a single organization that is a research university, funds research studies, and publishes an academic journal. The roles
property links the OpenAlex entities together for a single organization, and includes counts for the works associated with each role.
The roles
list of an entity (Funder, Publisher, or Institution) always includes itself. In the case where an organization only has one role, the roles
will be a list of length one, with itself as the only item.
ror
String: The ROR ID for this institution. This is the Canonical External ID for institutions.
The ROR (Research Organization Registry) identifier is a globally unique ID for research organizations. ROR is the successor to GRID, which is no longer being updated.
summary_stats
Object: Citation metrics for this institution
2yr_mean_citedness
Float: The 2-year mean citedness for this institution. Also known as impact factor. We use the year prior to the current year for the citations (the numerator) and the two years prior to that for the citation-receiving publications (the denominator).
h_index
Integer: The h-index for this institution.
i10_index
Integer: The i-10 index for this institution.
While the h-index and the i-10 index are normally author-level metrics and the 2-year mean citedness is normally a journal-level metric, they can be calculated for any set of papers, so we include them for institutions.
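Spelled out, the 2-year mean citedness definition above (with Y standing for the current year) can be written as:

```latex
\text{2yr mean citedness} =
  \frac{\#\{\text{citations received in year } Y-1 \text{ by works published in } Y-2 \text{ or } Y-3\}}
       {\#\{\text{works published in } Y-2 \text{ or } Y-3\}}
```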
type
String: The institution's primary type, using the ROR "type" controlled vocabulary.
Possible values are: Education
, Healthcare
, Company
, Archive
, Nonprofit
, Government
, Facility
, and Other
.
updated_date
String: The last time anything in this Institution
changed, expressed as an ISO 8601 date string. This date is updated for any change at all, including increases in various counts.
works_api_url
String: A URL that will get you a list of all the Works
affiliated with this institution.
We express this as an API URL (instead of just listing the Works
themselves) because most institutions have way too many works to reasonably fit into a single return object.
works_count
Integer: The number of Works
created by authors affiliated with this institution. Or less formally: the number of works coming out of this institution.
x_concepts
x_concepts
will be deprecated and removed soon. We will be replacing this functionality with Topics
instead.
List: The Concepts
most frequently applied to works affiliated with this institution. Each is represented as a dehydrated Concept object, with one additional attribute:
score
(Float): The strength of association between this institution and the listed concept, from 0-100.
DehydratedInstitution object
The DehydratedInstitution
is a stripped-down Institution
object, with most of its properties removed to save weight. Its only remaining properties are:
These are the fields in a source object. When you use the API to get a single source or lists of sources, this is what's returned.
String: An abbreviated title obtained from the ISSN Centre.
Array: Alternate titles for this source, as obtained from the ISSN Centre and individual work records, like Crossref DOIs, that carry the source name as a string. These are commonly abbreviations or translations of the source's canonical name.
List: List of objects, each with price
(Integer) and currency
(String).
Article processing charge information, taken directly from DOAJ.
Integer: The source's article processing charge in US Dollars, if available from DOAJ.
The apc_usd
value is calculated by taking the APC price (see apc_prices
) with a currency of USD if it is available. If it's not available, we convert the first available value from apc_prices
into USD, using recent exchange rates.
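As a rough illustration of that fallback logic (the exchange-rate table here is a placeholder you would supply yourself, not something the API provides):

```python
# Sketch: prefer a USD entry from apc_prices; otherwise convert the first
# listed price using a caller-supplied exchange rate.
def apc_usd(apc_prices, usd_rates):
    if not apc_prices:
        return None
    for entry in apc_prices:
        if entry["currency"] == "USD":
            return entry["price"]
    first = apc_prices[0]
    rate = usd_rates.get(first["currency"])
    return round(first["price"] * rate) if rate else None
```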
cited_by_count
Integer: The total number of Works
that cite a Work
hosted in this source.
country_code
String: The country that this source is associated with, represented as an ISO two-letter country code.
counts_by_year
List: works_count
and cited_by_count
for each of the last ten years, binned by year. To put it another way: each year, you can see how many new works this source started hosting, and how many times any work in this source got cited.
If the source was founded less than ten years ago, there will naturally be fewer than ten years in this list. Years with zero citations and zero works have been removed so you will need to add those in if you need them.
created_date
String: The date this Source
object was created in the OpenAlex dataset, expressed as an ISO 8601 date string.
display_name
String: The name of the source.
homepage_url
String: The starting page for navigating the contents of this source; the homepage for this source's website.
host_organization
String: The host organization for this source as an OpenAlex ID. This will be an Institution.id
if the source is a repository, and a Publisher.id
if the source is a journal, conference, or eBook platform (based on the type
field).
host_organization_lineage
List: OpenAlex IDs — See Publisher.lineage
. This will only be included if the host_organization
is a publisher (and not if the host_organization
is an institution).
host_organization_name
String: The display_name
from the host_organization, shown for convenience.
id
String: The OpenAlex ID for this source.
ids
Object: All the external identifiers that we know about for this source. IDs are expressed as URIs whenever possible. Possible ID types:
fatcat
(String: this source's Fatcat ID)
issn
(List: a list of this source's ISSNs. Same as Source.issn
)
issn_l
(String: this source's ISSN-L. Same as Source.issn_l
)
mag
(Integer: this source's Microsoft Academic Graph ID)
openalex
(String: this source's OpenAlex ID. Same as Source.id
)
wikidata
(String: this source's Wikidata ID)
Many sources are missing one or more ID types (either because we don't know the ID, or because it was never assigned). Keys for null IDs are not displayed.
is_core
Boolean: Whether this source is identified as a "core source" by CWTS, used in the Open Leiden Ranking of universities around the world. The list of core sources can be found here.
is_in_doaj
Boolean: Whether this is a journal listed in the Directory of Open Access Journals (DOAJ).
is_oa
Boolean: Whether this is currently a fully open-access source. This could be true
for a preprint repository where everything uploaded is free to read, or for a Gold or Diamond open access journal, where all newly published Works are available for free under an open license.
We say "currently" because the status of a source can change over time. It's common for journals to "flip" to Gold OA, after which they may make only future articles open or also open their back catalogs. It's entirely possible for a source to say is_oa: true
, but for an article from last year to require a subscription.
issn
List: The ISSNs used by this source. Many publications have multiple ISSNs, so ISSN-L should be used when possible.
issn_l
String: The ISSN-L identifying this source. This is the Canonical External ID for sources.
ISSN is a global and unique ID for serial publications. However, different media versions of a given publication (e.g., print and electronic) often have different ISSNs. This is why we can't have nice things. The ISSN-L or Linking ISSN solves the problem by designating a single canonical ISSN for all media versions of the title. It's usually the same as the print ISSN.
Array: Societies on whose behalf the source is published and maintained, obtained from our crowdsourced list. Thanks!
summary_stats
Object: Citation metrics for this source
2yr_mean_citedness
Float: The 2-year mean citedness for this source. Also known as impact factor. We use the year prior to the current year for the citations (the numerator) and the two years prior to that for the citation-receiving publications (the denominator).
h_index
Integer: The h-index for this source.
i10_index
Integer: The i-10 index for this source.
While the h-index and the i-10 index are normally author-level metrics, they can be calculated for any set of papers, so we include them for sources.
type
String: The type of source, which will be one of: journal
, repository
, conference
, ebook platform
, book series
, metadata
, or other
.
updated_date
String: The last time anything in this Source
object changed, expressed as an ISO 8601 date string. This date is updated for any change at all, including increases in various counts.
works_api_url
String: A URL that will get you a list of all this source's Works
.
We express this as an API URL (instead of just listing the works themselves) because sometimes a source's publication list is too long to reasonably fit into a single Source
object.
works_count
Integer: The number of Works
this source hosts.
x_concepts
x_concepts
will be deprecated and removed soon. We will be replacing this functionality with Topics
instead.
List: The Concepts
most frequently applied to works hosted by this source. Each is represented as a dehydrated Concept object, with one additional attribute:
score
(Float): The strength of association between this source and the listed concept, from 0-100.
DehydratedSource object
The DehydratedSource
is a stripped-down Source
object, with most of its properties removed to save weight. Its only remaining properties are:
You can group sources with the group_by
parameter:
Get counts of sources by publisher: https://api.openalex.org/sources?group_by=publisher
Or you can group using one of the attributes below.
It's best to read about group by before trying these out. It will show you how results are formatted, the number of results returned, and how to sort results.
/sources
group_by attributes
host_organization
(alias: host_organization.id
)
host_organization_lineage
(alias: host_organization.id
)
Universities and other organizations to which authors claim affiliations
Institutions are universities and other organizations to which authors claim affiliations. OpenAlex indexes about 109,000 institutions.
Get a list of OpenAlex institutions:
https://api.openalex.org/institutions
The Canonical External ID for institutions is the ROR ID. All institutions in OpenAlex have ROR IDs.
Our information about institutions comes from metadata found in Crossref, PubMed, ROR, MAG, and publisher websites. In order to link institutions to works, we parse every affiliation listed by every author. These affiliation strings can be quite messy, so we’ve trained an algorithm to interpret them and extract the actual institutions with reasonably high reliability.
For a simple example: we will treat both “MIT, Boston, USA” and “Massachusetts Institute of Technology” as the same institution (https://ror.org/042nb2s44).
Institutions are linked to works via the works.authorships
property.
Most papers use raw strings to enumerate author affiliations (e.g., "Univ. of Florida, Gainesville FL"). Parsing these to determine the actual institution the author is talking about is nontrivial; you can find more information about how we do it, as well as code, models, and test sets, here on GitHub.
Learn more about what you can do with institutions:
You can group institutions with the group_by
parameter:
Get counts of institutions by country code: https://api.openalex.org/institutions?group_by=country_code
Or you can group using one of the attributes below.
It's best to read about group by before trying these out. It will show you how results are formatted, the number of results returned, and how to sort results.
/institutions
group_by attributes
Topics assigned to works
Works in OpenAlex are tagged with Topics using an automated system that takes into account the available information about the work, including title, abstract, source (journal) name, and citations. There are around 4,500 Topics. Works are assigned topics using a model that assigns scores for each topic for a work. The highest-scoring topic is that work's primary_topic
. We also provide additional highly ranked topics for works, in Work.topics.
To learn more about how OpenAlex topics work in general, see the Topics page at OpenAlex help pages.
For a detailed description of the methods behind OpenAlex Topics, see our paper: "OpenAlex: End-to-End Process for Topic Classification". The code and model are available at https://github.com/ourresearch/openalex-topic-classification.
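For example, a short sketch of reading a work's primary_topic and its other topics (the work ID below is just an example ID, and the requests library is assumed):

```python
import requests

# Sketch: primary_topic is the highest-scoring topic; the topics list holds
# the other highly ranked topics for the work.
work = requests.get("https://api.openalex.org/works/W2741809807").json()
primary = work.get("primary_topic")
if primary:
    print("primary topic:", primary["display_name"], primary.get("score"))
for topic in work.get("topics", []):
    print(topic["display_name"], topic.get("score"))
```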
Learn more about what you can do with topics:
You can get lists of institutions:
Get all institutions in OpenAlex https://api.openalex.org/institutions
Which returns a response like this:
By default we return 25 results per page. You can change this default and page through institutions with the per-page
and page
parameters:
Get the second page of institutions results, with 50 results returned per page https://api.openalex.org/institutions?per-page=50&page=2
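A minimal paging sketch in Python (using the requests library, capped at a few pages just for illustration) based on these two parameters:

```python
import requests

# Sketch: walk the first few pages of institutions with per-page and page.
for page in range(1, 4):
    resp = requests.get(
        "https://api.openalex.org/institutions",
        params={"per-page": 50, "page": page},
    )
    for inst in resp.json()["results"]:
        print(inst["display_name"])
```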
You also can sort results with the sort
parameter:
Sort institutions by cited by count, descending https://api.openalex.org/institutions?sort=cited_by_count:desc
Continue on to learn how you can filter and search lists of institutions.
You can use sample
to get a random batch of institutions. Read more about sampling and how to add a seed
value here.
Get 50 random institutions https://api.openalex.org/institutions?sample=50&per-page=50
You can use select
to limit the fields that are returned in a list of institutions. More details are here.
Display only the id
, ror
, and display_name
within institutions results
https://api.openalex.org/institutions?select=id,display_name,ror
These are the fields in a topic object. When you use the API to get a single topic or lists of topics, this is what's returned.
description
String: A description of this topic, generated by AI.
display_name
String: The English-language label of the topic.
domain
Object: The ID and the name (display_name
) for the domain of this topic. The domain is the highest level in the "domain, field, subfield, topic" system, which means it is the least granular. See the topics overview for more explanation and a diagram.
field
Object: The ID and the name (display_name
) for the field of this topic. The field is the second-highest level in the "domain, field, subfield, topic" system, which means it is the second-least granular. See the topics overview for more explanation and a diagram.
id
String: The OpenAlex ID for this topic.
ids
Object: All the external identifiers that we know about for this topic. IDs are expressed as URIs whenever possible. Possible ID types:
openalex
(String: this topic's OpenAlex ID. Same as Topic.id
)
wikipedia
(String: this topic's Wikipedia page URL)
keywords
List: Keywords consisting of one or several words each, meant to represent the content of the papers in the topic. These keywords were generated as part of the AI model. For now, they are provided as-is, but we will be providing more support and documenting them more thoroughly.
subfield
Object: The ID and the name (display_name
) for the subfield of this topic. The subfield is the third-highest level in the "domain, field, subfield, topic" system, which means it is the third-least granular. See the topics overview for more explanation and a diagram.
updated_date
String: The last time anything in this topic object changed, expressed as an ISO 8601 date string. This date is updated for any change at all, including increases in various counts.
works_count
Integer: The number of works tagged with this topic.
You can filter institutions with the filter
parameter:
Get institutions that are located in Canada https://api.openalex.org/institutions?filter=country_code:ca
It's best to read about filters before trying these out. It will show you how to combine filters and build an AND, OR, or negation query
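For example, filters can be combined with a comma to mean AND, so a query along the lines of https://api.openalex.org/institutions?filter=country_code:ca,type:education should return Canadian institutions whose type is education.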
/institutions
attribute filters
You can filter using these attributes of the Institution
entity object (click each one to view their documentation on the Institution
object page):
lineage
: OpenAlex ID for an Institution
openalex
: the OpenAlex ID of the Institution
repositories.host_organization
: OpenAlex ID for an Institution
repositories.host_organization_lineage
: OpenAlex ID for an Institution
repositories.id
: the OpenAlex ID of a repository (a Source
)
ror
: the ROR ID of the Institution
summary_stats.2yr_mean_citedness
(accepts float, null, !null, can use range queries such as < >)
summary_stats.h_index
(accepts integer, null, !null, can use range queries)
summary_stats.i10_index
(accepts integer, null, !null, can use range queries)
x_concepts.id
(alias: concepts.id
or concept.id
) -- will be deprecated soon
/institutions
convenience filters
These filters aren't attributes of the Institution
object, but they're included to address some common use cases:
continent
Value: a String with a valid continent filter
Returns: institutions that are located in the chosen continent.
Get institutions that are located in South America https://api.openalex.org/institutions?filter=continent:south_america
default.search
Value: a search string
This works the same as using the search
parameter for Institutions.
display_name.search
Value: a search string
Returns: institutions with a display_name
containing the given string; see the search page for details.
Get institutions with names containing "technology":
https://api.openalex.org/institutions?filter=display_name.search:technology
In most cases, you should use the search
parameter instead of this filter because it uses a better search algorithm.
has_ror
Value: a Boolean (true
or false
)
Returns: institutions that have or lack a ROR ID, depending on the given value.
Get institutions without ROR IDs:
https://api.openalex.org/institutions?filter=has_ror:false
is_global_south
Value: a Boolean (true
or false
)
Returns: institutions that are located in the Global South.
Get institutions that are located in the Global South https://api.openalex.org/institutions?filter=is_global_south:true
The best way to search for sources is to use the search
query parameter, which searches across display_name
, alternate_titles
, and abbreviated_title
. Example:
Search for the abbreviated version of the Journal of the American Chemical Society "jacs
":
https://api.openalex.org/sources?search=jacs
You can read more about search here. It will show you how relevance score is calculated, how words are stemmed to improve search results, and how to do complex boolean searches.
You can also use search as a filter, allowing you to fine-tune the fields you're searching over. To do this, you append .search
to the end of the property you are filtering for:
Get sources with "nature" in the title: https://api.openalex.org/sources?filter=display_name.search:nature
The following fields can be searched as a filter within sources:
You can also use the filter default.search
, which works the same as using the search
parameter.
You can autocomplete sources to create a very fast type-ahead style search function:
Autocomplete sources with "neuro" in the display_name: https://api.openalex.org/autocomplete/sources?q=neuro
This returns a list of sources with the publisher set as the hint:
Read more in the autocomplete page in the API guide.
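A minimal type-ahead sketch against this endpoint (using the requests library; the query string is just an example) might look like:

```python
import requests

# Sketch: autocomplete results carry a display_name plus a hint field
# (for sources, the hint is the publisher).
resp = requests.get(
    "https://api.openalex.org/autocomplete/sources", params={"q": "neuro"}
)
for item in resp.json()["results"]:
    print(item["display_name"], "-", item.get("hint"))
```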
The best way to search for institutions is to use the search
query parameter, which searches the display_name
, the display_name_alternatives
, and the display_name_acronyms
. Example:
Search institutions for San Diego State University:
https://api.openalex.org/institutions?search=san diego state university
You can read more about search here. It will show you how relevance score is calculated, how words are stemmed to improve search results, and how to do complex boolean searches.
You can also use search as a filter, allowing you to fine-tune the fields you're searching over. To do this, you append .search
to the end of the property you are filtering for:
Get institutions with "florida" in the display_name
:
https://api.openalex.org/institutions?filter=display_name.search:florida
The following field can be searched as a filter within institutions:
You can also use the filter default.search
, which works the same as using the search
parameter.
You can autocomplete institutions to create a very fast type-ahead style search function:
Autocomplete institutions with "harv" in the display_name
:
https://api.openalex.org/autocomplete/institutions?q=harv
This returns a list of institutions with the institution location set as the hint:
Read more in the autocomplete page in the API guide.
Short words or phrases assigned to works using AI
Works in OpenAlex are tagged with Keywords using an automated system based on Topics.
cited_by_count
Integer: The number of citations to works that have been tagged with this keyword. Or less formally: the number of citations to this keyword.
For example, if there are just two works tagged with this keyword and one of them has been cited 10 times, and the other has been cited 1 time, cited_by_count
for this keyword would be 11
.
created_date
display_name
String: The English-language label of the keyword.
id
String: The OpenAlex ID for this keyword.
updated_date
works_count
Integer: The number of works tagged with this keyword.
It's easy to get a keyword from the API with: /keyword/<entity_id>
. Here's an example:
You can get lists of keywords:
Which returns a response like this:
You can filter keywords with the filter
parameter:
/keywords attribute filters
/keywords convenience filters
default.search
Value: a search string
display_name.search
Value: a search string
You can group keywords with the group_by
parameter:
Or you can group using one of the attributes below.
Here are the fields in a publisher object. When you use the API to get a single publisher or lists of publishers, this is what's returned.
alternate_titles
List: A list of alternate titles for this publisher.
cited_by_count
Integer: The number of citations to works that are linked to this publisher through journals or other sources.
For example, if a publisher publishes 27 journals and those 27 journals have 3,050 works, this number is the sum of the cited_by_count values for all of those 3,050 works.
country_codes
counts_by_year
Years with zero citations and zero works have been removed so you will need to add those back in if you need them.
created_date
display_name
String: The primary name of the publisher.
hierarchy_level
Integer: The hierarchy level for this publisher. A publisher with hierarchy level 0 has no parent publishers. A hierarchy level 1 publisher has one parent above it, and so on.
id
String: The OpenAlex ID for this publisher.
ids
Object: All the external identifiers that we know about for this publisher. IDs are expressed as URIs whenever possible. Possible ID types:
ror
String: this publisher's ROR ID
image_thumbnail_url
This is usually a hotlink to a wikimedia image. You can change the width=300
parameter in the URL if you want a different thumbnail size.
image_url
String: URL where you can get an image representing this publisher. Usually this is a hotlink to a Wikimedia image, and usually it's a seal or logo.
lineage
parent_publisher
String: An OpenAlex ID linking to the direct parent of the publisher. This will be null if the publisher's hierarchy_level
is 0.
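For example, a small sketch that climbs from a publisher up to its top-level parent by following parent_publisher (the starting ID is the example publisher used elsewhere on this page; the requests library is assumed):

```python
import requests

# Sketch: follow parent_publisher until it is null, printing each level.
pub_id = "P4310319965"
while pub_id:
    pub = requests.get(f"https://api.openalex.org/publishers/{pub_id}").json()
    print(pub["hierarchy_level"], pub["display_name"])
    pub_id = pub.get("parent_publisher")  # null at the top of the hierarchy
```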
roles
In many cases, a single organization does not fit neatly into one role. For example, Yale University is a single organization that is a research university, funds research studies, and publishes an academic journal. The roles
property links the OpenAlex entities together for a single organization, and includes counts for the works associated with each role.
sources_api_url
String: A URL that will get you a list of all the sources published by this publisher.
We express this as an API URL (instead of just listing the sources themselves) because there might be thousands of sources linked to a publisher, and that's too many to fit here.
summary_stats
Object: Citation metrics for this publisher
While the h-index and the i-10 index are normally author-level metrics and the 2-year mean citedness is normally a journal-level metric, they can be calculated for any set of papers, so we include them for publishers.
updated_date
works_count
Integer: The number of works published by this publisher.
Get all topics in OpenAlex
By default we return 25 results per page. You can change this default and page through topics with the per-page
and page
parameters:
Get the second page of topics results, with 50 results returned per page
You also can sort results with the sort
parameter:
Sort topics by cited by count, descending
Continue on to learn how you can filter and search lists of topics.
You can use sample
to get a random batch of topics. Read more about sampling and how to add a seed
value here.
Get 10 random topics
You can use select
to limit the fields that are returned in a list of topics. More details are here.
Display only the id
, display_name
, and description
within topics results
Get the topic with the ID C71924100
:
That will return a Topic object, describing everything OpenAlex knows about the topic with that ID:
You can make up to 50 of these queries at once by requesting a list of entities and filtering on IDs using OR syntax.
You can use select
to limit the fields that are returned in a topic object. More details are here.
Display only the id
and display_name
for a topic object
Get counts of topics by :
It's best to read about group by before trying these out. It will show you how results are formatted, the number of results returned, and how to sort results.
Get topics that are in the subfield "Epidemiology" (id: 2713)
It's best to read about filters before trying these out. It will show you how to combine filters and build an AND, OR, or negation query
You can filter using these attributes of the Topic entity object (click each one to view their documentation on the Topic
object page):
(alias: openalex
)
These filters aren't attributes of the Topic object, but they're included to address some common use cases:
This works the same as using the search parameter for Topics.
Returns: topics with a display_name containing the given string; see the search page for details.
Get topics with display_name
containing "artificial" and "intelligence":
In most cases, you should use the search parameter instead of this filter because it uses a better search algorithm.
Get a list of OpenAlex publishers:
Our publisher data is closely tied to the publisher information in Wikidata. So the Canonical External ID for OpenAlex publishers is a Wikidata ID, and almost every publisher has one. Publishers are linked to sources through the host_organization field.
To learn more about how OpenAlex Keywords work in general, see the Keywords page at OpenAlex help pages.
These are the fields in a keyword object. When you use the API to get a single keyword or lists of keywords, this is what's returned.
String: The date this Keyword
object was created in the OpenAlex dataset, expressed as an ISO 8601 date string.
String: The last time anything in this keyword object changed, expressed as an ISO 8601 date string. This date is updated for any change at all, including increases in various counts.
Get the keyword with the ID cardiac-imaging
:
That will return a Keyword object, describing everything OpenAlex knows about the keyword with that ID:
You can make up to 50 of these queries at once by requesting a list of entities and filtering on IDs using OR syntax.
You can use select
to limit the fields that are returned in a keyword object. More details are here.
Display only the id
and display_name
for a keyword object
Get all keywords in OpenAlex
Get keywords that are in the subfield "Epidemiology" (id: 2713)
It's best to read about filters before trying these out. It will show you how to combine filters and build an AND, OR, or negation query
You can filter using these attributes of the Keyword object:
These filters aren't attributes of the Keyword object, but they're included to address some common use cases:
This works the same as using the search parameter for Keywords.
Returns: keywords with a display_name containing the given string.
Get keywords with display_name
containing "artificial" and "intelligence":
You can search for keywords using the search
query parameter, which searches the display_name field. For example:
Search keywords' display_name
"artificial intelligence":
You can read more about search here. It will show you how relevance score is calculated, how words are stemmed to improve search results, and how to do complex boolean searches.
Get counts of keywords by :
It's best to read about group by before trying these out. It will show you how results are formatted, the number of results returned, and how to sort results.
List: The countries where the publisher is primarily located, represented as ISO two-letter country codes.
List: The values of works_count and cited_by_count for each of the last ten years, binned by year. To put it another way: for every listed year, you can see how many new works are linked to this publisher, and how many times any work linked to this publisher was cited.
String: The date this Publisher
object was created in the OpenAlex dataset, expressed as an ISO 8601 date string.
openalex
String: this publisher's OpenAlex ID
wikidata
String: this publisher's Wikidata ID
String: Same as image_url, but it's a smaller image.
List: OpenAlex IDs of publishers. The list will include this publisher's ID, as well as any parent publishers. If this publisher's hierarchy_level
is 0, this list will only contain its own ID.
List: List of role objects, which include the role
(one of institution
, funder
, or publisher
), the id
(OpenAlex ID), and the works_count
.
The roles
list of an entity (Funder, Publisher, or Institution) always includes itself. In the case where an organization only has one role, the roles
will be a list of length one, with itself as the only item.
2yr_mean_citedness
Float: The 2-year mean citedness for this publisher. Also known as impact factor. We use the year prior to the current year for the citations (the numerator) and the two years prior to that for the citation-receiving publications (the denominator).
h_index
Integer: The h-index for this publisher.
i10_index
Integer: The i-10 index for this publisher.
String: The last time anything in this publisher object changed, expressed as an ISO 8601 date string. This date is updated for any change at all, including increases in various counts.
The best way to search for topics is to use the search
query parameter, which searches the display_name, description, and keywords fields. Example:
Search topics' display_name
and description
for "artificial intelligence":
You can read more about search here. It will show you how relevance score is calculated, how words are stemmed to improve search results, and how to do complex boolean searches.
You can also use search as a filter, allowing you to fine-tune the fields you're searching over. To do this, you append .search
to the end of the property you are filtering for:
Get topics with "medical" in the display_name
:
You can also use the filter default.search
, which works the same as using the search parameter.
You can get lists of publishers:
Get all publishers in OpenAlex https://api.openalex.org/publishers
Which returns a response like this:
By default we return 25 results per page. You can change this default and page through publishers with the per-page
and page
parameters:
Get the second page of publishers results, with 50 results returned per page https://api.openalex.org/publishers?per-page=50&page=2
You also can sort results with the sort
parameter:
Sort publishers by display name, descending https://api.openalex.org/publishers?sort=display_name:desc
Continue on to learn how you can filter and search lists of publishers.
You can use sample
to get a random batch of publishers. Read more about sampling and how to add a seed
value here.
Get 10 random publishers https://api.openalex.org/publishers?sample=10
You can use select
to limit the fields that are returned in a list of publishers. More details are here.
Display only the id
, display_name
, and alternate_titles
within publishers results
https://api.openalex.org/publishers?select=id,display_name,alternate_titles
You can group publishers with the group_by
parameter:
Get counts of publishers by country_codes
:
https://api.openalex.org/publishers?group_by=country_codes
Or you can group using one of the attributes below.
It's best to read about group by before trying these out. It will show you how results are formatted, the number of results returned, and how to sort results.
/publishers
group_by attributes
Organizations that fund research
Funders are organizations that fund research. OpenAlex indexes about 32,000 funders. Funder data comes from Crossref, and is enhanced with data from Wikidata and ROR.
Get a list of OpenAlex funders:
https://api.openalex.org/funders
Funders are connected to works through grants.
Learn more about what you can do with funders:
It's easy to get a publisher from the API with: /publishers/<entity_id>
. Here's an example:
Get the publisher with the OpenAlex ID P4310319965
:
https://api.openalex.org/publishers/P4310319965
That will return a Publisher
object, describing everything OpenAlex knows about the publisher with that ID:
You can make up to 50 of these queries at once by requesting a list of entities and filtering on IDs using OR syntax.
You can look up publishers using external IDs such as a Wikidata ID:
Get the publisher with Wikidata ID Q1479654: https://api.openalex.org/publishers/wikidata:Q1479654
Available external IDs for publishers are:
ROR
ror
Wikidata
wikidata
You can use select
to limit the fields that are returned in a publisher object. More details are here.
Display only the id
and display_name
for a publisher object
https://api.openalex.org/publishers/P4310319965?select=id,display_name
You can filter publishers with the filter
parameter:
Get publishers that are hierarchy level 0
https://api.openalex.org/publishers?filter=hierarchy_level:0
It's best to read about filters before trying these out. It will show you how to combine filters and build an AND, OR, or negation query
/publishers
attribute filters
You can filter using these attributes of the Publisher
entity object (click each one to view their documentation on the Publisher
object page):
ids.openalex
(alias: openalex
)
ids.ror
(alias: ror
)
ids.wikidata
(alias: wikidata
)
lineage
— Use this with a publisher ID to find that publisher and all of its children
summary_stats.2yr_mean_citedness
(accepts float, null, !null, can use range queries such as < >)
summary_stats.h_index
(accepts integer, null, !null, can use range queries)
summary_stats.i10_index
(accepts integer, null, !null, can use range queries)
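For example, a range query on these fields should follow the same syntax as other numeric filters, so something like https://api.openalex.org/publishers?filter=summary_stats.h_index:>100 would return publishers with an h-index above 100.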
/publishers
convenience filters
These filters aren't attributes of the Publisher
object, but they're included to address some common use cases:
continent
Value: a String with a valid continent filter
Returns: publishers that are located in the chosen continent.
Get publishers that are located in South America https://api.openalex.org/publishers?filter=continent:south_america
default.search
Value: a search string
This works the same as using the search
parameter for Publishers.
display_name.search
Value: a search string
Returns: publishers with a display_name
containing the given string; see the search page for details.
Get publishers with names containing "elsevier":
https://api.openalex.org/publishers?filter=display_name.search:elsevier
In most cases, you should use the search
parameter instead of this filter because it uses a better search algorithm.
These are the fields in a funder object. When you use the API to get a single funder or lists of funders, this is what's returned.
alternate_titles
List: A list of alternate titles for this funder.
cited_by_count
Integer: The total number of Works
that cite a work linked to this funder.
country_code
String: The country where this funder is located, represented as an ISO two-letter country code.
counts_by_year
List: The values of works_count
and cited_by_count
for each of the last ten years, binned by year. To put it another way: for every listed year, you can see how many new works are linked to this funder, and how many times any work linked to this funder was cited.
Years with zero citations and zero works have been removed so you will need to add those back in if you need them.
created_date
String: The date this Funder
object was created in the OpenAlex dataset, expressed as an ISO 8601 date string.
description
String: A short description of this funder, taken from Wikidata.
display_name
String: The primary name of the funder.
grants_count
Integer: The number of grants linked to this funder.
homepage_url
String: The URL for this funder's primary homepage.
id
String: The OpenAlex ID for this funder.
ids
Object: All the external identifiers that we know about for this funder. IDs are expressed as URIs whenever possible. Possible ID types:
crossref
String: this funder's Crossref ID
doi
String: this funder's DOI
openalex
String: this funder's OpenAlex ID
ror
String: this funder's ROR ID
wikidata
String: this funder's Wikidata ID
image_thumbnail_url
String: Same as image_url
, but it's a smaller image.
This is usually a hotlink to a wikimedia image. You can change the width=300
parameter in the URL if you want a different thumbnail size.
image_url
String: URL where you can get an image representing this funder. Usually this is a hotlink to a Wikimedia image, and usually it's a seal or logo.
roles
List: List of role objects, which include the role
(one of institution
, funder
, or publisher
), the id
(OpenAlex ID), and the works_count
.
In many cases, a single organization does not fit neatly into one role. For example, Yale University is a single organization that is a research university, funds research studies, and publishes an academic journal. The roles
property links the OpenAlex entities together for a single organization, and includes counts for the works associated with each role.
The roles
list of an entity (Funder, Publisher, or Institution) always includes itself. In the case where an organization only has one role, the roles
will be a list of length one, with itself as the only item.
summary_stats
Object: Citation metrics for this funder
2yr_mean_citedness
Float: The 2-year mean citedness for this funder. Also known as impact factor. We use the year prior to the current year for the citations (the numerator) and the two years prior to that for the citation-receiving publications (the denominator).
h_index
Integer: The h-index for this funder.
i10_index
Integer: The i-10 index for this funder.
While the h-index and the i-10 index are normally author-level metrics and the 2-year mean citedness is normally a journal-level metric, they can be calculated for any set of papers, so we include them for funders.
updated_date
String: The last time anything in this funder object changed, expressed as an ISO 8601 date string. This date is updated for any change at all, including increases in various counts.
works_count
Integer: The number of works linked to this funder.
You can get lists of funders:
Get all funders in OpenAlex https://api.openalex.org/funders
Which returns a response like this:
By default we return 25 results per page. You can change this default and page through funders with the per-page
and page
parameters:
Get the second page of funders results, with 50 results returned per page https://api.openalex.org/funders?per-page=50&page=2
You also can sort results with the sort
parameter:
Sort funders by display name, descending https://api.openalex.org/funders?sort=display_name:desc
Continue on to learn how you can filter and search lists of funders.
You can use sample
to get a random batch of funders. Read more about sampling and how to add a seed
value here.
Get 10 random funders https://api.openalex.org/funders?sample=10
You can use select
to limit the fields that are returned in a list of funders. More details are here.
Display only the id
, display_name
, and alternate_titles
within funders results
https://api.openalex.org/funders?select=id,display_name,alternate_titles
You can filter funders with the filter
parameter:
Get funders that are located in Canada https://api.openalex.org/funders?filter=country_code:ca
It's best to read about filters before trying these out. It will show you how to combine filters and build an AND, OR, or negation query
/funders
attribute filters
You can filter using these attributes of the Funder
entity object (click each one to view their documentation on the Funder
object page):
ids.openalex
(alias: openalex
)
ids.ror
(alias: ror
)
ids.wikidata
(alias: wikidata
)
summary_stats.2yr_mean_citedness
(accepts float, null, !null, can use range queries such as < >)
summary_stats.h_index
(accepts integer, null, !null, can use range queries)
summary_stats.i10_index
(accepts integer, null, !null, can use range queries)
/funders
convenience filters
These filters aren't attributes of the Funder
object, but they're included to address some common use cases:
continent
Value: a String with a valid continent filter
Returns: funders that are located in the chosen continent.
Get funders that are located in South America
https://api.openalex.org/funders?filter=continent:south_america
default.search
Value: a search string
This works the same as using the search
parameter for Funders.
description.search
Value: a search string
Returns: funders with a description
containing the given string; see the search page for details.
Get funders with description containing "health":
https://api.openalex.org/funders?filter=description.search:health
display_name.search
Value: a search string
Returns: funders with a display_name
containing the given string; see the search page for details.
Get funders with names containing "health":
https://api.openalex.org/funders?filter=display_name.search:health
In most cases, you should use the search
parameter instead of this filter because it uses a better search algorithm.
is_global_south
Value: a Boolean (true
or false
)
Returns: funders that are located in the Global South.
Get funders that are located in the Global South https://api.openalex.org/funders?filter=is_global_south:true
You can group funders with the group_by
parameter:
Get counts of funders by country_code
:
https://api.openalex.org/funders?group_by=country_code
Or you can group using one of the attributes below.
It's best to read about group by before trying these out. It will show you how results are formatted, the number of results returned, and how to sort results.
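If it helps, here is a minimal sketch (using the requests library) of reading a group_by response; each group in the response carries a key, a human-readable key_display_name, and a count:

```python
import requests

# Sketch: group funders by country_code and print the grouped counts.
resp = requests.get(
    "https://api.openalex.org/funders", params={"group_by": "country_code"}
)
for group in resp.json()["group_by"]:
    print(group["key"], group.get("key_display_name"), group["count"])
```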
/funders
group_by attributes
It's easy to get a funder from the API with: /funders/<entity_id>
. Here's an example:
Get the funder with the OpenAlex ID F4320332161
:
https://api.openalex.org/funders/F4320332161
That will return a Funder
object, describing everything OpenAlex knows about the funder with that ID:
You can make up to 50 of these queries at once by requesting a list of entities and filtering on IDs using OR syntax.
You can look up funders using external IDs such as a Wikidata ID:
Get the funder with Wikidata ID Q390551:
https://api.openalex.org/funders/wikidata:Q390551
Available external IDs for funders are:
ROR
ror
Wikidata
wikidata
You can use select
to limit the fields that are returned in a funder object. More details are here.
Display only the id
and display_name
for a funder object
https://api.openalex.org/funders/F4320332161?select=id,display_name
Where things are in the world
While geo is not a core entity within OpenAlex, geography is central to categorizing scholarly data. That's why OpenAlex uses United Nations data to divide the globe into continents and regions that make filtering data easier.
Here are some ways you can filter and group by continents and the Global South.
Get institutions located in South America
https://api.openalex.org/institutions?filter=continent:south_america
Get works where at least one author's institution is located in the Global South
https://api.openalex.org/works?filter=institutions.is_global_south:true
Group highly-cited authors by their last known institution's continent
https://api.openalex.org/authors?group-by=last_known_institution.continent&filter=cited_by_count:>100
Learn more about what you can do with geo:
The best way to search for publishers is to use the search
query parameter, which searches the display_name
and alternate_titles
fields. Example:
Search publishers' display_name
and alternate_titles
for "springer":
https://api.openalex.org/publishers?search=springer
You can read more about search here. It will show you how relevance score is calculated, how words are stemmed to improve search results, and how to do complex boolean searches.
You can also use search as a filter, allowing you to fine-tune the fields you're searching over. To do this, you append .search
to the end of the property you are filtering for:
Get publishers with "elsevier" in the display_name
:
https://api.openalex.org/publishers?filter=display_name.search:elsevier
The following field can be searched as a filter within publishers:
You can also use the filter default.search
, which works the same as using the search
parameter.
You can autocomplete publishers to create a very fast type-ahead style search function:
Autocomplete publishers with "els" in the display_name
:
https://api.openalex.org/autocomplete/publishers?q=els
This returns a list of publishers:
Read more in the autocomplete page in the API guide.
The best way to search for funders is to use the search
query parameter, which searches the display_name
, the alternate_titles
, and the description
fields. Example:
Search funders' display_name
, alternate_titles
, and description
for "health":
https://api.openalex.org/funders?search=health
You can read more about search here. It will show you how relevance score is calculated, how words are stemmed to improve search results, and how to do complex boolean searches.
You can also use search as a filter, allowing you to fine-tune the fields you're searching over. To do this, you append .search
to the end of the property you are filtering for:
Get funders with "florida" in the display_name
:
https://api.openalex.org/funders?filter=display_name.search:florida
The following fields can be searched as a filter within funders:
You can also use the filter default.search
, which works the same as using the search
parameter.
You can autocomplete funders to create a very fast type-ahead style search function:
Autocomplete funders with "national sci" in the display_name
:
https://api.openalex.org/autocomplete/funders?q=national+sci
This returns a list of funders with the funder location set as the hint:
Read more in the autocomplete page in the API guide.
You can filter Global South countries by using the boolean filter is_global_south
in the following endpoints:
You can also group by the Global South:
To see country-by-country details for a geographic region, filter by region, then group by country_code
.
Response:
ancestors
cited_by_count
Integer: The number of citations to works that have been tagged with this concept. Or less formally: the number of citations to this concept.
For example, if there are just two works tagged with this concept and one of them has been cited 10 times, and the other has been cited 1 time, cited_by_count
for this concept would be 11
.
counts_by_year
Years with zero citations and zero works have been removed so you will need to add those back in if you need them.
created_date
description
String: A brief description of this concept.
display_name
String: The English-language label of the concept.
id
String: The OpenAlex ID for this concept.
ids
Object: All the external identifiers that we know about for this concept. IDs are expressed as URIs whenever possible. Possible ID types:
wikipedia
(String: this concept's Wikipedia page URL)
Many concepts are missing one or more ID types (either because we don't know the ID, or because it was never assigned). Keys for null IDs are not displayed.
image_thumbnail_url
image_url
String: URL where you can get an image representing this concept, where available. Usually this is hosted on Wikipedia.
international
display_name
(Object)
value
(String): display_name
in the given language
level
related_concepts
score
(Float): The strength of association between this concept and the listed concept, on a scale of 0-100.
summary_stats
Object: Citation metrics for this concept
While the h-index and the i-10 index are normally author-level metrics and the 2-year mean citedness is normally a journal-level metric, they can be calculated for any set of papers, so we include them for concepts.
updated_date
wikidata
All OpenAlex concepts have a Wikidata ID, because all OpenAlex concepts are also Wikidata concepts.
works_api_url
String: A URL that will get you a list of all the works tagged with this concept.
We express this as an API URL (instead of just listing the works themselves) because there might be millions of works tagged with this concept, and that's too many to fit here.
works_count
Integer: The number of works tagged with this concept.
DehydratedConcept object
display_name
id
level
wikidata
Countries are mapped to continents using United Nations data. You can see the actual mapping used by the API.
Group institutions by continent
The Global South is a term used to identify regions within Latin America, Asia, Africa, and Oceania. Our source for this group of countries is the .
Get number of authors with last known institution in the Global South, by country
These are the original OpenAlex Concepts, which are being deprecated in favor of Topics. We will continue to provide these Concepts for Works, but we will not be actively maintaining, updating, or providing support for these concepts. Unless you have a good reason to be relying on them, we encourage you to look into Topics instead.
These are the fields in a concept object. When you use the API to get a single concept or lists of concepts, this is what's returned.
List: List of concepts that this concept descends from, as DehydratedConcept objects. See the description of the concept tree below for more details on how the different layers of concepts work together.
List: The values of works_count and cited_by_count for each of the last ten years, binned by year. To put it another way: for every listed year, you can see how many new works were tagged with this concept, and how many times any work tagged with this concept got cited.
String: The date this Concept
object was created in the OpenAlex dataset, expressed as an ISO 8601 date string.
mag
(Integer: this concept's Microsoft Academic Graph ID)
openalex
(String: this concept's OpenAlex ID. Same as the id property)
umls_cui
(List: this concept's UMLS CUI IDs)
umls_aui
(List: this concept's UMLS AUI IDs)
wikidata
(String: this concept's Wikidata ID. Same as the wikidata property)
String: Same as image_url, but it's a smaller image.
Object: This concept's display name in many languages, derived from article titles on each language's Wikipedia. See the source data for "Java Bytecode" for an example.
key
(String): the language code. A full list of supported language codes is available.
Integer: The level in the concept tree where this concept lives. Lower-level concepts are more general, and higher-level concepts are more specific: level 0 concepts have no ancestors, and level 5 concepts have no descendants.
List: Concepts that are similar to this one. Each listed concept is a DehydratedConcept object, with one additional attribute:
2yr_mean_citedness
Float: The 2-year mean citedness for this concept, also known as the impact factor. We use the year prior to the current year for the citations (the numerator) and the two years prior to that for the citation-receiving publications (the denominator).
h_index
Integer: The h-index for this concept.
i10_index
Integer: The i-10 index for this concept.
String: The last time anything in this concept object changed, expressed as an ISO 8601 date string. This date is updated for any change at all, including increases in various counts.
String: The Wikidata ID for this concept. This is the Canonical External ID for concepts.
The DehydratedConcept
is a stripped-down object, with most of its properties removed to save weight. Its only remaining properties are:
These are the original OpenAlex Concepts, which are being deprecated in favor of Topics. We will continue to provide these Concepts for Works, but we will not be actively maintaining, updating, or providing support for these concepts. Unless you have a good reason to be relying on them, we encourage you to look into Topics instead.
Get all the concepts used by OpenAlex:
The Canonical External ID for OpenAlex concepts is the Wikidata ID, and each of our concepts has one, because all OpenAlex concepts are also Wikidata concepts.
Concepts are hierarchical, like a tree. There are 19 root-level concepts, and six layers of descendants branching out from them, containing about 65 thousand concepts all told. This concept tree is a modified version of the one used by Microsoft Academic Graph.
You can view all the concepts and their position in the tree. About 85% of works are tagged with at least one concept.
Each work is tagged with multiple concepts, based on the title, abstract, and the title of its host venue. The tagging is done using an automated classifier that was trained on MAG's corpus; you can read more about the development and operation of this classifier, and you can also implement the classifier yourself.
A score is available for each concept assigned to a work, showing the classifier's confidence in choosing that concept. However, when assigning a lower-level child concept, we also assign all of its parent concepts all the way up to the root. This means that some concept assignment scores will be 0.0. The tagger adds concepts to works written in different languages, but it is optimized for English.
Concepts are linked to works via the concepts property, and to other concepts via the ancestors and related_concepts properties.
These are the original OpenAlex Concepts, which are being deprecated in favor of . We will continue to provide these Concepts for Works, but we will not be actively maintaining, updating, or providing support for these concepts. Unless you have a good reason to be relying on them, we encourage you to look into instead.
Get all concepts in OpenAlex
By default we return 25 results per page. You can change this default and page through concepts with the per-page
and page
parameters:
Get the second page of concepts results, with 50 results returned per page
You can also sort results with the sort
parameter:
Sort concepts by cited by count, descending
Continue on to learn how you can filter and group lists of concepts.
You can use sample
to get a random batch of concepts. Read more about sampling and how to add a seed
value.
Get 10 random concepts
You can use select
to limit the fields that are returned in a list of concepts. More details are here.
Display only the id
, display_name
, and description
within concepts results
Filter by continent:
Authors: /authors?filter=last_known_institution.continent:<continent>
Institutions: /institutions?filter=continent:<continent>
Works: /works?filter=institutions.continent:<continent>
Continent filter values:
Africa: africa
Antarctica: antarctica
Asia: asia
Europe: europe
North America: north_america
Oceania: oceania
South America: south_america
Group by continent:
Authors: /authors?group-by=last_known_institution.continent
Institutions: /institutions?group-by=continent
Works: /works?group-by=institutions.continent
Filter by Global South:
Authors: /authors?filter=last_known_institution.is_global_south:<boolean>
Institutions: /institutions?filter=is_global_south:<boolean>
Works: /works?filter=institutions.is_global_south:<boolean>
Group by Global South:
Authors: /authors?group-by=last_known_institution.is_global_south
Institutions: /institutions?group-by=is_global_south
Works: /works?group-by=institutions.is_global_south
These are the original OpenAlex Concepts, which are being deprecated in favor of Topics. We will continue to provide these Concepts for Works, but we will not be actively maintaining, updating, or providing support for these concepts. Unless you have a good reason to be relying on them, we encourage you to look into Topics instead.
You can filter concepts with the filter
parameter:
Get concepts that are at level 0 (top level)
https://api.openalex.org/concepts?filter=level:0
It's best to read about filters before trying these out. It will show you how to combine filters and build an AND, OR, or negation query.
/concepts attribute filters
You can filter using these attributes of the Concept object (click each one to view their documentation on the Concept object page):
ids.openalex
(alias: openalex
)
summary_stats.2yr_mean_citedness
(accepts float, null, !null, can use range queries such as < >)
summary_stats.h_index
(accepts integer, null, !null, can use range queries)
summary_stats.i10_index
(accepts integer, null, !null, can use range queries)
/concepts convenience filters
These filters aren't attributes of the Concept object, but they're included to address some common use cases:
default.search
Value: a search string
This works the same as using the search
parameter for Concepts.
display_name.search
Value: a search string
Returns: concepts with a display_name
containing the given string; see the search page for details.
Get concepts with display_name
containing "electrodynamics":
https://api.openalex.org/concepts?filter=display_name.search:electrodynamics
In most cases, you should use the search
parameter instead of this filter because it uses a better search algorithm.
has_wikidata
Value: a Boolean (true
or false
)
Returns: concepts that have or lack a Wikidata ID, depending on the given value. For now, all concepts in OpenAlex do have Wikidata IDs.
Get concepts without Wikidata IDs:
https://api.openalex.org/concepts?filter=has_wikidata:false
Get a single entity, based on an ID
This is a more detailed guide to single entities in OpenAlex. If you're just getting started, check out get a single work.
It's easy to get a singleton entity object from the API: /<entity_name>/<entity_id>.
Here's an example:
Get the work with the OpenAlex ID W2741809807
: https://api.openalex.org/works/W2741809807
That will return a Work
object, describing everything OpenAlex knows about the work with that ID. You can use IDs other than OpenAlex IDs, and you can also format the IDs in different ways. Read below to learn more.
You can make up to 50 of these queries at once by requesting a list of entities and filtering on IDs using OR syntax.
To get a single entity, you need a single unambiguous identifier, like an ORCID or an OpenAlex ID. If you've got an ambiguous identifier (like an author's name), you'll want to search instead.
The OpenAlex ID is the primary key for all entities. It's a URL shaped like this: https://openalex.org/<OpenAlex_key>
. Here's a real-world example:
https://openalex.org/W2741809807
The OpenAlex ID has two parts. The first part is the Base; it's always https://openalex.org/.
The second part is the Key; it's the unique primary key that identifies a given resource in our database.
The key starts with a letter; that letter tells you what kind of entity you've got: W(ork), A(uthor), S(ource), I(nstitution), C(oncept), P(ublisher), or F(under). The IDs are not case-sensitive, so w2741809807
is just as valid as W2741809807
. So in the example above, the Key is W2741809807
, and the W
at the front tells us that this is a Work
.
Because OpenAlex was launched as a replacement for Microsoft Academic Graph (MAG), OpenAlex IDs are designed to be backwards-compatible with MAG IDs, where they exist. To find the MAG ID, just take the first letter off the front of the unique part of the ID (so in the example above, the MAG ID is 2741809807
). Of course this won't yield anything useful for entities that don't have a MAG ID.
At times we need to merge two Entities, effectively deleting one of them. This usually happens when we discover two Entities that represent the same real-world entity - for example, two Authors
that are really the same person.
If you request an Entity using its OpenAlex ID, and that Entity has been merged into another Entity, you will be redirected to the Entity it has been merged into. For example, https://openalex.org/A5092938886 has been merged into https://openalex.org/A5006060960, so in the API the former will redirect to the latter:
Most clients will handle this transparently; you'll get the data for author A5006060960 without knowing the redirect even happened. If you have stored Entity ID lists and do notice the redirect, you might as well replace the merged-away ID with the new one to skip the redirect next time.
For each entity type, you can retrieve the entity using any of the external IDs we support--not just the native OpenAlex IDs. So for example:
Get the work with this doi: https://doi.org/10.7717/peerj.4375
:
https://api.openalex.org/works/https://doi.org/10.7717/peerj.4375
This works with DOIs, ISSNs, ORCIDs, and lots of other IDs...in fact, you can use any ID listed in an entity's ids
property, as listed below:
Most of the external IDs OpenAlex supports are canonically expressed as URLs...for example, the canonical form of a DOI always starts with https://doi.org/
. You can always use these URL-style IDs in the entity endpoints. Examples:
Get the institution with the ROR https://ror.org/02y3ad647 (University of Florida):
https://api.openalex.org/institutions/https://ror.org/02y3ad647
Get the author with the ORCID https://orcid.org/0000-0003-1613-5981 (Heather Piwowar):
https://api.openalex.org/authors/https://orcid.org/0000-0003-1613-5981
For simplicity and clarity, you may also want to express those IDs in a simpler, URN-style format, and that's supported as well; you just write the namespace of the ID, followed by the ID itself. Here are the same examples from above, but in the namespace:id format:
Get the institution with the ROR https://ror.org/02y3ad647 (University of Florida):
https://api.openalex.org/institutions/ror:02y3ad647
Get the author with the ORCID https://orcid.org/0000-0003-1613-5981 (Heather Piwowar):
https://api.openalex.org/authors/orcid:0000-0003-1613-5981
Finally, if you're using an OpenAlex ID, you can be even more succinct, and just use the Key part of the ID all by itself, the part that looks like w1234567
:
Get the work with OpenAlex ID https://openalex.org/W2741809807: https://api.openalex.org/works/W2741809807
Every entity has an OpenAlex ID. Most entities also have IDs in other systems, too. There are hundreds of different ID systems, but we've selected a single external ID system for each entity to provide the Canonical External ID--this is the ID in the system that's been most fully adopted by the community, and is most frequently used in the wild. We support other external IDs as well, but the canonical ones get a privileged spot in the API and dataset.
These are the Canonical External IDs:
Works: DOI
Authors: ORCID
Sources: ISSN-L
Institutions: ROR ID
Concepts: Wikidata ID
Publishers: Wikidata ID
The full entity objects can get pretty unwieldy, especially when you're embedding a list of them in another object (for instance, a list of Concept
s in a Work
). For these cases, all the entities except Work
s have a dehydrated version. This is a stripped-down representation of the entity that carries only its most essential properties. These properties are documented individually on their respective entity pages.
These are the original OpenAlex Concepts, which are being deprecated in favor of Topics. We will continue to provide these Concepts for Works, but we will not be actively maintaining, updating, or providing support for these concepts. Unless you have a good reason to be relying on them, we encourage you to look into Topics instead.
It's easy to get a concept from the API with: /concepts/<entity_id>
. Here's an example:
Get the concept with the OpenAlex ID C71924100
:
https://api.openalex.org/concepts/C71924100
That will return a Concept
object, describing everything OpenAlex knows about the concept with that ID:
You can make up to 50 of these queries at once by requesting a list of entities and filtering on IDs using OR syntax.
You can look up concepts using external IDs such as a wikidata ID:
Get the concept with wikidata ID Q11190: https://api.openalex.org/concepts/wikidata:Q11190
Available external IDs for concepts are:
Microsoft Academic Graph (MAG)
mag
Wikidata
wikidata
You can use select
to limit the fields that are returned in a concept object. More details are here.
Display only the id
and display_name
for a concept object
https://api.openalex.org/concepts/C71924100?select=id,display_name
It's easy to get a list of entity objects from the API: /<entity_name>
. Here's an example:
Get a list of all the topics in OpenAlex:
https://api.openalex.org/topics
This query returns a meta
object with details about the query, a results
list of Topic
objects, and an empty group_by
list:
Listing entities is a lot more useful when you add parameters to page, filter, search, and sort them. Keep reading to learn how to do that.
These are the original OpenAlex Concepts, which are being deprecated in favor of Topics. We will continue to provide these Concepts for Works, but we will not be actively maintaining, updating, or providing support for these concepts. Unless you have a good reason to be relying on them, we encourage you to look into Topics instead.
You can group concepts with the group_by
parameter:
Get counts of concepts by level
:
https://api.openalex.org/concepts?group_by=level
Or you can group using one of the attributes below.
It's best to read about group by before trying these out. It will show you how results are formatted, the number of results returned, and how to sort results.
/concepts group_by attributes
You can get a random result by using the string random
where an ID would normally go. OMG that's so random! Each time you call this URL you'll get a different entity. Examples:
Get a random institution:
https://api.openalex.org/institutions/random
Get a random concept:
https://api.openalex.org/concepts/random
You can use select
to choose top-level fields you want to see in a result.
Display id
and display_name
for a work
https://api.openalex.org/works/W2138270253?select=id,display_name
Read more about this feature here.
Filters narrow the list down to just entities that meet a particular condition--specifically, a particular value for a particular attribute.
Filters are set using the filter
parameter, formatted like this: filter=attribute:value,attribute2:value2
. Examples:
Get the works whose type is book
:
https://api.openalex.org/works?filter=type:book
Get the authors whose name is Einstein:
https://api.openalex.org/authors?filter=display_name.search:einstein
Filters are case-insensitive.
For numerical filters, use the less-than (<
) and greater-than (>
) symbols to filter by inequalities. Example:
Get sources that host more than 1000 works:
https://api.openalex.org/sources?filter=works_count:>1000
Some attributes have special filters that act as syntactic sugar around commonly-expressed inequalities: for example, the from_publication_date
filter on works
. See the endpoint-specific documentation below for more information. Example:
Get all works published between 2022-01-01 and 2022-01-26 (inclusive):
https://api.openalex.org/works?filter=from_publication_date:2022-01-01,to_publication_date:2022-01-26
You can negate any filter, numerical or otherwise, by prepending the exclamation mark symbol (!
) to the filter value. Example:
Get all institutions except for ones located in the US:
https://api.openalex.org/institutions?filter=country_code:!us
By default, the returned result set includes only records that satisfy all the supplied filters. In other words, filters are combined as an AND query. Example:
Get all works that have been cited more than once and are free to read:
https://api.openalex.org/works?filter=cited_by_count:>1,is_oa:true
To create an AND query within a single attribute, you can either repeat a filter, or use the plus symbol (+
):
Get all the works that have an author from France and an author from the UK:
Using repeating filters: https://api.openalex.org/works?filter=institutions.country_code:fr,institutions.country_code:gb
Using the plus symbol (+
): https://api.openalex.org/works?filter=institutions.country_code:fr+gb
Note that the plus symbol (+
) syntax will not work for search filters, boolean filters, or numeric filters.
Use the pipe symbol (|
) to input lists of values such that any of the values can be satisfied--in other words, when you separate filter values with a pipe, they'll be combined as an OR
query. Example:
Get all the works that have an author from France or an author from the UK:
https://api.openalex.org/works?filter=institutions.country_code:fr|gb
This is particularly useful when you want to retrieve many records by ID all at once. Instead of making a whole bunch of singleton calls in a loop, you can make one call, like this:
Get the works with DOI 10.1371/journal.pone.0266781
or with DOI 10.1371/journal.pone.0267149
(note the pipe separator between the two DOIs):
https://api.openalex.org/works?filter=doi:https://doi.org/10.1371/journal.pone.0266781|https://doi.org/10.1371/journal.pone.0267149
You can combine up to 100 values for a given filter in this way. You will also need to use the parameter per-page=100
to get all of the results per query. See our blog post for a tutorial.
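As a rough illustration, a shell snippet can assemble that pipe-separated filter from a list of IDs before making a single call; the two DOIs below are the ones from the example above, and curl is assumed to be available:

```bash
# Join a batch of DOIs (up to 100) with "|" and fetch them all in one request.
dois=(
  "https://doi.org/10.1371/journal.pone.0266781"
  "https://doi.org/10.1371/journal.pone.0267149"
)
joined=$(IFS='|'; echo "${dois[*]}")
curl -s -G "https://api.openalex.org/works" \
  --data-urlencode "filter=doi:$joined" \
  --data-urlencode "per-page=100"
```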
You can use OR for values within a given filter, but not between different filters. So this, for example, doesn't work and will return an error:
Get either French works or ones published in the journal with ISSN 0957-1558:
https://api.openalex.org/works?filter=institutions.country_code:fr|primary_location.source.issn:0957-1558
The filters for each entity can be found here:
Use the ?sort
parameter to specify the property you want your list sorted by. You can sort by these properties, where they exist:
display_name
cited_by_count
works_count
publication_date
relevance_score
(only exists if there's a search filter active)
By default, sort direction is ascending. You can reverse this by appending :desc
to the sort key like works_count:desc
. You can sort by multiple properties by providing multiple sort keys, separated by commas. Examples:
All works, sorted by cited_by_count
(highest counts first)
https://api.openalex.org/works?sort=cited_by_count:desc
All sources, in alphabetical order by title:
https://api.openalex.org/sources?sort=display_name
You can sort by relevance_score when searching:
Sort by year, then by relevance_score when searching for "bioplastics":
https://api.openalex.org/works?filter=display_name.search:bioplastics&sort=publication_year:desc,relevance_score:desc
An error is thrown if attempting to sort by relevance_score
without a search query.
Sometimes instead of just listing entities, you want to group them into facets, and count how many entities are in each group. For example, maybe you want to count the number of Works
by open access status. To do that, you call the entity endpoint, adding the group_by
parameter. Example:
Get counts of works by type:
https://api.openalex.org/works?group_by=type
This returns a meta
object with details about the query, and a group_by
object with the groups you've asked for:
So from this we can see that the majority of works (202,814,957 of them) are type article
, with another 21,250,659 book-chapter
, and so forth.
You can group by most of the same properties that you can filter by, and you can combine grouping with filtering.
Each group object in the group_by
list contains three properties:
key
Value: a string; the OpenAlex ID or raw value of the group_by
parameter for members of this group. See details on key
and key_display_name
.
key_display_name
Value: a string; the display_name
or raw value of the group_by
parameter for members of this group. See details on key
and key_display_name
.
count
Value: an integer; the number of entities in the group.
The "unknown" group is hidden by default. If you want to include this group in the response, add :include_unknown
after the group-by parameter.
Group works by authorships.countries
(unknown group hidden):
https://api.openalex.org/works?group_by=authorships.countries
Group works by authorships.countries
(includes unknown group):
https://api.openalex.org/works?group_by=authorships.countries:include_unknown
key
and key_display_name
If the value being grouped by is an OpenAlex Entity
, the key
and key_display_name
properties will be that Entity
's id
and display_name
, respectively.
Group Works
by Institution
:
https://api.openalex.org/works?group_by=authorships.institutions.id
For one group, key
is "https://openalex.org/I136199984" and key_display_name
is "Harvard University".
Otherwise, key
is the same as key_display_name
; both are the raw value of the group_by
parameter for this group.
Group Concepts
by level
:
https://api.openalex.org/concepts?group_by=level
For one group, both key
and key_display_name
are "3".
meta properties
meta.count
is the total number of works (this will be all works if no filter is applied). meta.groups_count
is the count of groups (in the current page).
If there are no groups in the response, meta.groups_count
is null
.
Due to a technical limitation, we can only report the number of groups in the current page, and not the total number of groups.
The maximum number of groups returned is 200. If you want to get more than 200 groups, you can use cursor pagination. This works the same as it does when getting lists of entities, so head over to the section on paging through lists of results to learn how.
Due to technical constraints, when paging, results are sorted by key, rather than by count.
You can use the /text
API endpoint to tag your own free text with OpenAlex's "aboutness" assignments—topics, keywords, and concepts.
Accepts a title
and optional abstract
in the GET params or as a POST request. The results are straight from the model, with 0 values truncated.
Get OpenAlex Keywords for your text
https://api.openalex.org/text/keywords?title=type%201%20diabetes%20research%20for%20children
Get OpenAlex Topics for your text
https://api.openalex.org/text/topics?title=type%201%20diabetes%20research%20for%20children
Get OpenAlex Concepts for your text
https://api.openalex.org/text/concepts?title=type%201%20diabetes%20research%20for%20children
Get all of the above in one request
https://api.openalex.org/text?title=type%201%20diabetes%20research%20for%20children
Example response for that last one:
Queries are limited to between 20 and 2000 characters. The endpoints are rate limited to 1 per second and 1000 requests per day.
You can use select
to limit the fields that are returned in results.
Display works with only the id
, doi
, and display_name
returned in the results
https://api.openalex.org/works?select=id,doi,display_name
The fields you choose must exist within the entity (of course). You can only select root-level fields.
So if we have a record like so:
You can choose to display id
and open_access
, but you will get an error if you try to choose open_access.is_oa
.
You can use select fields when getting lists of entities or a single entity. It does not work with group-by or autocomplete.
First off: anyone can get the data for free. While the files are hosted on S3 and we’ll be using Amazon tools in these instructions, you don’t need an Amazon account.
Many thanks to the AWS Open Data program. They cover the data-transfer fees (about $70 per download!) so users don't have to.
Before you load the snapshot contents to your database, you’ll need to get the files that make it up onto your own computer. There are exceptions, like loading to Redshift from S3 or using an ETL product like Xplenty with an S3 connector. If either of these applies to you, see if the snapshot data format is enough to get you started.
The easiest way to get the files is with the Amazon Web Services Command Line Interface (AWS CLI). Sample commands in this documentation will use the AWS CLI. You can find instructions for installing it on your system here: https://docs.aws.amazon.com/cli/latest/userguide/getting-started-install.html
You can also browse the snapshot files using the AWS console here: https://openalex.s3.amazonaws.com/browse.html. This browser and the CLI will work without an account.
This shell command will copy everything in the openalex
S3 bucket to a local folder named openalex-snapshot
. It'll take up roughly 300GB of disk space.
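The command itself isn't reproduced above, but it is a straightforward aws s3 sync of the public bucket; a minimal sketch looks like this (the --no-sign-request flag lets you download without an AWS account):

```bash
# Copy the entire openalex bucket into a local folder named openalex-snapshot.
aws s3 sync "s3://openalex" "openalex-snapshot" --no-sign-request
```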
If you download the snapshot into an existing folder, you'll need to use the aws s3 sync
--delete
flag to remove files from any previous downloads. You can also remove the contents of the destination folder manually. If you don't, you will see duplicate Entities that have moved from one file to another between snapshot updates.
The size of the snapshot will change over time. You can check the current size before downloading by looking at the output of:
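One way to do that check (a sketch, not the only option) is to ask the CLI to summarize the bucket contents:

```bash
# Print per-file sizes plus a grand total for everything under /data.
aws s3 ls --summarize --human-readable --recursive --no-sign-request s3://openalex/data/
```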
You should get a file structure like this (edited for length - there are more objects in the actual bucket):
You can use sample
to get a random list of up to 10,000 results.
Get 100 random works https://api.openalex.org/works?sample=100&per-page=100
Get 50 random works that are open access and published in 2021 https://api.openalex.org/works?filter=open_access.is_oa:true,publication_year:2021&sample=50&per-page=50
You can add a seed
value in order to retrieve the same set of random records, in the same order, multiple times.
Get 20 random sources with a seed value https://api.openalex.org/sources?sample=20&seed=123
Depending on your query, random results with a seed value may change over time due to new records coming into OpenAlex.
The sample size is limited to 10,000 results.
You must provide a seed
value when paging beyond the first page of results. Without a seed value, you might get duplicate records in your results.
You must use basic paging when sampling. Cursor pagination is not supported.
You can see executable examples of paging in this user-contributed Jupyter notebook!
Use the page
query parameter to control which page of results you want (e.g., page=1
, page=2
, etc). By default there are 25 results per page; you can use the per-page
parameter to change that to any number between 1 and 200.
Get the 2nd page of a list:
https://api.openalex.org/works?page=2
Get 200 results on the second page:
https://api.openalex.org/works?page=2&per-page=200
Basic paging only works to get the first 10,000 results of any list. If you want to see more than 10,000 results, you'll need to use cursor paging.
Cursor paging is a bit more complicated than basic paging, but it allows you to access as many records as you like.
To use cursor paging, you request a cursor by adding the cursor=*
parameter-value pair to your query.
Get a cursor in order to start cursor pagination:
https://api.openalex.org/works?filter=publication_year:2020&per-page=100&cursor=*
The response to your query will include a next_cursor
value in the response's meta
object. Here's what it looks like:
To retrieve the next page of results, copy the meta.next_cursor
value into the cursor field of your next request.
Get the next page of results using a cursor value:
https://api.openalex.org/works?filter=publication_year:2020&per-page=100&cursor=IlsxNjA5MzcyODAwMDAwLCAnaHR0cHM6Ly9vcGVuYWxleC5vcmcvVzI0ODg0OTk3NjQnXSI=
This second page of results will have a new value for meta.next_cursor
. You'll use this new value the same way you did the first, and it'll give you the second page of results. To get all the results, keep repeating this process until meta.next_cursor
is null and the results
set is empty.
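As an illustration, here is a minimal shell sketch of that loop, assuming curl and jq are installed; the mailto address is a placeholder and work_ids.txt is just an example destination:

```bash
# Page through all 2020 works, 100 at a time, collecting OpenAlex IDs.
cursor="*"
while [ -n "$cursor" ] && [ "$cursor" != "null" ]; do
  page=$(curl -s -G "https://api.openalex.org/works" \
    --data-urlencode "filter=publication_year:2020" \
    --data-urlencode "per-page=100" \
    --data-urlencode "mailto=you@example.com" \
    --data-urlencode "cursor=$cursor")
  echo "$page" | jq -r '.results[].id' >> work_ids.txt   # process each page of results
  cursor=$(echo "$page" | jq -r '.meta.next_cursor')     # becomes "null" when there are no more pages
done
```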
Besides using cursor paging to get entities, you can also use it in group_by
queries.
Don't use cursor paging to download the whole dataset.
It's bad for you because it will take many days to page through a long list like /works or /authors.
It's bad for us (and other users!) because it puts a massive load on our servers.
Instead, download everything at once, using the OpenAlex snapshot. It's free, easy, fast, and you get all the results in the same format you'd get from the API.
For most use cases, the REST API is your best option. However, you can also download (instructions here) and install a complete copy of the OpenAlex database on your own server, using the database snapshot. The snapshot consists of seven files (split into smaller files for convenience), with one file for each of our seven entity types. The files are in the JSON Lines format; each line is a JSON object, exactly the same as you'd get from our API. The properties of these JSON objects are documented in each entity's object section (for example, the Work
object).
The snapshot is updated about once per month; you can read release notes for each new update here.
If you've worked with a dataset like this before, the snapshot data format may be all you need to get going. If not, read on.
The rest of this guide will tell you how to (a) download the snapshot and (b) upload it to your own database. We’ll cover two general approaches:
Load the JSON records as-is into a data warehouse (we'll use BigQuery) and query them directly.
Flatten the records into a normalized schema in a relational database (we’ll use PostgreSQL) while preserving the relationships between objects.
We'll assume you're initializing a fresh snapshot. To keep it up to date, you'll have to take the information from Downloading updated Entities and generalize from the steps in the guide.
This is hard. Working with such a big and complicated dataset hardly ever goes according to plan. If it gets scary, try the REST API. In fact, try the REST API first. It can answer most of your questions and has a much lower barrier to entry.
There’s more than one way to do everything. We’ve tried to pick one reasonable default way to do each step, so if something doesn’t work in your environment or with the tools you have available, let us know.
Up next: the snapshot data format, downloading the data and getting it into your database.
Now that you have a copy of the OpenAlex data you can do one of these:
upload it to a data warehouse
upload it to a relational database
We're working on making a collection of tutorials to demonstrate how to use OpenAlex to answer all sorts of questions. Check back often for more! Here's what we have currently:
Turn the page - Use paging to collect all of the works from an author.
Monitoring Open Access publications for a given institution - Learn how to filter and group with the API.
What are the publication sources located in Japan? - Use the source
entity to look at a country's publications over time.
Calculate the h-index for a given author - Use filtering, sorting, and paging to get citation counts and calculate the h-index, an author-level metric.
How are my institution's researchers collaborating with people around the globe? - Learn about institutions
in OpenAlex while exploring the international research collaborations made by a university.
Getting started with OpenAlex Premium - Use your Premium API Key to download the latest updates from our API and keep your data in sync with ours.
Introduction to openalexR - In this R notebook, an accompaniment to the webinar on openalexR, you'll learn the basics of using the openalexR library to get data from OpenAlex.
The API is the primary way to get OpenAlex data. It's free and requires no authentication. The limit is 100,000 API calls per user per day. For best performance, add your email to all API requests, like mailto=example@domain.com
.
Get lists of entities — Learn how to use paging, filtering, and sorting
Get groups of entities — Group and count entities in different ways
Rate limits and authentication — Learn about joining the polite pool
Tutorials — Hands-on examples with code
There are several third-party libraries you can use to get data from OpenAlex:
openalexR (R)
OpenAlex2Pajek (R)
KtAlex (Kotlin)
PyAlex (Python)
diophila (Python)
OpenAlexAPI (Python)
If you're looking for a visual interface, you can also check out the free VOSviewer, which lets you make network visualizations based on OpenAlex data:
These are the original OpenAlex Concepts, which are being deprecated in favor of Topics. We will continue to provide these Concepts for Works, but we will not be actively maintaining, updating, or providing support for these concepts. Unless you have a good reason to be relying on them, we encourage you to look into Topics instead.
The best way to search for concepts is to use the search
query parameter, which searches the display_name
and description
fields. Example:
Search concepts' display_name
and description
for "artificial intelligence":
https://api.openalex.org/concepts?search=artificial%20intelligence
You can read more about search here. It will show you how relevance score is calculated, how words are stemmed to improve search results, and how to do complex boolean searches.
You can also use search as a filter, allowing you to fine-tune the fields you're searching over. To do this, you append .search
to the end of the property you are filtering for:
Get concepts with "medical" in the display_name
:
https://api.openalex.org/concepts?filter=display_name.search:medical
The following field can be searched as a filter within concepts:
You can also use the filter default.search
, which works the same as using the search
parameter.
You can autocomplete concepts to create a very fast type-ahead style search function:
Autocomplete concepts with "comp" in the display_name
:
https://api.openalex.org/autocomplete/concepts?q=comp
This returns a list of concepts with the description set as the hint:
Read more in the autocomplete page in the API guide.
The API is rate-limited. The limits are:
max 100,000 calls every day, and also
max 10 requests every second.
If you hit the API more than 100k times in a day or more than 10 in a second, you'll get 429
errors instead of useful data.
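If you do bump into the limit, one common pattern (sketched here, not an official recommendation) is to watch for the 429 status code and back off before retrying; the query URL and email are placeholders:

```bash
# Back off and retry when the API answers with HTTP 429.
url="https://api.openalex.org/works?per-page=25&mailto=you@example.com"
for attempt in 1 2 3; do
  status=$(curl -s -o response.json -w '%{http_code}' "$url")
  if [ "$status" != "429" ]; then
    break                    # got a non-rate-limited response; stop retrying
  fi
  sleep $((attempt * 2))     # wait 2s, then 4s, then 6s
done
```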
Are those rate limits too low for you? No problem! We can raise those limits as high as you need if you subscribe to our Premium plan. And if you're an academic researcher we can likely do it for free; just drop us a line at support@openalex.org.
Are you scrolling through a list of entities, calling the API for each? You can go way faster by squishing 50 requests into one using our OR syntax. Here's a tutorial showing how.
The OpenAlex API doesn't require authentication. However, it is helpful for us to know who's behind each API call, for two reasons:
It allows us to get in touch with the user if something's gone wrong--for instance, their script has run amok and we've needed to start blocking or throttling their usage.
It lets us report back to our funders, which helps us keep the lights on.
Like Crossref (whose approach we are shamelessly stealing), we separate API users into two pools, the polite pool and the common pool. The polite pool has more consistent response times. It's where you want to be.
To get into the polite pool, you just have to give us an email where we can contact you. You can give us this email in one of two ways:
Add the mailto=you@example.com
parameter in your API request, like this: https://api.openalex.org/works?mailto=you@example.com
Add mailto:you@example.com
somewhere in your User-Agent request header.
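For example, both approaches look like this with curl (the email address is a placeholder you should replace with your own):

```bash
# Option 1: pass your email as a mailto query parameter.
curl "https://api.openalex.org/works?mailto=you@example.com"

# Option 2: put a mailto address in the User-Agent header instead.
curl -H "User-Agent: my-openalex-script (mailto:you@example.com)" "https://api.openalex.org/works"
```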
You don't need an API key to use OpenAlex. However, premium users do get an API key, which grants higher API limits and enables the use of special filters like from_updated_date
. Using the API key is simple; just add it to your URL using the api_key param.
Get a list of all works, using the api key 424242:
https://api.openalex.org/works?api_key=424242
Because the API is all GET requests without fancy authentication, you can view any request in your browser. This is a very useful and pleasant way to explore the API and debug scripts; we use it all the time.
However, this is much nicer if you install an extension to pretty-print the JSON; JSONVue (Chrome) and JSONView (Firefox) are popular, free choices. Here's what an API response looks like with one of these extensions enabled:
The autocomplete endpoint lets you add autocomplete or typeahead components to your applications, without the overhead of hosting your own API endpoint.
Each endpoint takes a string, and (very quickly) returns a list of entities that match that string.
Here's an example of an autocomplete component that lets users quickly select an institution:
This is the query behind that result: https://api.openalex.org/autocomplete/institutions?q=flori
The autocomplete endpoint is very fast; queries generally return in around 200ms. If you'd like to see it in action, we're using a slightly-modified version of this endpoint in the OpenAlex website here: https://explore.openalex.org/
The format for requests is simple: /autocomplete/<entity_type>?q=<query>
entity_type
(optional): the name of one of the OpenAlex entities: works
, authors
, sources
, institutions
, concepts
, publishers
, or funders
.
query
: the search string supplied by the user.
You can optionally filter autocomplete results.
Each request returns a response object with two properties:
meta
: an object with information about the request, including timing and results count
results
: a list of up to ten results for the query, sorted by citation count. Each result represents an entity that matched against the query.
Each object in the results
list includes these properties:
id
(string): The OpenAlex ID for this result entity.
external_id
(string): The Canonical External ID for this result entity.
display_name
(string): The entity's display_name
property.
entity_type
(string): The entity's type: author
, concept
, institution
, source
, publisher
, funder
, or work
.
cited_by_count
(integer): The entity's cited_by_count
property. For works this is simply the number of incoming citations. For other entities, it's the sum of incoming citations for all the works linked to that entity.
works_count
(integer): The number of works associated with the entity. For entity type work
it's always null.
hint
: Some extra information that can help identify the right item. Differs by entity type.
hint
propertyResult objects have a hint
property. You can show this to users to help them identify which item they're selecting. This is particularly helpful when the display_name
values of different results are the same, as often happens when autocompleting an author entity--a user who types in John Smi
is going to see a lot of identical-looking results, even though each one is a different person.
The content of the hint
property varies depending on what kind of entity you're looking up:
Work
: The work's authors' display names, concatenated. e.g. "R. Alexander Pyron, John J. Wiens"
Author
: The author's last known institution, e.g. "University of North Carolina at Chapel Hill, USA"
Source
: The host_organization
, e.g. "Oxford University Press"
Institution
: The institution's location, e.g. "Gainesville, USA"
Concept
: The Concept's description, e.g. "the study of relation between plant species and genera"
Canonical External IDs and OpenAlex IDs are detected within autocomplete queries and matched to the appropriate record if it exists. For example:
The query https://api.openalex.org/autocomplete?q=https://orcid.org/0000-0002-7436-3176
will search for the author with ORCID ID https://orcid.org/0000-0002-7436-3176
and return 0 records if it does not exist.
The query https://api.openalex.org/autocomplete/sources?q=S49861241
will search for the source with OpenAlex ID https://openalex.org/S49861241
and return 0 records if it does not exist.
All entity filters and search queries can be added to autocomplete and work as expected, like:
https://api.openalex.org/autocomplete/works?filter=publication_year:2010&search=frogs&q=greenhou
Here are the details on where the OpenAlex data lives and how it's structured.
The data files are gzip-compressed JSON Lines, one row per entity.
Records are partitioned by updated_date. Within each entity type prefix, each object (file) is further prefixed by this date. For example, if an Author
has an updated_date of 2021-12-30, it will be under the prefix /data/authors/updated_date=2021-12-30/.
If you're initializing a fresh snapshot, the updated_date
partitions aren't important yet. You need all the entities, so for Authors
you would get /data/authors/*/*.gz.
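For instance, a sketch of grabbing just the Author files with the AWS CLI (same folder layout as above):

```bash
# Download every Author partition, preserving the updated_date folder structure.
aws s3 sync "s3://openalex/data/authors" "openalex-snapshot/data/authors" --no-sign-request
```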
There are multiple objects under each updated_date
partition. Each is under 2GB.
The manifest file is JSON (in Redshift manifest format) and lists all the data files for each object type - /data/works/manifest
lists all the works.
The gzip-compressed snapshot takes up about 330 GB and decompresses to about 1.6 TB.
The structure of each entity type is documented here: Work, Author, Source, Institution, Concept, and Publisher.
We have recently added folders for new entities topics
, fields
, subfields
, and domains
, and we will be adding others soon. This documentation will soon be updated to reflect these changes.
This is a screenshot showing the "leaf" nodes of one entity type, updated date folder. You can also click around the browser links above to get a sense of the snapshot's structure.
Once you have a copy of the snapshot, you'll probably want to keep it up to date. The updated_date
partitions make this easy, but the way they work may be unfamiliar. Unlike a set of dated snapshots that each contain the full dataset as of a certain date, each partition contains the records that last changed on that date.
If we imagine launching OpenAlex on 2021-12-30 with 1000 Authors
, each being newly created on that date, /data/authors/
looks like this:
If, on 2022-01-04, we made changes to 50 of those Authors
, they would come out of one of the files in /data/authors/updated_date=2021-12-30
and go into one in /data/authors/updated_date=2022-01-04:
If we also discovered 50 new Authors, they would go in that same partition, so the totals would look like this:
So if you made your copy of the snapshot on 2021-12-30, you would only need to download /data/authors/updated_date=2022-01-04
to get everything that was changed or added since then.
To update a snapshot copy that you created or updated on date X
, insert or update the records in objects where updated_date
> X
.
You never need to go back for a partition you've already downloaded. Anything that changed isn't there anymore, it's in a new partition.
At the time of writing, these are the Author
partitions and the number of records in each (in the actual dataset):
updated_date=2021-12-30/
- 62,573,099
updated_date=2021-12-31/
- 97,559,192
updated_date=2022-01-01/
- 46,766,699
updated_date=2022-01-02/
- 1,352,773
This reflects the creation of the dataset on 2021-12-30 and 145,678,664 combined updates and inserts since then - 1,352,773 of which were on 2022-01-02. Over time, the number of partitions will grow. If we make a change that affects all records, the partitions before the date of the change will disappear.
See Merged Entities for an explanation of what Entity merging is and why we do it.
Alongside the folders for the six Entity types - work, author, source, institution, concept, and publisher - you'll find a seventh folder: merged_ids. Within this folder you'll find the IDs of Entities that have been merged away, along with the Entity IDs they were merged into.
Keep in mind that merging an Entity ID is a way of deleting the Entity while persisting its ID in OpenAlex. In practice, you can just delete the Entity that the merged-away ID belongs to. It's not necessary to keep track of the date or which entity it was merged into.
Merge operations are separated into files by date. Each file lists the IDs of Entities that were merged on that date, and names the Entities they were merged into.
For example, data/merged_ids/authors/2022-06-07.csv.gz
begins:
When processing this file, all you need to do is delete A2257618939. The effects of merging these authors, like crediting A2208157607 with their Works, are already reflected in the affected Entities.
Like the Entities' updated_date partitions, you only ever need to download merged_ids files that are new to you. Any later merges will appear in new files with later dates.
manifest file
When we start writing a new updated_date
partition for an entity, we'll delete that entity's manifest
file. When we finish writing the partition, we'll recreate the manifest, including the newly-created objects. So if manifest
is there, all the entities are there too.
The file is in Redshift manifest format. To use it as part of the update process for an Entity type (we'll keep using Authors as an example), follow these steps; a shell sketch of the same process appears after the list:
Download s3://openalex/data/authors/manifest
.
Get the file list from the url
property of each item in the entries
list.
Download any objects with an updated_date
you haven't seen before.
Download s3://openalex/data/authors/manifest
again. If it hasn't changed since (1), no records moved around and any date partitions you downloaded are valid.
Decompress the files you downloaded and parse one JSON Author
per line. Insert or update into your database of choice, using each entity's ID as a primary key.
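Here is the shell sketch of those steps for Authors mentioned above; it assumes the AWS CLI and jq, and leaves the "which dates are new to you" bookkeeping as a comment:

```bash
# 1. Grab the manifest and list the data files it names.
aws s3 cp --no-sign-request s3://openalex/data/authors/manifest manifest.before
jq -r '.entries[].url' manifest.before > authors_files.txt

# 2. Download any files whose updated_date partition you haven't seen before
#    (compare authors_files.txt against the partitions you already hold).

# 3. Re-fetch the manifest; if it is unchanged, your downloads are consistent.
aws s3 cp --no-sign-request s3://openalex/data/authors/manifest manifest.after
diff -q manifest.before manifest.after && echo "manifest unchanged"
```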
If you’ve worked with a dataset like this before and have a toolchain picked out, this may be all you need to know. If you want more detailed steps, proceed to download the data.
Compared to using a data warehouse, loading the dataset into a relational database takes more work up front but lets you write simpler queries and run them on less powerful machines. One important caveat is that this is a lot of data, and exploration will be very slow in most relational databases.
By using a relational database, you trade flexibility for efficiency in certain selected operations. The tables, columns, and indexes we have chosen in this guide represent only one of many ways the entity objects could be stored. It may not be the best way to store them given the queries you want to run. Some queries will be fast, others will be painfully slow.
We’re going to use PostgreSQL as an example and skip the database server setup itself. We’ll assume you have a working postgres 13+ installation on which you can create schemas and tables and run queries. With that as a starting point, we'll take you through these steps:
Define the tables the data will be stored in and some key relationships between them (the "schema").
Convert the JSON Lines files you downloaded to CSV files that can be read by the database application. We'll flatten them to fit a hierarchical database model.
Load the CSV data into the tables you created.
Run some queries on the data you loaded.
Running this SQL on your database (in the psql client, for example) will initialize a schema for you.
Run it and you'll be set up to follow the next steps. To show you what it's doing, we'll explain some excerpts here, using the concept entity as an example.
SQL in this section isn't anything additional you need to run. It's part of the schema we already defined in the file above.
The key thing we're doing is "flattening" the nested JSON data. Some parts of this are easy. Concept.id is just a string, so it goes in a text column called "id":
But Concept.related_concepts isn't so simple. You could store the JSON array intact in a postgres JSON or JSONB column, but you would lose much of the benefit of a relational database. It would be hard to answer questions about related concepts with more than one degree of separation, for example. So we make a separate table to hold these relationships:
We can preserve score
in this relationship table and look up any other attributes of the dehydrated related concepts in the main table concepts
. Creating indexes on concept_id
and related_concept_id
lets us look up concepts on both sides of the relationship quickly.
This python script will turn the JSON Lines files you downloaded into CSV files that can be copied to the tables you created in step 1.
This script assumes your downloaded snapshot is in openalex-snapshot
and you've made a directory csv-files
to hold the CSV files.
Edit SNAPSHOT_DIR
and CSV_DIR
at the top of the script to read or write the files somewhere else.
This script has only been tested using python 3.9.5.
Copy the script to the directory above your snapshot (if the snapshot is in /home/yourname/openalex/openalex-snapshot/
, name it something like /home/yourname/openalex/flatten-openalex-jsonl.py)
run it like this:
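Assuming you kept the file name from the previous step, that is simply:

```bash
python3 flatten-openalex-jsonl.py
```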
This script is slow. Exactly how slow depends on the machine you run it on, but think hours, not minutes.
If you're familiar with python, there are two big improvements you can make:
Run flatten_authors
and flatten_works
at the same time, either by using threading in python or just running two copies of the script with the appropriate lines commented out.
Flatten multiple .gz
files within each entity type at the same time. This means parallelizing the for jsonl_file_name ... loop
in each flatten_
function and writing multiple CSV files per entity type.
You should now have a directory full of nice, flat CSV files:
Now we run one postgres copy command to load each CSV file to its corresponding table. Each command looks like this:
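The exact command isn't shown here, but it is a psql \copy per table; a hedged sketch for the concepts table looks like this, where the table name, CSV path, and connection URI are assumptions you should adjust to match your own setup:

```bash
# Load one CSV into its table; the connection URI is the same one used as
# OPENALEX_SNAPSHOT_DB in the loading script described below.
psql "$OPENALEX_SNAPSHOT_DB" -c "\copy openalex.concepts FROM 'csv-files/concepts.csv' csv header"
```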
This script will run all the copy commands in the right order. Here's how to run it:
Copy it to the same place as the python script from step 2, right above the folder with your CSV files.
Set the environment variable OPENALEX_SNAPSHOT_DB to the connection URI for your database.
If your CSV files aren't in csv-files
, replace each occurrence of 'csv-files/' in the script with the correct path.
Run it like this (from your shell prompt)
or like this (from psql)
There are a bunch of ways you can do this - just run the copy commands from the script above in the right order in whatever client you're familiar with.
Now you have all the OpenAlex data in your database and can run queries in your favorite client.
Here’s a simple one, getting the OpenAlex ID and OA status for each work:
You'll get results like this (truncated, the actual result will be millions of rows):
id                              | oa_status
https://openalex.org/W…         | closed
https://openalex.org/W…         | gold
https://openalex.org/W…         | bronze
Here’s an example of a more complex query - finding the author with the most open access works of all time:
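A sketch of one way to write it, assuming the example schema's works_authorships (work_id, author_id) and works_open_access tables:

select works_authorships.author_id, count(distinct works_authorships.work_id) as num_oa_works
from openalex.works_authorships
join openalex.works_open_access on works_open_access.work_id = works_authorships.work_id
where works_open_access.is_oa = true
group by works_authorships.author_id
order by num_oa_works desc
limit 1;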
We get the one row we asked for:
https://openalex.org/A2798520857 | 3297
Checking out https://api.openalex.org/authors/A2798520857, we see that this is Ashok Kumar at Manipal University Jaipur. We could also have found this directly in the query, through openalex.authors.
This is a diagram of one possible schema for storing the OpenAlex data in a relational database. It's the one used in our examples here, but may not be the best one for the ways you'll use the dataset.
If you design a schema that works better for you, please tell us about it using the form on our help page.
In many data warehouse and document store applications, you can load the OpenAlex entities as-is and query them directly. We’ll use BigQuery as an example here. (Elasticsearch docs coming soon). To follow along you’ll need the Google Cloud SDK. You’ll also need a Google account that can make BigQuery tables that are, well… big. Which means it probably won’t be free.
We'll show you how to do this in 4 steps:
Create a BigQuery Project and Dataset to hold your tables
Create the tables that will hold your entity JSON records
Copy the data files to the tables you created
Run some queries on the data you loaded
This guide will have you load each entity to a single text column, then use BigQuery's JSON functions to parse them when you run your queries. This is convenient but inefficient since each object has to be parsed every time you run a query.
This project, kindly shared by @DShvadron, takes a more efficient approach: https://github.com/DrorSh/openalex_to_gbq
Separating the Entity data into multiple columns takes more work up front but lets you write queries that are faster, simpler, and often cheaper.
Snowflake users can connect to a ready-to-query data set on the marketplace, helpfully maintained by Util - https://app.snowflake.com/marketplace/listing/GZT0ZOMX4O7
In BigQuery, you need a Project and Dataset to hold your tables. We’ll call the project “openalex-demo” and the dataset “openalex”. Follow the linked instructions to create the Project, then create the dataset inside it:
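A sketch using the bq command-line tool (the project and dataset names are the ones used in this example):

bq mk --dataset openalex-demo:openalex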
Dataset 'openalex-demo:openalex' successfully created
Now, we’ll create tables inside the dataset. There will be six tables, one for each entity type. Since we’re using JSON, each table will have just one text column named after the table.
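For example, for the works and authors tables (a sketch; the --schema flag defines the single text column):

bq mk --table openalex-demo:openalex.works work:string
bq mk --table openalex-demo:openalex.authors author:string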
Table 'openalex-demo:openalex.works' successfully created.
Table 'openalex-demo:openalex.authors' successfully created
and so on for sources, institutions, concepts, and publishers.
We’ll load each table’s data from the JSON Lines files we downloaded earlier. For works, the files were:
openalex-snapshot/data/works/updated_date=2021-12-28/0000_part_00.gz
openalex-snapshot/data/works/updated_date=2021-12-28/0001_part_00.gz
Here’s a command to load one works file (don’t run it yet):
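A sketch of the command; it assumes your default gcloud project is openalex-demo (otherwise add a --project_id flag):

bq load --source_format=CSV -F '\t' --schema 'work:string' openalex.works 'openalex-snapshot/data/works/updated_date=2021-12-28/0000_part_00.gz'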
See the full documentation for the bq load command here: https://cloud.google.com/bigquery/docs/reference/bq-cli-reference#bq_load
This part of the command may need some explanation:
--source_format=CSV -F '\t' --schema 'work:string'
BigQuery is expecting multiple columns with predefined datatypes (a “schema”). We’re tricking it into accepting a single text column (--schema 'work:string') by specifying CSV format (--source_format=CSV) with a column delimiter that isn’t present in the file (-F '\t'; \t means “tab”).
bq load can only handle one file at a time, so you must run this command once per file. But remember that the real dataset will have many more files than this example does, so it's impractical to copy, edit, and rerun the command each time. It's easier to handle all the files in a loop, like this:
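A minimal sketch of such a loop in a POSIX shell (adjust paths and add error handling to taste):

for data_file in openalex-snapshot/data/works/*/*.gz; do
    bq load --source_format=CSV -F '\t' --schema 'work:string' openalex.works "$data_file"
done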
This step is slow. How slow depends on your upload speed, but for Author and Work we're talking hours, not minutes.
You can speed this up by using parallel or other tools to run multiple upload commands at once. If you do, watch out for errors caused by hitting BigQuery quota limits.
Do this once per entity type, substituting each entity name for work/works as needed. When you’re finished, you’ll have six tables that look like this:
Now you have all the OpenAlex data in a place where you can do anything you want with it, using BigQuery JSON functions through bq query or the BigQuery console.
Here’s a simple one, extracting the OpenAlex ID and OA status for each work:
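A sketch using JSON_EXTRACT_SCALAR on the single work column (table name as created above):

SELECT
  JSON_EXTRACT_SCALAR(work, '$.id') AS id,
  JSON_EXTRACT_SCALAR(work, '$.open_access.is_oa') AS is_oa
FROM `openalex-demo.openalex.works`;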
It will give you a list of IDs (this is a truncated sample, the real result will be millions of rows):
id                              | is_oa
https://openalex.org/W…         | TRUE
https://openalex.org/W…         | FALSE
https://openalex.org/W…         | FALSE
You can run queries like this directly in your shell:
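For example (a sketch; --nouse_legacy_sql selects standard SQL):

bq query --nouse_legacy_sql 'SELECT JSON_EXTRACT_SCALAR(work, "$.id") AS id FROM openalex.works LIMIT 10'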
But even simple queries are hard to read and edit this way. It’s better to write them in a file than directly on the command line. Here’s an example of a slightly more complex query - finding the author with the most open access works of all time:
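One possible version, unnesting the authorships array from each work's JSON (a sketch, not the only way to write it):

SELECT
  JSON_EXTRACT_SCALAR(authorship, '$.author.id') AS author_id,
  COUNT(*) AS num_oa_works
FROM `openalex-demo.openalex.works`,
  UNNEST(JSON_EXTRACT_ARRAY(work, '$.authorships')) AS authorship
WHERE JSON_EXTRACT_SCALAR(work, '$.open_access.is_oa') = 'true'
GROUP BY author_id
ORDER BY num_oa_works DESC
LIMIT 1;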
We get one result:
https://openalex.org/A2798520857 | 3297
Checking out https://api.openalex.org/authors/A2798520857, we see that this is Ashok Kumar at Manipal University Jaipur.
Yes!* The work associated with ID W1234 will keep the ID W1234.
When we find duplicated works, authors, etc that already have assigned IDs, we merge them. Merged entities will redirect to the proper entity in the API. In the data snapshot, there is a directory which lists the IDs that have been merged.
Yes. We automatically gather and normalize author affiliations from both structured sources and unstructured ones, such as web crawls.
Our dataset is still very young, so there's not a lot of systematic research comparing OpenAlex to peer databases like MAG, Scopus, Dimensions, etc. We're currently working on publishing some research like that ourselves. Our initial findings are very encouraging: we believe OpenAlex is already comparable in coverage and accuracy to the more established players, but OpenAlex is 100% open data, built on 100% open-source code. We think that's a really important feature. We will also continue improving the data quality in the days, weeks, months, and years ahead!
search parameter
The search query parameter finds results that match a given text search. Example:
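For example, this illustrative request searches works for "dna":
https://api.openalex.org/works?search=dna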
To disable stemming and the removal of stop words for searches on titles and abstracts, you can add .no_stem to the search filter. So, for example, if you want to search for "surgery" and not get "surgeries" too:
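An illustrative request, assuming the title.search filter described below:
https://api.openalex.org/works?filter=title.search.no_stem:surgery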
This allows you to craft complex queries using those boolean operators along with parentheses and quotation marks. Surrounding a phrase with quotation marks will search for an exact match of that phrase, after stemming and stop-word removal (be sure to use double quotation marks: "). Using parentheses will specify order of operations for the boolean operators. Words that are not separated by one of the boolean operators will be interpreted as AND.
When you use search, each returned entity in the results list gets an extra property called relevance_score, and the list is by default sorted in descending order of relevance_score. The relevance_score is based on text similarity to your search term. It also includes a weighting term for citation counts: more highly-cited entities score higher, all else being equal.
If you search for a multiple-word phrase, the algorithm will treat each word separately, and rank results higher when the words appear close together. If you want to return only results where the exact phrase is used, just enclose your phrase within quotes (see the "fierce creatures" examples below).
Oh no, you found a bug! Please report it via our help page so we can fix it.
*In July 2023, OpenAlex switched to a new author disambiguation system, which replaced all OpenAlex Author IDs with new ones. This is a very rare case in which we violated the rule of having stable IDs, but it was needed to make these improvements. Old IDs and their connections to works remain available in the historical OpenAlex data.
We automatically index new journals and articles, so there is nothing you need to do. We primarily retrieve new records from Crossref, so if you are not seeing your journal or article in OpenAlex, it is best to check whether it is in Crossref with a query like https://api.crossref.org/works/<doi>. We do not curate journals or limit which journals will be included in OpenAlex, so any discoverable journal will be added to the data set.
If your example DOI is in Crossref but not in OpenAlex, please contact us so we can look into it further!
Yes. Using coauthors, references, and other features of the data, we can tell that the same Jane Smith wrote both "Frog behavior" and "Frogs: A retrospective," but it's a different Jane Smith who wrote "Oats before boats: The breakfast customs of 17th-Century Dutch bargemen." For more details on this, see the page on author disambiguation.
OpenAlex is not doing this alone! Rather, we're aggregating and standardizing data from a whole bunch of other great projects, like a river fed by many tributaries. Our two most important data sources are Crossref and the Microsoft Academic Graph (MAG). Other key sources include:
Subject-area and institutional repositories, from arXiv to Zenodo and everywhere in between
Learn more at our general help center article.
For now, the database snapshot is updated about once per month. We also offer a much faster update cadence (as often as once every few hours) through OpenAlex Premium.
OpenAlex data is licensed as CC0, so it is free to use and distribute.
It's free! The API, the data snapshot, and the website are all available at no charge. As a nonprofit, making this data free and open is part of our mission.
For those who would like a higher level of service and to provide direct financial support for our mission, we offer OpenAlex Premium.
Please see the relevant section of our help page.
Our nonprofit (OurResearch) has a ten-year track record of building sustainable scholarly infrastructure, and a formal commitment to sustainability.
We're currently still exploring our options for OpenAlex's sustainability plan. Thanks to a generous grant, we've got lots of runway, and we don't need to roll anything out in a rush.
Our Unpaywall project (a free index of the world's open-access research literature) has been self-sustaining via a freemium revenue model for nearly five years, and we have recently introduced a similar model with OpenAlex Premium. Access to the data will always be free for everyone, but OpenAlex Premium offers additional levels of service beyond what we provide for free.
The openalexR package is a great way to work with the OpenAlex API using the R programming language, but it is third-party software that we do not maintain ourselves. Please direct any questions you have to its maintainers instead.
If you want to count self-citations (or, inversely, independent citations, where the citing and cited works have no authors in common), you can check whether each pair of citing and cited works shares any Author IDs in their authorships fields. See the authorships documentation for more information.
We provide links to the full-text PDFs of open-access works whenever possible. In addition, we have access to raw full-text for many works, either through PDF parsing we have done or via the Internet Archive's general index, which we use to power our fulltext search. You can learn more about this in the works documentation. We do not currently offer direct access to raw full-text through the API or data snapshot.
Get works with search term "dna" in the title, abstract, or fulltext:
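https://api.openalex.org/works?search=dna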
When you search works, the API looks for matches in titles, abstracts, and fulltext. When you search concepts, we look in each concept's display_name and description fields. When you search sources, we look at the display_name, alternate_titles, and abbreviated_title fields. When you search authors, we look at the display_name and display_name_alternatives fields. When you search institutions, we look at the display_name, display_name_alternatives, and display_name_acronyms fields.
For most text search we remove stop words and use stemming to improve results. So words like "the" and "an" are transparently removed, and a search for "possums" will also return records using the word "possum." With the exception of raw affiliation strings, we do not search within words but rather try to match whole words. So a search with "lun" will not match the word "lunar".
Including any of the words AND, OR, or NOT in any of your searches will enable boolean search. Those words must be UPPERCASE. You can use this in all searches, including the search query parameter and search filters.
Behind the scenes, the boolean search is using Elasticsearch's simple query string query on the searchable fields (such as title, abstract, and fulltext for works; see each individual entity page for specifics about that entity). Wildcard and fuzzy searches using *, ?, or ~ are not allowed; these characters will be removed from any searches. These searches, even when using quotation marks, will go through the same cleaning as described above, including stemming and removal of stop words.
Search for works that mention "elmo" and "sesame street," but not the words "cookie" or "monster":
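An illustrative request (quotes, spaces, and parentheses may need URL-encoding in your client):
https://api.openalex.org/works?search=elmo AND "sesame street" NOT (cookie OR monster)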
Get works with the exact phrase "fierce creatures" in the title or abstract (returns just a few results):
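https://api.openalex.org/works?search=%22fierce%20creatures%22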
Get works with the words "fierce" and "creatures" in the title or abstract, with works that have the two words close together ranked higher by relevance_score (returns way more results):
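https://api.openalex.org/works?search=fierce%20creatures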
You can also use search as a filter, allowing you to fine-tune the fields you're searching over. To do this, you append .search to the end of the property you are filtering for:
Get authors who have "Einstein" as part of their name:
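https://api.openalex.org/authors?filter=display_name.search:einstein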
Get works with "cubist" in the title:
Additionally, the filter default.search is available on all entities; this works the same as the search parameter described above.
You might be tempted to use the search filter to power an autocomplete or typeahead. Instead, we recommend you use the autocomplete endpoint, which is much faster.
You can get lists of sources:
Get all sources in OpenAlex https://api.openalex.org/sources
Which returns a response like this:
By default we return 25 results per page. You can change this default and page through sources with the per-page and page parameters:
Get the second page of sources results, with 50 results returned per page https://api.openalex.org/sources?per-page=50&page=2
You also can sort results with the sort parameter:
Sort sources by cited by count, descending https://api.openalex.org/sources?sort=cited_by_count:desc
Continue on to learn how you can filter and search lists of sources.
You can use sample to get a random batch of sources. Read more about sampling and how to add a seed value here.
Get 10 random sources https://api.openalex.org/sources?sample=10
You can use select to limit the fields that are returned in a list of sources. More details are here.
Display only the id, display_name, and issn within sources results
https://api.openalex.org/sources?select=id,display_name,issn