Find similar works
Find semantically similar works using vector search
OpenAlex's standard search uses keyword matchingβit finds works containing the words you type. But sometimes you want to find works that are about the same thing, even if they use different terminology.
That's what /find/works does. It uses AI embeddings to find semantically similar works based on meaning, not just keywords. Search for "machine learning applications in drug discovery" and you'll find relevant papers even if they say "AI-driven pharmaceutical research" or "computational approaches to medicine."
Semantic search requires an API key and costs 1,000 credits per query. See rate limits for details.
How it works
When you submit a query:
We convert your text into a 1024-dimensional vector using an embedding model
We search our index of ~217 million work embeddings for the most similar vectors
We return the matching works, ranked by similarity score
The embedding model (GTE-Large) captures semantic meaning, so conceptually related works cluster together in vector space even when they use different words.
Basic usage
GET request
https://api.openalex.org/find/works?query=machine%20learning%20for%20drug%20discovery&api_key=YOUR_KEYPOST request
For longer queries (up to 10,000 characters), use POST:
Parameters
query
The text to find similar works for (required, max 10,000 chars)
β
count
Number of results to return (1-100)
25
filter
Metadata filters (see below)
β
Filters
You can narrow results using these filters:
publication_year
2023, >2020, 2020-2023
Filter by year
is_oa
true, false
Open access only
has_abstract
true, false
Has abstract
has_content.pdf
true, false
Has downloadable PDF
has_content.grobid_xml
true, false
Has parsed XML
GET with filters:
POST with filters:
Response format
Each result includes:
score: Similarity score (0-1, higher is more similar)work: The full work object, same as you'd get from/works/{id}
Example: Literature review assistant
Say you're starting a literature review on CRISPR applications in agriculture. You could use keyword search, but you might miss papers that use terms like "genome editing," "gene modification," or "crop improvement" without mentioning "CRISPR" explicitly.
Step 1: Find semantically related works
Step 2: Explore the results
You'll find works about plant genome modification, agricultural biotechnology, and crop scienceβeven papers that never mention "CRISPR" directly but are highly relevant to your research.
Limitations
English-focused: The embedding model is optimized for English text. Non-English works are indexed, but similarity matching will be less accurate.
Abstracts required: Only works with abstracts are indexed (~217 million works). Works without abstracts won't appear in results.
No datasets: Dataset entities are not currently included in the vector index.
When to use semantic search vs keyword search
Use semantic search (/find/works) when...
Use keyword search (/works?search=) when...
You want conceptually related works
You need exact term matches
You're exploring a new research area
You know the specific terminology
Your query is a sentence or paragraph
Your query is a few keywords
You want to find works using different terminology
You want to filter by many metadata fields
Credit costs
Semantic search query
1,000
Last updated