Yes! The work associated with ID W1234 will keep the ID W1234.
At some point we might merge duplicated works, authors, etc that already have assigned IDs. At that point we will expand the schema to include synonym ID data.
Do you disambiguate authors?
Yes. Using coauthors, references, and other features of the data, we can tell that the same Jane Smith wrote both "Frog behavior" and "Frogs: A retrospective," but it's a different Jane Smith who wrote "Oats before boats: The breakfast customs of 17th-Century Dutch bargemen."
Do you gather author affiliations?
Yes. We automatically gather and normalize author affiliations from both structured and unstructured sources.
We automatically index new journals and articles so there is nothing you need to do. We primarily retrieve new records from Crossref. So if you are not seeing your journal or article, you may want to check if it is available there. We are adding more DOI registrars soon!
How often is the data updated?
For now, the database snapshot is updated about once per month. However, in the future we will probably offer a much faster update cadence (approximately daily) as an optional, paid upgrade. If you're interested in that, drop us a line at [email protected].
Is your data quality better than ____?
Our dataset is still very young, so there's not a lot of systematic research comparing OpenAlex to peer databases like MAG, Scopus, Dimensions, etc. We're currently working on publishing some research like that ourselves. Our initial finding are very encouraging...we believe OpenAlex is already comparable in coverage and accuracy to the more established players--but OpenAlex is 100% open data, built on 100% open-source code. We think that's a really important feature. We will also continue improving the data quality in the days, weeks, months, and years ahead!
Our Unpaywall project (a free index of the world's open-access research literature) has been self-sustaining via a freemium revenue model for nearly five years, and we'll certainly be looking closely at that as a model for OpenAlex, as well. Access to the data will always be free for everyone, but some kind of high-throughput, SLA access is potentially something we could add later for a fee.
We're currently focused on getting OpenAlex launched and all the bugs worked out, though, and we're not ready to commit to any one sustainability path just yet. Stay tuned :)