Metadata-Version: 2.1
Name: alexandria3k
Version: 2.6.0
Summary: Efficient relational database queries over the entire Crossref abnd ORCID data sets
Project-URL: Homepage, https://github.com/dspinellis/alexandria3k
Project-URL: Bug Tracker, https://github.com/dspinellis/alexandria3k/issues
Author-email: Diomidis Spinellis <dds@aueb.gr>
License-File: LICENSE
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Console
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: GNU General Public License v3 or later (GPLv3+)
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Topic :: Database
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.7
Requires-Dist: apsw
Requires-Dist: importlib-metadata
Requires-Dist: pyahocorasick
Description-Content-Type: text/markdown

## Alexandria3k

The _alexandria3k_ package supplies a library and a command-line tool
providing efficient relational query access to diverse publication open
data sets.
The most important one is the entire
[Crossref data set](https://www.nature.com/articles/d41586-022-02926-y)
(157 GB compressed, 1 TB uncompressed).
This contains publication metadata from about 134 million publications from
all major international publishers with full citation data for 60 million
of them.
In addition,
the Crossref data set can be linked with
the [ORCID summary data set](https://support.orcid.org/hc/en-us/articles/360006897394-How-do-I-get-the-public-data-file-)
  (25 GB compressed, 435 GB uuncompressed),
  containing about 78 million author records, as well as
data sets of
funder bodies,
journal names,
open access journals,
and research organizations.

The _alexandria3k_ package installation contains all elements required
to run it.
It does not require the installation, configuration, and maintenance
of a third party relational or graph database.
It can therefore be used out-of-the-box for performing reproducible
publication research on the desktop.



## Pre-print and citation

Details about the rationale, design, implementation, and use of this software
can be found in the following paper.

Diomidis Spinellis. Open Reproducible Systematic Publication Research. arXiv:2301.13312, January 2023. https://doi.org/10.48550/arXiv.2301.13312

## Reference and use documentation

The complete documentation for *alexandria3k*  can be found [here](https://dspinellis.github.io/alexandria3k/).
