BP/SD Consortium Software and Tools
The Born Physical/Studied Digital (BP/SD) Consortium develops and maintains a number of software packages to facilitate the analysis of data from the BP/SD project. We also host archive-specific datasets. The following is a list of the main software packages and datasets, along with links to their documentation and source code.
Arkiverse
This is the main software framework of the BP/SD project. It is a direct descendant of Tiramisu, but it uses a production-level DAG system with pipeline scheduler (Dagster) and fully Pythonic objects to represent archival artifacts.
- GitHub repository: Currently set to private, contact us to get added to the GitHub Organization
- Documentation
Tiramisu
Tiramisu is a Docker-based platform to handle content relationships for an end-to-end archival-artifacts pipeline. It is intended to be used on standalone computing units to serve, access, and preserve documents, files, and their derivative artifacts. Tiramisu allows for distributed task management, frontend accessibility, metadata viewing, and graph-based database management. Most importantly, Tiramisu is completely open source: It integrates a task manager (Celery) with a graph database (neo4j) tailored for the storage and processing of archival documents.
- GitHub repository: Currently set to private, contact us to get added to the GitHub Organization
SCALES-related software, models, and datasets
Publications:
- How to build a more open justice system
- PRESIDE: A Judge Entity Recognition and Disambiguation Model for US District Court Records
- The Promise of AI in an Open Justice System
- A user-centered approach to developing an AI system analyzing U.S. federal court data
- The SCALES Project: Making Federal Court Records Free
SKATE
SKATE (Seismogram Kit for Automatic Trace Extraction) is a web-based software tool for the digitization of seismic traces in historic analog seismograms.
- Frontend tool
- Frontend tool user manual
- GitHub repository: Conversion of SKATE from Python 2 to Python 3
Publications: