Releases: adsabs/ingest_data_model
Supports multiple titles/abstracts in different languages
- Updates "abstract" in Document.json to be an array of objects, each tagged with the abstract text and the stated language of the abstract, for cases where the input data have both native-language and translated versions of the abstract
- Updates "title" in Document.json to be an array (as with "abstract")
- Renames the directory containing the schema: adsingestschema -> json_schema
- Expanded test cases, including failure cases
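As a rough illustration of the array shape this release describes, the sketch below builds a record that carries both a native-language and a translated abstract, then selects one by language. The key names "text" and "lang" and the helper pick_abstract are assumptions made for illustration; they are not taken from Document.json.

```python
# Hypothetical record: the key names "text" and "lang" are assumed,
# not copied from the actual Document.json schema.
record = {
    "abstract": [
        {"text": "Résumé en français.", "lang": "fr"},
        {"text": "Abstract in English.", "lang": "en"},
    ],
}


def pick_abstract(doc, lang):
    """Return the first abstract text tagged with the given language, else None."""
    for entry in doc.get("abstract", []):
        if entry.get("lang") == lang:
            return entry.get("text")
    return None


english = pick_abstract(record, "en")
```

The same lookup would apply to "title", since it is described as an array in the same style.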
Updates for SciX compatibility, multilingual capability
- Adds UUID for honeycomb
- Absolute $id in Document.json pointing to this release (v1.0.13)
- Minor schema changes for both fulltext and multilingual compatibility
- Adds doc_type (when set at parse time, for data set, proposal, etc.)
Removes fulltext field; adds publisher-specific parser names to loadFormat list
- Removes the fulltext field as we transition to a separate fulltext_data_model
- Adds publisher-specific formats as additional loadFormat options
Updates affils to allow multiple institutional identifiers per affil
Versions <= v1.0.10 assumed that each affiliation object could have one name (string), one identifier type, and one identifier id. Version 1.0.11 subclasses the affiliation identifier, so each affil object now has an array of affiliation identifiers.
In practice, if the publisher includes multiple external identifiers for a given author affiliation (for example an ISNI, ROR, and/or GRID id), we can store all of them within the affiliation.
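A minimal sketch of the change, assuming hypothetical key names ("name", "id_type", "id_value", "identifiers", "type", "id") that are not taken from the schema itself. It converts a single-identifier affiliation into the array form, then appends a second identifier. The identifier values are placeholders.

```python
def upgrade_affil(old):
    """Convert a single-identifier affiliation into the multi-identifier shape.

    All key names here are illustrative assumptions, not the schema's own.
    """
    identifiers = []
    if old.get("id_type") and old.get("id_value"):
        identifiers.append({"type": old["id_type"], "id": old["id_value"]})
    return {"name": old.get("name"), "identifiers": identifiers}


# Older shape: one name, one identifier type, one identifier id.
legacy = {"name": "Example University", "id_type": "ROR", "id_value": "ror-placeholder"}
upgraded = upgrade_affil(legacy)

# A publisher that supplies several identifiers can now have them all stored.
upgraded["identifiers"].append({"type": "ISNI", "id": "isni-placeholder"})
```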
Adds storage for title/subtitle footnotes and other xrefs
- Created a new reference, Title.json, containing the structure of Document.json['title'] with an additional list of strings for notes
- Document "title" and "subtitle" are now both $ref to Title.json
Fielded funding information, capable of holding multiple sources
Merge pull request #19 from seasidesparrow/funding_obj.20230725
fix: update Document.funding to be array of Funder objects
Fix: change key names in Contrib.json
This release modifies key names to replace hyphens with underscores, for compatibility with serialization tools.
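For records produced against the older key names, a conversion along the following lines may help. This is a sketch only: the helper underscore_keys and the sample keys "given-name" and "family-name" are hypothetical and are not taken from Contrib.json.

```python
def underscore_keys(value):
    """Recursively replace hyphens with underscores in all dictionary keys."""
    if isinstance(value, dict):
        return {key.replace("-", "_"): underscore_keys(item) for key, item in value.items()}
    if isinstance(value, list):
        return [underscore_keys(item) for item in value]
    # Strings, numbers, and other leaf values are returned unchanged.
    return value


# Hypothetical contributor record using hyphenated keys.
old_contrib = {"given-name": "A.", "nested": [{"family-name": "Author"}]}
new_contrib = underscore_keys(old_contrib)
```

Only keys are rewritten; hyphens inside values (for example in a hyphenated surname) are left as they are.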
Fix: Allowed doctypes now match Solr data types
Updated doctypes to include the entire list of doctypes allowed by Solr. The "data" doctype has been renamed to "dataset".
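Code that still emits the old value could map it through a small rename table, sketched below. The names DOCTYPE_RENAMES and normalize_doctype are hypothetical, and only the "data" to "dataset" rename stated above is encoded; the full list of allowed doctypes is not reproduced here.

```python
# Only the rename stated in this release is listed; other values pass through.
DOCTYPE_RENAMES = {"data": "dataset"}


def normalize_doctype(value):
    """Return the current doctype name for a possibly outdated one."""
    return DOCTYPE_RENAMES.get(value, value)
```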
Modifications to book / book-series handling
- Added space for e-ISBN
- Additional space for book-in-series information
Maintenance release: formatting typo
#10 Fixed formatting typo in PubDates schema