zotero-meta-analysis-toolkit

Command-line tools to support meta-analysis using a library managed in Zotero

This command-line tool operates on a Zotero database, using the pyzotero API, to obtain basic quantitative measures about the distribution of references according to user-applied thematic tags. While not optimized for speed or efficiency, this script is relatively easy to adapt as it makes use of the convenient tag-filtering and other queries that are built into the Zotero API, which avoids needing to parse the complex data structure in a Zotero RDF or json dump.

The tool assumes a workflow in which significant metadata concerning each item are encoded in Zotero tags. The primary output of query operations will be a CSV file containing a two-dimensional tags array, displaying the count of articles matching both tags for each cell in the array. For example, it would be possible to query a Zotero library by geographic region and by theme, to generate tabular output like the following:

tag	Culture	History
Asia	27	96
Europe	18	40

Each axis in the sample array above is constructed from a list of tags supplied to the get_union command.

Usage

  python zma.py [OPTIONS] COMMAND [ARGS]...

Global options

`--key TEXT`

(Required)

The API key for your Zotero user or group library. This will be a seven-digit code available from the settings page at https://www.zotero.org/settings/keys.

`--library-id TEXT`

(Required)

For a personal library, the library ID is listed on the "keys" page (see above) as "Your userID for use in API calls". For a group library, the ID will be part of the group URL -- for example, for the group http://zotero.org/groups/2183860/dried_fish_matters/ the library ID is 2183860.

`--library-type [user|group]`

(Required)

Indicate whether the Zotero library you are accessing is owned by a user or a group.

`--tag-filter TEXT`

Tag to include/exclude in query commands. The TEXT string will match anywhere at the beginning of a tag, so --tag-filter theme_ would match the tags theme_culture and theme_history.

This option can generally be specified more than once, in which case it will serve as a logical AND by returning tags that match all of the strings given.

Tags can be excluded by prefixing the string with a minus, sign (e.g., --tag-filter -exclude will limit the results to items that do not have the tag "exclude").

A logical OR is represented as "||" (e.g., --tag-filter "foo || bar" will return items that have tags beginning with either "foo" or "bar").

Commands

`get-tags`

Usage:

  python zma.py [OPTIONS] get-tags OUTPUT

Print a list of tags in the library.

Tags are filtered to include or exclude those that match the prefix strings given with --tag-filter; otherwise all tags are returned.

OUTPUT can be a filename or - to print to stdout.

`print-bibliography`

Usage:

  python zma.py [OPTIONS] print-bibliography [--print-tag TAG]... OUTPUT

Print an html bibliography for a given collection, with tags listed beneath each entry.

Tags can be filtered by providing tag prefixes through --print-tag.

OUTPUT can be a filename or - to print to stdout.

Example:

    python zma.py --library-type group --library-id 2183860 /
    --collection-id 27MV6NK5 print-bibliography /
    --print-tag "#THEME:" --print-tag "+" zotero.html

`apply-category-tags`

Usage:

   python zma.py [OPTIONS] apply-category-tags --tag TEXT INPUT

Apply category tags to items matching tags listed in INPUT.

INPUT should contain a list of tags, separated by newlines. INPUT can be a filename or - to read from stdin. References matching any of the tags in that list will be tagged in the Zotero library with the additional tag specified with --tag. For example, INPUT could contain a list of Asian country names, in which case all library items tagged with one of those country names could additionally be given the tag "ASIA":

  python zma.py [OPTIONS] apply-category-tags --tag ASIA asian-countries.txt

`find-missing-tags`

Usage:

  python zma.py [OPTIONS] find-missing-tags [--local FILE] [--remote FILE] TAGS_LIST

Compare a list of tags to those in the library.

Prints lists of tags to two plain text files, by default "missing-user- tags.txt" (tags that are in the Zotero library but not in the user-supplied file) and "missing-zotero-tags.txt" (tags that are in the user-supplied file but not in the Zotero library).

TAGS_LIST is the name of a plain-text file or stream containing a list of files to check, one per line. Use the get_tags command to generate a baseline list as needed.

This function is intended to be used where there is an established codebook, and you wish to check whether (1) there are any unused tags in the codebook, and (2) there are any tags in the Zotero library that aren't documented in the codebook. If the tags in the codebook have a common prefix, results from the Zotero library can be filtered using the --tag- filter argument.

`get-union`

Usage:

  python zma.py [OTPIONS] get-union TAG_X TAG_Y OUTPUT

Generate a CSV file containing an array of tag correlations.

Each of TAG_X and TAG_Y is a plain-text file or stream containing a list of tags, separated by newlines.

Use the --tag--filter argument as a global filter to limit the results to items that match a specific tag or tags (this argument can be specified more than once, in which case ALL tags must be matched to be included in the result set).

Example:

  python zma.py [OPTIONS] --tag-filter="#RELEVANCE: Direct" --tag-filter \
    "-#exclude" get-union x.txt y.txt out.csv

`list-journals`

Usage:

  python zma.py [OPTIONS] list-journals [--start-date DATE] [--end-date DATE] OUTPUT

Write a table showing journal frequencies. OUTPUT will be a CSV file containing two columns: the ranked number of occurrences in the library of articles in each journal, and the name of each journal.

Tag filters can be used to limit results. Note that the --tag-filter argument can only be specified once for this command, otherwise it will be ignored.

The --start-date (default: 1900) and --end-date (default: 2100) options may be used to limit output to a specific temporal range. Most date strings can be parsed successfully, although a four-digit year as input will be least ambiguous.

Copying

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see https://www.gnu.org/licenses/.

Credits

This script was originally written for the Dried Fish Matters project, supported by the Social Sciences and Humanities Research Council of Canada.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py
zma.py		zma.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

zotero-meta-analysis-toolkit

Usage

Global options

`--key TEXT`

`--library-id TEXT`

`--library-type [user|group]`

`--tag-filter TEXT`

Commands

`get-tags`

`print-bibliography`

`apply-category-tags`

`find-missing-tags`

`get-union`

`list-journals`

Copying

Credits

About

Releases

Packages

Languages

License

DriedFishMatters/zotero-meta-analysis-toolkit

Folders and files

Latest commit

History

Repository files navigation

zotero-meta-analysis-toolkit

Usage

Global options

--key TEXT

--library-id TEXT

--library-type [user|group]

--tag-filter TEXT

Commands

get-tags

print-bibliography

apply-category-tags

find-missing-tags

get-union

list-journals

Copying

Credits

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

`--key TEXT`

`--library-id TEXT`

`--library-type [user|group]`

`--tag-filter TEXT`

`get-tags`

`print-bibliography`

`apply-category-tags`

`find-missing-tags`

`get-union`

`list-journals`

Packages