Skip to content

dgidb/dgidb-citation-analysis

Repository files navigation

DGIdb Citation Analysis

This repository contains analyses designed to summarize impact of DGIdb and compare to other mainstream drug-gene interaction aggregator knowledgebases. The analyses explore citation trends and disease categorization across DGIdb, Pharos, and OpenTargets. By examining these trends, we aim to highlight the impact and relevance of DGIdb in the scientific community.

Papers considered

The analysis in this repository evaluates works that cite the following papers:

Drug-Gene Interaction Database (DGIdb)

Pharos Resource

Open Targets Platform

Repository Structure

  • data/ This directory contains the data files used for analysis:

    • Citation CSV files for DGIdb, Pharos, and OpenTargets.
    • Abstract text files associated with the citations for these resources.
  • citation-history.ipynb A Jupyter Notebook that visualizes the citation history for DGIdb, Pharos, and OpenTargets as a grouped histogram. This analysis highlights the temporal trends of citations for each resource.

  • disease-categorization.ipynb A Jupyter Notebook that categorizes the publications citing DGIdb, Pharos, and OpenTargets into major disease areas based on keywords derived from MeSH. The results are visualized as a comparative grouped histogram of disease focus across these resources.

  • semantic-scholar-citations.ipynb A Jupyter Notebook that extracts richer annotations ("influential", "methodology", "background") on citations for papers of interest. The output is given in ./data/semantic_scholar_citation_metadata.json and can be used to generate visuals for these annotations in the aggregate or by-year.

Getting Started

  1. Clone the repository:

    git clone https://github.com/YOUR_USERNAME/dgidb-citation-analysis.git
    cd dgidb-citation-analysis
  2. Install required Python packages: The analyses use standard Python libraries such as pandas, matplotlib, and numpy. Install them using:

    pip install -r requirements.txt
  3. Run the Notebooks: Open the Jupyter Notebooks to explore the citation history and disease categorization:

    jupyter notebook citation-history.ipynb
    jupyter notebook disease-categorization.ipynb
    jupyter notebook semantic-scholar-citations.ipynb

Contributions

Contributions are welcome! Please submit issues or pull requests to suggest improvements or new analyses.

License

This repository is open-sourced under the MIT License. See the LICENSE file for details.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •