Bank metadata structure for internal use #9

Open
andrewsutjahjo opened this issue Nov 30, 2021 · 0 comments


andrewsutjahjo commented Nov 30, 2021

We will be crawling multiple banks and storing different types of metadata.
This includes, but is not limited to:

  • Bank names
  • Website URLs of pages that contain CSR information as text
  • Website URLs of download links to CSR policy documents (.docx, .pdf, possibly other formats)
  • Metadata for document-based CSR policies
  • Time-based and diff-based metadata for all files
  • Banktrack's own metadata

This metadata is needed for the system to run automatically. Structuring it correctly up front minimizes costly rework.
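
As a starting point for discussion, here is a minimal sketch of how the metadata types listed above could fit together. Nothing here is agreed: the class names (`Bank`, `Source`, `Snapshot`), the field names, the `kind` values, and the use of a SHA-256 content hash for diff detection are all assumptions made for illustration.

```python
from __future__ import annotations

import hashlib
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass
class Snapshot:
    """One crawl of a source at a point in time (time-based/diff-based metadata)."""
    fetched_at: datetime
    sha256: str  # hash of the fetched bytes, used to detect changes between crawls


@dataclass
class Source:
    """A place where CSR information lives for a bank (hypothetical shape)."""
    url: str
    kind: str  # assumed values: "webpage" (CSR text on the page) or "document" (download link)
    file_type: str | None = None  # e.g. "pdf" or "docx"; only meaningful for documents
    snapshots: list[Snapshot] = field(default_factory=list)

    def record(self, content: bytes, fetched_at: datetime | None = None) -> bool:
        """Store a snapshot; return True if the content is new or has changed."""
        digest = hashlib.sha256(content).hexdigest()
        changed = not self.snapshots or self.snapshots[-1].sha256 != digest
        self.snapshots.append(Snapshot(fetched_at or datetime.now(timezone.utc), digest))
        return changed


@dataclass
class Bank:
    name: str
    sources: list[Source] = field(default_factory=list)
    # Placeholder for Banktrack's own metadata; its real fields are not defined in this issue.
    banktrack: dict[str, str] = field(default_factory=dict)


# Usage: the bank name and URL below are made up.
bank = Bank(name="Example Bank")
policy = Source(url="https://example.com/csr-policy.pdf", kind="document", file_type="pdf")
bank.sources.append(policy)
first = policy.record(b"policy v1")   # first crawl counts as a change
second = policy.record(b"policy v1")  # identical bytes, no change
third = policy.record(b"policy v2")   # different bytes, change detected
```

Keeping snapshots per source, rather than per bank, would let webpages and downloaded documents share the same change-tracking logic, but that is one option among several.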

This story is done when we all agree on a metadata structure.

Depends on #6, #7, and #8.
