Add torben conversion script along with the output data#5
Conversation
|
@realmarcin I just updated this PR to separate data into its own directory, and added some minimal documentation. |
| and _OUT_EDGE_HEADERS. | ||
| """ | ||
| noext = os.path.splitext(_SOURCE_PATH)[0].replace('/source/', '/out/') | ||
| node_path = noext + '.biolink-nodes.tsv' |
There was a problem hiding this comment.
can you align this file naming to:
IMGVR_extra_KGX_edges.tsv
IMGVR_extra_KGX_nodes.tsv
so subKG (Torben) dataset (sample) _KGX_edges.tsv etc
Also the format is KGX (and overlaps with many graph formats) and we are using some biolink terms to annotate our nodes/edges.
|
A few notes for the edges file:
|
|
A few notes for the nodes file:
|
| @@ -0,0 +1,2 @@ | |||
| * `/data/source` are source files that scripts in `/transform` take as input | |||
There was a problem hiding this comment.
here let's align to the KGX dir structure:
ls -1 data/
raw
transform
merged
and then subdirs for each source as in the merge.yaml
https://github.com/kbaseIncubator/KE_KG/blob/main/merge.yaml
No description provided.