Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 7 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1 +1,7 @@
# KE_KG
# KE_KG

* Source and output data can be found in `/data`

## Torben data transforms

* Run sample data transform: `python data/torben_transform_samples.py`
Empty file removed Torben_samples_to_tsv.py
Empty file.
2 changes: 2 additions & 0 deletions data/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
* `/data/source` are source files that scripts in `/transform` take as input
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

here let's align to the KGX dir structure:
ls -1 data/
raw
transform
merged

and then subdirs for each source as in the merge.yaml
https://github.com/kbaseIncubator/KE_KG/blob/main/merge.yaml

* `/data/out` are transformed output files using scripts from `/transform`
96,746 changes: 96,746 additions & 0 deletions data/out/torben/Torben_IMG-data_linked_to-GOLD_v2.biolink-edges.tsv

Large diffs are not rendered by default.

101,708 changes: 101,708 additions & 0 deletions data/out/torben/Torben_IMG-data_linked_to-GOLD_v2.biolink-nodes.tsv

Large diffs are not rendered by default.

4,962 changes: 4,962 additions & 0 deletions data/source/torben/Torben_IMG-data_linked_to-GOLD_v2.tsv

Large diffs are not rendered by default.

4 changes: 1 addition & 3 deletions transform/IMGVR_to_tsv.py
Original file line number Diff line number Diff line change
Expand Up @@ -135,10 +135,8 @@
kgx_header_nodes = "id\tname\tcategory\tprovided_by\\n"



def load(source_path):
df = pd.read_csv(source_path, sep='\t')

columns = df.columns.str
print(type(columns))
print(columns)
Expand Down Expand Up @@ -266,4 +264,4 @@ def write(output, outfile, header):
node_output = tuple2[1]
node_outfile = "IMGVR_sample_KGX_nodes.tsv"
print("writing "+node_outfile)
write(node_output, node_outfile, kgx_header_nodes)
write(node_output, node_outfile, kgx_header_nodes)
269 changes: 0 additions & 269 deletions transform/Torben_samples_to_tsv.py

This file was deleted.

Loading