README.md: 8 additions & 8 deletions
@@ -125,13 +125,13 @@ Those embeddings can then be utilized in a novel way thanks to their dim-wise in

  ## Key technical features of Cleora embeddings

  The embeddings produced by Cleora are different from those produced by Node2vec, Word2vec, DeepWalk or other systems in this class by a number of key properties:
- - efficiency - Cleora is two orders of magnitude faster than Node2Vec or DeepWalk
- - inductivity - as Cleora embeddings of an entity are defined only by interactions with other entities, vectors for new entities can be computed on-the-fly
- - updatability - refreshing a Cleora embedding for an entity is a very fast operation allowing for real-time updates without retraining
- - stability - all starting vectors for entities are deterministic, which means that Cleora embeddings on similar datasets will end up being similar. Methods like Word2vec, Node2vec or DeepWalk return different results with every run.
- - cross-dataset compositionality - thanks to stability of Cleora embeddings, embeddings of the same entity on multiple datasets can be combined by averaging, yielding meaningful vectors
- - dim-wise independence - thanks to the process producing Cleora embeddings, every dimension is independent of others. This property allows for an efficient, low-parameter method for combining multi-view embeddings with Conv1d layers.
- - extreme parallelism and performance - Cleora is written in Rust utilizing thread-level parallelism for all calculations except input file loading. In practice this means that the embedding process is often faster than loading the input data.
+ - **efficiency** - Cleora is two orders of magnitude faster than Node2Vec or DeepWalk
+ - **inductivity** - as Cleora embeddings of an entity are defined only by interactions with other entities, vectors for new entities can be computed on-the-fly
+ - **updatability** - refreshing a Cleora embedding for an entity is a very fast operation allowing for real-time updates without retraining
+ - **stability** - all starting vectors for entities are deterministic, which means that Cleora embeddings on similar datasets will end up being similar. Methods like Word2vec, Node2vec or DeepWalk return different results with every run.
+ - **cross-dataset compositionality** - thanks to stability of Cleora embeddings, embeddings of the same entity on multiple datasets can be combined by averaging, yielding meaningful vectors
+ - **dim-wise independence** - thanks to the process producing Cleora embeddings, every dimension is independent of others. This property allows for an efficient, low-parameter method for combining multi-view embeddings with Conv1d layers.
+ - **extreme parallelism and performance** - Cleora is written in Rust utilizing thread-level parallelism for all calculations except input file loading. In practice this means that the embedding process is often faster than loading the input data.

  ## Key usability features of Cleora embeddings

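The cross-dataset compositionality bullet in the hunk above says that embeddings of the same entity from several datasets can be combined by averaging. A minimal sketch of that idea, assuming each dataset's embeddings are already loaded as a mapping from entity id to a vector of equal length; the function name `combine_by_averaging` and the toy data are hypothetical and not part of Cleora:

```python
from typing import Dict, List, Sequence


def combine_by_averaging(
    views: Sequence[Dict[str, List[float]]],
) -> Dict[str, List[float]]:
    """Average each entity's vectors over the datasets in which it appears.

    Hypothetical helper for illustration; not a Cleora API.
    """
    sums: Dict[str, List[float]] = {}
    counts: Dict[str, int] = {}
    for view in views:
        for entity, vec in view.items():
            if entity not in sums:
                # First time we see this entity: start its running sum.
                sums[entity] = list(vec)
                counts[entity] = 1
                continue
            if len(vec) != len(sums[entity]):
                raise ValueError(f"dimension mismatch for entity {entity!r}")
            # Element-wise accumulation; each dimension is handled on its own.
            sums[entity] = [a + b for a, b in zip(sums[entity], vec)]
            counts[entity] += 1
    return {e: [x / counts[e] for x in s] for e, s in sums.items()}


# Toy vectors standing in for embeddings computed on two datasets.
clicks = {"user_1": [1.0, 0.0], "user_2": [0.0, 1.0]}
purchases = {"user_1": [0.0, 1.0]}
combined = combine_by_averaging([clicks, purchases])
```

An entity present in only one dataset keeps its single vector unchanged. Whether the averaged vectors should be re-normalized afterwards depends on the downstream use and is left out of this sketch.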
@@ -302,4 +302,4 @@ Every `SparseMatrix` object allocates space for:
  - `|V|` objects, each occupying 40 bytes,
  - `2 x |E|` objects (in undirected graphs we need to count an edge in both directions), each occupying 24 bytes.

- During training we need additional `2 x d x |V|` objects, each occupying 4 bytes (this can be avoided by using memory-mapped files, see `--in-memory-embedding-calculation` argument for the program).
+ During training we need additional `2 x d x |V|` objects, each occupying 4 bytes (this can be avoided by using memory-mapped files, see `--in-memory-embedding-calculation` argument for the program).
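The sizes quoted in this hunk can be turned into a rough back-of-the-envelope estimate. The sketch below only multiplies out the figures stated above for a single `SparseMatrix` on an undirected graph; the function name `estimate_memory_bytes` and its parameters are hypothetical, and allocator overhead or other structures are not accounted for:

```python
def estimate_memory_bytes(
    num_nodes: int,
    num_edges: int,
    dim: int,
    embeddings_in_memory: bool = True,
) -> int:
    """Rough per-SparseMatrix memory estimate from the README figures.

    Hypothetical helper; assumes an undirected graph, so every edge
    is counted in both directions.
    """
    # |V| objects of 40 bytes plus 2 x |E| objects of 24 bytes.
    sparse_matrix = 40 * num_nodes + 24 * 2 * num_edges
    # 2 x d x |V| values of 4 bytes during training, skipped here when
    # memory-mapped files are used instead of in-memory calculation.
    training = 4 * 2 * dim * num_nodes if embeddings_in_memory else 0
    return sparse_matrix + training


# Example: 1M nodes, 10M undirected edges, 128-dimensional embeddings.
total = estimate_memory_bytes(1_000_000, 10_000_000, 128)
mmap_total = estimate_memory_bytes(
    1_000_000, 10_000_000, 128, embeddings_in_memory=False
)
print(f"in-memory: {total / 1e9:.3f} GB, memory-mapped: {mmap_total / 1e9:.3f} GB")
```

For this example the training buffers (about 1.02 GB) dominate the sparse matrix itself (about 0.52 GB), which is why the memory-mapped option matters for large `d`.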