You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Say there is a WCON spread across 500 files ("chunks"). Going to timestamp X will require opening all these chunks unless there is a table of contents telling the reader which chunk to visit.
This table of contents should be auto-generated by the WCON writer, and reference what times and worms are stored in each of the other file "chunks". It will be optionally used by a WCON reader to open only the necessary chunk when doing random access of certain timestamps.
To retain backward compatibility it could be a top-level object called "@tableofcontents". Or perhaps it could be a change to the top-level "files" object, adding hash and data range info to each file entry.
Implementations using this should probably put the "@TableofContents", "units", and "metadata" objects, and an empty "data" object, in the first chunk, and the real data objects starting from the second chunk, to ensure efficient loading of only the data required.
@vivekv2 may have an example of a table of contents in action.
The text was updated successfully, but these errors were encountered:
This sounds like a good idea overall, but I would recommend that the list of contents go at the end rather than the beginning because either end is equally easy to find once it's all there, but if you want to write it in one sweep while taking data, you won't know what to put at the beginning. You can, however, tell that when you're done writing the first file that you're either completely done (so you write the ToC) or not (in which case you start writing the next file).
How about files can have contents tag, which is a map from file names to an array of three-element arrays, where the arrays are ["wormID", 1.2, 3.4] where 1.2 and 3.4 are the earliest and latest times the worm is present, respectively? So, for example,
Say there is a WCON spread across 500 files ("chunks"). Going to timestamp X will require opening all these chunks unless there is a table of contents telling the reader which chunk to visit.
This table of contents should be auto-generated by the WCON writer, and reference what times and worms are stored in each of the other file "chunks". It will be optionally used by a WCON reader to open only the necessary chunk when doing random access of certain timestamps.
To retain backward compatibility it could be a top-level object called
"@tableofcontents"
. Or perhaps it could be a change to the top-level"files"
object, adding hash and data range info to each file entry.Implementations using this should probably put the "@TableofContents", "units", and "metadata" objects, and an empty "data" object, in the first chunk, and the real data objects starting from the second chunk, to ensure efficient loading of only the data required.
@vivekv2 may have an example of a table of contents in action.
The text was updated successfully, but these errors were encountered: