implement sync_diffs_filediffs

To reduce the memory required for writing large dataframes, a new mode `sync_filediffs` is being implemented in the mysql.Connection class.

The approach is to do as much as possible out of memory.
On receiving a dataframe, the df is written to disk. 

The db table which should be updated is also downloaded chunkwise to disk.

Then the [`filediffs`](https://github.com/INWTlab/filediffs) package is used to find the differences between the two dataframes and save them to disk.

After that the update part and the delete part are read back into memory and the database is updated.

A first version is already implemented on the [sync_filediffs](https://github.com/INWTlab/dbrequests/compare/sync_filediffs?expand=1) branch. 

Still open Issues are 
1. The verbose logging has to be improved so it integrates better into the codebase.
2. The temporary file management has to be improved.
3. The `query` method's output format. Changing it seems to be a breaking change.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

implement sync_diffs_filediffs #47

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

implement sync_diffs_filediffs #47

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions