Improve Netzschleuder code #38

krlmlr · 2025-04-28T19:52:18Z · edited by schochastics

Read in chunks using httr2::resp_stream_raw(): https://github.com/igraph/igraphdata/pull/23/files#r2047637905
Use a named list as return for resolve_name(): https://github.com/igraph/igraphdata/pull/23/files#r2047636940
Clean temporary file: https://github.com/igraph/igraphdata/pull/23/files#r2049479042
Double-check use of [1] vs. [[1]]: https://github.com/igraph/igraphdata/pull/23/files#r2049490639
Don't babysit the user, improve error message if needed: https://github.com/igraph/igraphdata/pull/23/files#r2049473272
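For illustration, here is a minimal base R sketch of three of the items above: a named list as return value, the difference between `[` and `[[`, and cleaning up a temporary file. The helpers `resolve_name_sketch()` and `write_and_clean()` are hypothetical stand-ins, not the package's actual code, and the assumption that names have the form `collection/network` is mine.

```r
# Hypothetical stand-in for resolve_name(); the real function may differ.
# Assumes input of the form "collection/network".
resolve_name_sketch <- function(x) {
  # strsplit() returns a list; [[1]] extracts the character vector itself
  parts <- strsplit(x, "/", fixed = TRUE)[[1]]
  # A named list lets callers write res$collection instead of res[[1]]
  list(collection = parts[[1]], network = parts[[2]])
}

res <- resolve_name_sketch("some_collection/some_network")

# `[` keeps the container, `[[` extracts the element
headers <- list(`content-length` = "2048")
class_single <- class(headers["content-length"])    # "list"
class_double <- class(headers[["content-length"]])  # "character"

# Hypothetical helper: the temporary file is removed when the function exits,
# whether it returns normally or fails with an error.
write_and_clean <- function() {
  tmp <- tempfile(fileext = ".zip")
  on.exit(unlink(tmp), add = TRUE)
  writeLines("placeholder", tmp)
  tmp  # the path is returned, but the file is gone after exit
}

tmp_path <- write_and_clean()
```

With `[`, a later `as.numeric()` on the result would receive a list rather than a string, which is why the distinction matters for header lookups.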


schochastics · 2025-04-29T14:45:45Z

@krlmlr What is the advantage of resp_stream_raw()? Should this replace the current logic, which checks the file size beforehand and only downloads if it stays below a certain limit?

igraphdata/R/netzschleuder.R

Lines 57 to 64 in 0dcc3f3

    
    byte_size <- as.numeric(httr2::resp_headers(resp)[["content-length"]])
    gb_size <- round(byte_size / 1024^3, 4)
    if (gb_size > size_limit) {
      cli::cli_abort(c(
        "{zip_url} has a size of {gb_size} GB and exceeds the size limit of {size_limit} GB.",
        "i" = "To download the file, set {.arg size_limit} to a value greater than {gb_size}"
      ))
    }

krlmlr · 2025-04-29T18:58:33Z

With it, we can read in chunks of 1 MB or so and are no longer limited by file size. The current code loads the entire file into RAM. I suspect this will also help with rate limiting, but I need to check.

Thinking about it, we need RAM proportional to the graph size anyway, so perhaps not important?
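As a rough sketch of the chunked approach discussed above, the loop below copies a binary stream to disk roughly 1 MB at a time, so only one chunk is held in memory. It uses a base R file connection as a stand-in for the HTTP response body. With httr2, the `readBin()` call would presumably be replaced by `httr2::resp_stream_raw()` on a streaming response; that mapping is an assumption and is not tested here. `copy_in_chunks()` is a hypothetical helper name.

```r
# Copy bytes from an open binary connection to `dest` in fixed-size chunks,
# so that at most `chunk_bytes` of payload are held in memory at a time.
# Hypothetical helper for illustration; not part of igraphdata.
copy_in_chunks <- function(con, dest, chunk_bytes = 1024^2) {
  out <- file(dest, open = "wb")
  on.exit(close(out), add = TRUE)
  total <- 0
  repeat {
    chunk <- readBin(con, what = "raw", n = chunk_bytes)
    if (length(chunk) == 0L) break  # end of stream
    writeBin(chunk, out)
    total <- total + length(chunk)
  }
  invisible(total)
}

# Usage, with a local file standing in for the response body
src <- tempfile()
dest <- tempfile()
writeBin(as.raw(sample(0:255, 5000, replace = TRUE)), src)

con <- file(src, open = "rb")
n_copied <- copy_in_chunks(con, dest, chunk_bytes = 1024)
close(con)
```

Because the running total is known inside the loop, a size limit could in principle be enforced during the download rather than from the `content-length` header beforehand. Whether that is preferable is the open question in this thread.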

krlmlr assigned schochastics Apr 28, 2025

schochastics linked a pull request Apr 29, 2025 that will close this issue

fix: improve netzschleuder internals #40

Draft
