Support csv import #124

AndrooFrowns · 2024-11-10T02:32:12Z

Implement basic changes to permit CSV files to be converted to .ic

Modified paste csv function to not use a &str so that the data doesn't need to be entirely in memory, hopefully permitting large CSVs to be read.

not addressed yet, but noticed while implementing:

export to CSV
user/programmatic selection of CSV settings
supporting strings rather than chars for comment indicator

…from disk

AndrooFrowns · 2024-11-11T18:35:12Z

It looks like I thought BufReader would permit partially reading a file in chunks to permit reading files larger than available RAM. It looks like the more correct way to handle that is with mmap or manually writing the code to read in chunks.

I could try to switch strategies here or just make that a future enhancement, up to you.

nhatcher · 2024-11-11T19:35:56Z

I could try to switch strategies here or just make that a future enhancement, up to you.

There is not a time constraint, if you think there is a better way, go for the better way.

jaycarlton · 2024-11-22T00:48:29Z

xlsx/src/bin/xlsx_2_icalc.rs

+        FileKind::Csv => handle_csv(&path),
+    }?;
+
+    save_to_icalc(&model, &output_path).with_context(|| "Failed to sasve file as .icalc")?;


I thought the extension was .ic.

That might be, I just kept what this file was using. Is there a standard naming convention we should ensure is used everywhere?

jaycarlton · 2024-11-22T00:58:04Z

xlsx/tests/csv.rs

+use std::process::{Command, ExitStatus};
+
+#[test]
+fn test_simple_csv() {


There are a huge number of edge cases in CSV files that you'll want to test. Escaped (or unescaped) commas inside strings, function calls, misaligned column count, etc.

This is a good point, however as I'm relying on the implementation we already have for copy/paste purposes, I tried to stay focused the parts relating to getting a file in rather than improving the csv parsing itself.

If it's a blocker I can add it as well when I get time to work on this, but the distinction made sense in my head.

AndrooFrowns added 3 commits November 9, 2024 15:19

UPDATE: setup for xlsx_2_icalc to support csv files

d47050c

UPDATE: Allow for reading in csv files from buffered readers such as …

f2b4ef4

…from disk

UPDATE: add simple test for csv conversion to .ic

bf7a0b0

jaycarlton reviewed Nov 22, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support csv import #124

Support csv import #124

AndrooFrowns commented Nov 10, 2024

AndrooFrowns commented Nov 11, 2024

nhatcher commented Nov 11, 2024

jaycarlton Nov 22, 2024

AndrooFrowns Nov 23, 2024

jaycarlton Nov 22, 2024

AndrooFrowns Nov 23, 2024

Support csv import #124

Are you sure you want to change the base?

Support csv import #124

Conversation

AndrooFrowns commented Nov 10, 2024

AndrooFrowns commented Nov 11, 2024

nhatcher commented Nov 11, 2024

jaycarlton Nov 22, 2024

Choose a reason for hiding this comment

AndrooFrowns Nov 23, 2024

Choose a reason for hiding this comment

jaycarlton Nov 22, 2024

Choose a reason for hiding this comment

AndrooFrowns Nov 23, 2024

Choose a reason for hiding this comment