feat: implement multiple file support in Dataset by Abdeali099 · Pull Request #81 · resilient-tech/transaction-parser

Abdeali099 · 2026-03-30T11:56:40Z

No description provided.

Abdeali099 · 2026-03-30T13:49:09Z

transaction_parser/parser_benchmark/runner.py


-        self.log.file_content = content
-        return content
+        self.log.file_content = combined


We could add a note here, something like: "An order can be made up of multiple files."

Make the separator a constant.
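
The separator suggestion could look roughly like the sketch below. It is an illustration only, not the code in runner.py: `FILE_CONTENT_SEPARATOR`, its value, and `combine_file_contents` are hypothetical names, since the actual separator string is not visible in this diff.

```python
# Hypothetical module-level constant; the real separator used in runner.py
# is not shown in this diff.
FILE_CONTENT_SEPARATOR = "\n\n---\n\n"


def combine_file_contents(contents: list[str]) -> str:
    """Join the text extracted from each file of one order into one string."""
    # Skip empty extractions so the result has no leading, trailing,
    # or doubled separators.
    return FILE_CONTENT_SEPARATOR.join(c for c in contents if c)


combined = combine_file_contents(["invoice page", "", "packing list"])
```

Keeping the separator in one named constant means anything that later needs to split the combined content can reference the same value instead of repeating a string literal.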

greptile-apps · 2026-03-30T14:05:28Z

Confidence Score: 4/5

Safe to merge after restoring per-row file type validation; all other changes are well-structured.

One P1 finding: the SUPPORTED_FILE_TYPES guard was removed without replacement, meaning unsupported file types silently pass validation and only fail during background job execution. All other findings are P2 style/UX suggestions. Restoring the validation check in validate_files would bring this to 5/5.

parser_benchmark_dataset.py — the validate_files method needs a per-row file type check restored.
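
A per-row check along the lines the review asks for might look like the sketch below. It is an illustration under stated assumptions rather than the project's code: the contents of `SUPPORTED_FILE_TYPES`, the `file_type` row key, and the `find_unsupported_rows` helper are all hypothetical, and a real `validate_files` would presumably raise a validation error instead of returning a list.

```python
# Hypothetical contents; the real SUPPORTED_FILE_TYPES is not shown on this page.
SUPPORTED_FILE_TYPES = {"pdf", "csv", "xlsx"}


def find_unsupported_rows(rows: list[dict]) -> list[str]:
    """Return one error message per child-table row with an unsupported file type."""
    errors = []
    for idx, row in enumerate(rows, start=1):
        # "file_type" is an assumed key; normalise case before comparing.
        file_type = (row.get("file_type") or "").lower()
        if file_type not in SUPPORTED_FILE_TYPES:
            errors.append(f"Row {idx}: unsupported file type '{file_type}'")
    return errors


errors = find_unsupported_rows([{"file_type": "PDF"}, {"file_type": "exe"}])
```

Running the check at validation time surfaces the problem when the document is saved, which is the behaviour the review says was lost, rather than later in the background job.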

Important Files Changed

Filename	Overview
transaction_parser/parser_benchmark/doctype/parser_benchmark_dataset/parser_benchmark_dataset.py	Core model refactored from single-file to child-table; file type validation was removed without replacement, creating a regression where unsupported file types pass validation and only fail at background job execution time.
transaction_parser/parser_benchmark/doctype/parser_benchmark_dataset/parser_benchmark_dataset.json	DocType fields migrated from single Attach+Data to a child Table; the depends_on visibility guard for the PDF Processor section was dropped, making it permanently visible regardless of uploaded file types.
transaction_parser/parser_benchmark/runner.py	Runner updated to iterate over multiple file docs and join content with a separator; logic is clean and the empty-list guard in _get_file_docs prevents IndexError.
transaction_parser/patches/populate_dataset_files_table.py	Migration patch correctly reads previously-attached File docs and inserts them as child rows; uses db_insert() which is appropriate for a data migration patch.
transaction_parser/patches/remove_dataset_file_field.py	Pre-model-sync patch ensures File docs are properly linked before the old file column is dropped; handles orphan and missing File docs gracefully.
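
The migration described for populate_dataset_files_table.py, turning previously attached files into child rows, can be sketched with plain dictionaries. This is not the patch itself: `build_child_rows`, the `files` and `file` fieldnames, and the sample values are hypothetical, and the actual patch inserts rows through `db_insert()` as noted in the table.

```python
def build_child_rows(dataset_name: str, file_urls: list[str]) -> list[dict]:
    """Build one child-table row per previously attached file."""
    return [
        {
            "parent": dataset_name,
            "parenttype": "Parser Benchmark Dataset",
            "parentfield": "files",  # hypothetical child-table fieldname
            "idx": idx,  # 1-based row order within the child table
            "file": file_url,  # hypothetical column holding the attachment
        }
        for idx, file_url in enumerate(file_urls, start=1)
    ]


rows = build_child_rows(
    "DATASET-0001", ["/private/files/a.pdf", "/private/files/b.csv"]
)
```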

Reviews (1): Last reviewed commit: "refactor: enhance Parser Benchmark Datas..."

...saction_parser/parser_benchmark/doctype/parser_benchmark_dataset/parser_benchmark_dataset.py

...ction_parser/parser_benchmark/doctype/parser_benchmark_dataset/parser_benchmark_dataset.json

Abdeali099 added 4 commits March 30, 2026 17:25

feat: implement multiple file support in Parser Benchmark Dataset and…

afd74c7

… related reports

Merge branch 'test-suite' into multifiles-dataset

80c5e60

Merge branch 'test-suite' into multifiles-dataset

2ec73f4

refactor: enhance Parser Benchmark Dataset with multiple file support…

494aa65

… and validation

Abdeali099 marked this pull request as ready for review March 30, 2026 14:01

Abdeali099 merged commit 51ccd69 into test-suite Mar 30, 2026
2 of 3 checks passed

Abdeali099 deleted the multifiles-dataset branch March 30, 2026 14:01

Abdeali099 merged 4 commits into test-suite from multifiles-dataset
