Skip to content

Fix/summarizer #16

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 59 commits into
base: main
Choose a base branch
from
Open

Fix/summarizer #16

wants to merge 59 commits into from

Conversation

a-klos
Copy link
Member

@a-klos a-klos commented Jun 6, 2025

This pull request introduces significant updates to the PageSummaryEnhancer class in admin-api-lib/src/admin_api_lib/impl/information_enhancer/page_summary_enhancer.py. The changes focus on improving the logic for summarizing pages by restructuring the grouping mechanism and adding a new method for summarizing page content asynchronously.

Enhancements to page summarization:

  • Added a new method _asummarize_page to asynchronously summarize the content of grouped page pieces. This method combines page content, invokes the summarizer, and constructs metadata with additional attributes such as a unique ID and related document IDs.

Refactoring of page grouping logic:

  • Replaced the previous grouping logic in _acreate_summary with a new approach that explicitly identifies distinct pages and groups information pieces based on their page metadata. The new logic also filters out groups that do not meet the chunk size threshold when self._chunker_settings is enabled.

a-klos and others added 28 commits June 2, 2025 07:44
…e unused managed_page_summary_enhancer module
…r DefaultSourceUploader to use it

refactor: update JSON serialization in ExtractionParameters, ExtractionRequest, InformationPiece, and KeyValuePair models
refactor: remove unused test files for confluence and thread management integration
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants