Semantic markup and ICS table generation #402

PaulMartinsen · 2025-03-06T06:57:24Z

📑 Description

This PR provides document processing support to:

- extend semantic markup of requirements to generate rich JSON extracts for the SDPi requirements core model,
- prototype semantic markup of use-cases to generate JSON extracts,
- insert automatically generated implementation conformance statement (ICS) tables,
- copy links to SDPi requirements to the system clipboard from the published sdpi-profile web document.

This slide show PDF provides a high-level overview.

This PR does not include the changes to the AsciiDoc source required to use the new processing features. Instead, it is nearly 100% backwards compatible, producing HTML output that is functionally identical to the current output.

Additional markup is needed in the source document, once this PR is done, to take advantage of these new features. I just don't want to make changes to the document source until everyone has had a chance to provide feedback at this stage. The goal of this PR is to get the processing and markup roughly right; then I can start incorporating new features into the source files and get further feedback on how it works in practice.

Refer to the cookbook for documentation of the new markup and the test cases for examples.

Most of the document processing has been shifted from block processors to a tree processor. This mainly enables cross-referencing the bibliography from `sdpi_requirement_ref_standard` requirements, and simplifies tagging use-case requirements with the related use-case. It may also reduce the amount of work needed to extend the processor further (extracting transactions, for example).
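To illustrate why whole-tree access helps with cross-referencing, here is a minimal sketch in Python (not the Kotlin processor from this PR). It assumes a simplified node model; the `Node` class, the role names `requirement` and `bibliography-entry`, and the `ref_id` attribute are all hypothetical. The first pass indexes every bibliography entry in the tree, so a requirement can reference an entry that appears later in the source.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """A simplified, hypothetical stand-in for a parsed document block."""
    node_id: str
    roles: set[str] = field(default_factory=set)
    attributes: dict[str, str] = field(default_factory=dict)
    children: list["Node"] = field(default_factory=list)


def walk(node):
    """Yield the node and all of its descendants in document order."""
    yield node
    for child in node.children:
        yield from walk(child)


def resolve_standard_references(root):
    """Map each referencing requirement id to its bibliography entry id.

    Pass 1 indexes every bibliography entry in the whole tree; pass 2
    resolves requirement references, so source order does not matter.
    """
    bibliography = {n.node_id for n in walk(root) if "bibliography-entry" in n.roles}
    resolved, unresolved = {}, []
    for n in walk(root):
        target = n.attributes.get("ref_id")  # hypothetical attribute name
        if "requirement" not in n.roles or target is None:
            continue
        if target in bibliography:
            resolved[n.node_id] = target
        else:
            unresolved.append(n.node_id)  # report instead of failing silently
    return resolved, unresolved
```

A block processor that only sees one block at a time, in source order, could not resolve the forward reference without extra bookkeeping.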

Testing:

- Unit tests were added to the processing tool to check output generation for each requirement and use-case.
- HTML output documents generated from the master branch and from this PR were compared to check that the output was (materially) identical. See backwards compatibility for differences.

☑ Mandatory Tasks

The following aspects have been respected by the pull request assignee and at least one reviewer:

Changelog update (necessity checked and entry added or not added respectively)
- Pull Request Assignee
- Reviewer

…e entities. Added requirement type. Tidy

Separated block processing and semantic information extraction. Added tree-processor to dump document tree for diagnostics. Created polymorphic requirement classes for each kind of requirement. Added semantic blocks for requirement parts. Added basic support to extract use cases. Cleanup output trace.

Added reference information to ref_ics requirement.

Changes for backwards compatibility with current formatting:
* use local requirement id,
* default to SDPi for requirement source,
* treat unstyled content as normative,
* support note paragraphs,
* default to "tech_feature" requirement if type not specified.
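The defaults listed in this commit message could be sketched roughly as follows. This is a Python illustration of one reading of the commit message, not the processor's Kotlin code; the dictionary keys (`local_id`, `global_id`, `source`, `type`, `parts`, `role`) are hypothetical names chosen for the example.

```python
def apply_legacy_defaults(block):
    """Return requirement metadata with backwards-compatible defaults filled in.

    `block` is a plain dict standing in for a parsed requirement block; all
    key names here are hypothetical.
    """
    parts = []
    for part in block.get("parts", []):
        # Unstyled content (no role) is treated as normative.
        parts.append({"role": part.get("role") or "normative", "text": part["text"]})
    return {
        # Use the local requirement id when no global id is available.
        "id": block.get("global_id") or block["local_id"],
        # Requirements without an explicit source are assumed to come from SDPi.
        "source": block.get("source") or "SDPi",
        # Requirements without an explicit type default to "tech_feature".
        "type": block.get("type") or "tech_feature",
        "parts": parts,
    }
```

With defaults applied in one place, existing source blocks that carry no new markup still produce complete metadata.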

…mentMetadata

Tidy.

Requirement link fallback to local ids.

Tidy

Added notes on backwards compatibility.

Fixed tests:
* normalize compared strings to use the system line-ending character.
* removed ':' in expected output (it isn't included in output anymore)
* added `sdpi_offset` to input documents as is done in supplement text.

Added backwards compatibility info.
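The line-ending normalization mentioned in this commit can be illustrated with a small Python sketch. The helper names are hypothetical and the actual tests are written in Kotlin; the idea is only that both strings are rewritten to the same line ending before they are compared.

```python
import os


def normalize_line_endings(text, newline=os.linesep):
    """Rewrite every line ending in `text` as `newline`."""
    # Collapse Windows (\r\n) and bare \r endings to \n first, so the
    # final replace cannot double up carriage returns.
    unified = text.replace("\r\n", "\n").replace("\r", "\n")
    return unified.replace("\n", newline)


def assert_same_output(expected, actual):
    """Compare two documents while ignoring line-ending differences."""
    assert normalize_line_endings(expected) == normalize_line_endings(actual)
```

This keeps expected-output files comparable whether they were checked out on Windows or on a Unix-like system.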

…ocument processor.

… into 2024-11-PJM-RequirementMetadata

Removed old code.

d-gregorczyk · 2025-03-11T15:41:42Z

I must say - this is too many features in one branch.

- It contains ICS functionality and requirements interop in one PR, plus there are side features such as the "clipboard copy".
- I also saw macros dispatching to different output formats; I do not understand why we need that.
- Does it resonate with the requirements interoperability approach, and has this approach been validated by anyone?

My gut feeling is that we are drowning in complexity here, and that we should have discussed the interop approach and cleaned up my tainted code in advance.

d-gregorczyk · 2025-03-11T15:13:31Z

asciidoc/images/octicons-16.svg

Is this graphic freely available or drawn by yourself? We otherwise need to reference sources or include licenses in SDPi. Is this graphic really needed?

PaulMartinsen · 2025-03-11

It was part of the copy-to-clipboard fragment, which is under the Mozilla Public License; the source is referenced in the fragment.

d-gregorczyk · 2025-03-12

Please note that the image itself is listed under the MIT license, and the license information gets lost once we export to PDF. That's just my concern.

PaulMartinsen · 2025-03-11T20:10:43Z

Yes, there is a lot to unpack here.

- By 'interop', I think you mean the JSON export for external tooling; otherwise I got the wrong idea.
- I see ICS functionality as the short-term payoff for the SDPi document. Exporting to JSON is useful for external tooling, but as far as I know that tooling doesn't exist and isn't being worked on yet. So it may be useful to consider how interoperability support can benefit the document itself. ICS tables are a good example IMHO, and one of the use cases @ToddCooper highlighted at the last SDPi workshop. Anyway, they are trivial after gathering all the requirements.
- It is intended to resonate with the requirements interoperability approach @ToddCooper shared at the last SDPi workshop, or at least as far as I understood it. There have been a few practical difficulties with further discourse to date. GitHub at least provides a way to do this collaboratively and asynchronously.
- I stuck with JSON as the interop medium, but refined the format to separate content after I found that the original format, with normative statements, notes, examples etc. all smashed together, didn't really work for me. Is anyone using the JSON extracts currently? Could we get input from them?
- I don't think there was any tainted code to clean up; without the prior version I wouldn't have had anything to build on, and the barrier to learning Kotlin would have been too great.
- I'm not sure what you mean by macro dispatch. Guessing:
  - In the AsciiDoc source I exclude some JavaScript content when generating PDFs; otherwise it just shows up as text in the PDF.
  - There are macro extensions in the processor for generating reference and use-case cross-references. These support using OIDs for permalinks if we want to do that in the future.
- Clipboard copy was to make this worthwhile for me; I find the SDPi document really hard to work with as a user. For example, I regularly need to create references to requirements, which is very difficult. I plan to make a suggestion around the "table of contents" too, if I get time to figure out how that works.

As I understand it, interop is important but not urgent; we have the time to learn to swim here.

To cut through the complexity: a key outcome here is that tree-processor extensions let us extract the information needed using roles. It isn't necessary to create block processors for interop features. This also lets the tree processor work with the entire document; we aren't limited to source order.
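As a rough illustration of role-based extraction, the following Python sketch walks a nested document tree, picks out blocks by role, and keeps normative statements, notes and examples in separate fields before exporting JSON. It is not the processor's Kotlin code; the dict-based node shape and the role names (`requirement`, `normative`, `note`, `example`) are assumptions made for the example.

```python
import json

# Hypothetical roles for the parts inside a requirement block.
PART_ROLES = ("normative", "note", "example")


def walk(node):
    """Yield a node and all of its descendants in document order."""
    yield node
    for child in node.get("children", []):
        yield from walk(child)


def extract_requirements(root):
    """Collect requirement blocks by role, keeping their parts separate."""
    requirements = []
    for node in walk(root):
        if "requirement" not in node.get("roles", []):
            continue
        entry = {"id": node["id"], **{role: [] for role in PART_ROLES}}
        for part in node.get("children", []):
            for role in part.get("roles", []):
                if role in PART_ROLES:
                    entry[role].append(part["text"])
        requirements.append(entry)
    return requirements


def export_json(root):
    """Serialize the extracted requirements as a JSON string."""
    return json.dumps(extract_requirements(root), indent=2)
```

Because extraction depends only on roles, adding a new kind of semantic block means recognising one more role in the walk rather than writing another block processor. Once requirements are gathered this way, a table such as an ICS listing is a filter over the resulting list.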

PaulMartinsen · 2025-03-11T22:50:58Z

SDPi requirements export.pdf provides a higher-level overview.

d-gregorczyk · 2025-03-12T13:20:34Z

OK, I would like Todd to double-check this approach so he can give his remarks from a high-level design perspective (so he should not look at the Kotlin code :-) ).

From what I understand, RI and ICS tables do not have anything in common: RI is a means to relate requirements to other, partly external, requirements, while ICS tables are the means to provide conformity statements for sets of requirements.

Anyway, I still think the pull request is way too complex and should be decomposed and discussed. However, if it is reviewed and understood thoroughly by our stakeholders, I do not have any objection to merging it eventually.

And please do not get me wrong: I am totally advocating the idea of tweaking the SDPi input, I am just concerned that this might be overwhelming to the authors of the document (including me ;-) ).

… forks to run most of processing without the secret to upstream repository.

PaulMartinsen · 2025-03-18T05:28:47Z

@ToddCooper, when we discussed gathering requirements for the profiles (such as SDPi-P), I had the idea that there would be a bunch of requirements defined in §1:2.3.10, for example, and that we wanted to make a list of those. Looking closer, I see that section doesn't actually define requirements. Instead, it references transactions, use-cases and content modules.

So my new idea is that we want to gather all the requirements in the referenced use-cases, transactions and content modules to make a list of requirements for each profile. Is that correct?

Assuming it is correct, in which direction do you imagine these relationships are best defined in the AsciiDoc source? "Best" really means: what is most logical for how everyone thinks about the document, or what is most natural and easiest for editors? In other words, do you prefer that:

- when editing a use-case/transaction/content module, we assign it to a profile, or
- in the profile section of the source document, we select the relevant use-cases, transactions and content modules that belong to that profile?

To me, the second option makes more sense. That is, there's a smorgasbord of use-cases/transactions/content-modules that profiles pick and choose from.

Of course, the document processor will be able to figure out the relationship in both directions with either option.

Finally, there is a third option. The processor could discover actors and may be able to build relationships based on where actors are used. For example, SDPi-P defines the SOMDS Provider. So "membership" in SDPi-P could include any use-case/transaction/content module that refers to this actor. It's possible this will work magically, but it may also be frustrating to figure out why relationships are being created, because there isn't really much in the way of diagnostic tooling. Could be fun to try though :)
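To make the options concrete, here is a small Python sketch under stated assumptions; nothing in it comes from the actual processor. The first function shows that an explicit mapping in one direction can be inverted to get the other, which is why either of the first two options works. The second function sketches the actor-based discovery of the third option and records which actor caused each link, as a basic diagnostic. The item names, the second profile and the second actor used in the usage example are placeholders.

```python
from collections import defaultdict


def invert(profile_to_items):
    """Given profile -> items, derive item -> profiles."""
    item_to_profiles = defaultdict(set)
    for profile, items in profile_to_items.items():
        for item in items:
            item_to_profiles[item].add(profile)
    return dict(item_to_profiles)


def membership_by_actor(profile_actors, item_actors):
    """Link an item to a profile when it refers to one of the profile's actors.

    The result maps profile -> item -> sorted list of shared actors, so it is
    possible to see why each relationship was created.
    """
    membership = defaultdict(dict)
    for profile, actors in profile_actors.items():
        for item, used in item_actors.items():
            shared = actors & used
            if shared:
                membership[profile][item] = sorted(shared)
    return dict(membership)
```

For example, `membership_by_actor({"SDPi-P": {"SOMDS Provider"}}, {"use-case-a": {"SOMDS Provider", "Other Actor"}})` links `use-case-a` to SDPi-P and names `SOMDS Provider` as the reason.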

PaulMartinsen added 30 commits November 24, 2024 13:05

Added document for testing

83db68b

Initial go at processing requirement metadata.

db1901e

Fix up paths and remove redundant references

b163244

Added test documents

e5d50d2

Added support for source specification oids through document attribut…

a5b2458

…e entities. Added requirement type. Tidy

Tidy

1cb5b33

Added examples for testing.

2ad0ffc

Added basic support extract use case semantics

0579a69

Added reference information to ref_ics requirement.

Added oid definitions.

4ee331e

Changes for backwards compatibility with current formatting: * use local requirement id, * default to SDPi for requirement source, * treat unstyled content as normative, * support note paragraphs, * default to "tech_feature" requirement if type not specified.

Added support to create implementation conformance statement tables.

c4c5b69

Merge remote-tracking branch 'origin/master' into 2024-11-PJM-Require…

6d4e5e3

…mentMetadata

Don't add global id when oid information isn't available.

a13f5b6

Document DumpTreeInfo breaking macro replacement.

aefff5a

Tidy.

Support unfiltered requirements tables.

c0a4070

Requirement link fallback to local ids.

Applied Kotlin style and coding standards from IDE.

0c2b012

Fixed image example path.

5de1231

Started documenting semantic formatting of requirements and use-cases.

3f141d4

Tidy

Documenting more use-cases, and inserting tables.

03cbd9e

Fix test build error; tidy language in doc

22697fc

Added command line option to control structure diagnosis dump.

8c65e4f

Added notes on backwards compatibility.

Added tests for requirements, use-cases, ICS tables.

8b881ec

Refactored numbering tests to isolate processor tested and simplify d…

39aa426

…ocument processor.

Tidy converter options

d8d5cfe

Fixes for PDF target

36e057d

Expanded document generation docs to cover test tools.

ef1957d

Fixed PDF icons

552685e

Merge remote-tracking branch 'origin/master' into 2024-11-PJM-Require…

fee61b6

…mentMetadata

Don't add global id when oid information isn't available.

b2888fa

PaulMartinsen added 18 commits March 6, 2025 17:05

Document DumpTreeInfo breaking macro replacement.

6266aca

Tidy.

Support unfiltered requirements tables.

96af868

Requirement link fallback to local ids.

Applied Kotlin style and coding standards from IDE.

532bd3e

Fixed image example path.

ff11cc9

Started documenting semantic formatting of requirements and use-cases.

a589ef0

Tidy

Documenting more use-cases, and inserting tables.

21c3a65

Fix test build error; tidy language in doc

b0c576c

Added command line option to control structure diagnosis dump.

518fd4f

Added notes on backwards compatibility.

Added tests for requirements, use-cases, ICS tables.

8ddadff

Refactored numbering tests to isolate processor tested and simplify d…

07c3fbf

…ocument processor.

Tidy converter options

92fc103

Fixes for PDF target

fa1c0a1

Expanded document generation docs to cover test tools.

b007f17

Fixed PDF icons

c324962

Merge remote-tracking branch 'origin/2024-11-PJM-RequirementMetadata'…

215a038

… into 2024-11-PJM-RequirementMetadata

Removed PDF referering to itself.

fb76947

Removed testdoc used for development.

8a717d3

PaulMartinsen self-assigned this Mar 6, 2025

Added copy2clipboard back; fixed for asciidoc folder names.

9632fa6

Removed old code.

PaulMartinsen requested review from d-gregorczyk and ToddCooper March 6, 2025 07:02

d-gregorczyk reviewed Mar 11, 2025

PaulMartinsen added 2 commits March 15, 2025 03:31

Minor change to trigger actions?

ffa4e77

Support special value "skip" for SDPI_API_ACCESS_TOKEN_SECRET. Allows…

d526f4f

… forks to run most of processing without the secret to upstream repository.
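The "skip" value described in this commit could work along the following lines. This is a hedged Python sketch, not the code from the commit: the function name is hypothetical, and raising an error when the variable is unset is an assumption about the intended behaviour. Only the variable name `SDPI_API_ACCESS_TOKEN_SECRET` and the special value "skip" come from the commit message.

```python
import os

SKIP = "skip"


def resolve_api_token(env=None):
    """Return the access token, or None when token-dependent steps should be skipped."""
    env = os.environ if env is None else env
    token = env.get("SDPI_API_ACCESS_TOKEN_SECRET", "")
    if token == SKIP:
        # Forks without access to the upstream secret can still run the
        # rest of the processing.
        return None
    if not token:
        # Assumed behaviour: fail loudly rather than silently skipping.
        raise RuntimeError(
            "SDPI_API_ACCESS_TOKEN_SECRET is not set; use 'skip' to run without it"
        )
    return token
```

Callers would then guard any step that needs the upstream repository with a check for `None`.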
