Conversation

@SimBe195 (Collaborator) commented Oct 8, 2025

Currently, in LexiconfreeTimesyncBeamSearch and TreeTimesyncBeamSearch, sentence-end is not handled by the LabelScorer; it is only scored by the word-level LM. This PR adds logic to properly handle sentence-end transitions in the new search. The changes consist of the following points:

  • SENTENCE_END is added as a new TransitionType for the LabelScorer.
  • For LexiconfreeTimesyncBeamSearch:
    • A sentence-end index can be specified as a parameter.
    • The inferTransitionType function is adjusted accordingly to assign the new SENTENCE_END transition type.
    • If a sentence-end index is configured, at the end of a segment only hypotheses that have emitted this sentence-end are kept (otherwise the sentence-end fallback is applied).
  • For TreeTimesyncBeamSearch:
    • The CtcTreeBuilder is modified to include the sentenceEndLemma in the tree if it exists and has pronunciations.
    • A set of finalStates is added to the PersistentStateTree. This is used in the search to determine which states are considered valid at segment end. If sentence-end is included in the tree, only the sentence-end sink state is added as a final state.
    • A sentenceEndLabelIndex_ is added as a member to the search algorithm and inferred from the lexicon.
    • The inferTransitionType function is also adjusted to produce the SENTENCE_END transition type.
    • Second-order exits are added to word-end hypotheses in decodeStep. This is needed because when the sentenceEndLemma has an empty pronunciation (i.e., it should only be scored by the LM and not the LabelScorer), a hypothesis may need to take a normal word-end exit and the sentence-end exit back-to-back in the same decode step.

Depends on changes to the transition types from #138.
Still requires testing.

Here are some plots of the new tree structure including sentence-end:

Tree without sentence-end lemma in lexicon:
(image: no_eos)

Tree with sentence-end lemma with empty pronunciation in lexicon:
(image: with_eos_no_pron)

Tree with sentence-end lemma and non-empty pronunciation in lexicon:
(image: with_eos_with_pron)

Base automatically changed from tdp_label_scorer to master October 8, 2025 12:59
// Add optional blank after the sentence-end lemma
if (allowBlankAfterSentenceEnd_) {
    StateId blankAfter = extendState(sentenceEndSink_, blankDesc_);
    addExit(blankAfter, sentenceEndSink_, lexicon_.specialLemma("blank")->id());
}
Contributor:
Do we also need to add a loop for this blank state?

addTransition(blankAfter, blankAfter);

Contributor:
I don't think so, because if you take the exit at this state, you are transferred to the root again (tr=2), which has only this one state as successor. So you effectively already have the loop blank-state -> root -> blank-state -> root -> ...
The only aspect we might need to think about is that this blank always counts as a word-end hypothesis. However, I think this is fine because we actually are at a word end; blanks between words also count as word-end hypotheses.

Contributor:
Right, that is a loop already.
Now that you mention that this blank should be a word-end hypothesis, I agree, but it seems that's not the case right now: reachedSentenceEnd is set to true only when the next token is SENTENCE_END, and there is no SENTENCE_END-to-BLANK transition type for which we'd also want to set it to true.

Contributor:
The idea is to set reachedSentenceEnd to true once we have a SENTENCE_END transition; afterwards it can't be set to false again. But you're right, I also see a problem there now: in the LabelHypothesis of the lexiconfree search, reachedSentenceEnd is also set to true for many other transition types. I think this is indeed a bug.

Collaborator (author):
Ah right, this was a bug. Fixed now.

hypIndex};

auto const sts = lemma->syntacticTokenSequence();
if (sts.size() != 0) {
Contributor:
I think I have seen more than one syntacticTokenSequence in a lemma. Do we need this assertion?

Contributor:
I introduced that in PR #145. First of all, we need to make sure that we have at least one syntacticTokenSequence; otherwise the lemma should not be scored by the LM. We currently don't support multiple syntacticTokenSequences in general, therefore we require exactly one. Alternatively we would have to pick one, probably the first, anyway. I would say this is an aspect for future work.

Collaborator (author):
Yes, we will implement support for multi-token sequences when we actually need it. I haven't seen this case so far, and it would complicate the logic a bit: we currently delay the history update until after pruning, which we can't do if multiple tokens in a row need to be scored.

@hannah220 (Contributor):
In the graph above, t is time, m is the emission index, and tr is the traceback?

@larissakl (Contributor):
@hannah220 With m you're correct, it's the emission index, in this case (monophones) it's just the output index (=the position in the lexicon), so

m=0 -> </s>
m=1 -> A
m=2 -> B
m=3 -> _

t is the transition index (it doesn't matter here), and tr is the transition state of an exit: when you are at a word end, you transition to this state. For example, after predicting </s> you have tr=2, which means you go to the root state with ID 2, i.e. the new "sentence-end root state", while with all "normal" exits you go to the "normal" root with ID 1 because of tr=1.
