alignment

Utility methods to visualize the alignment and errors between one or more reference and hypothesis pairs.
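These utilities accept both word-level and character-level output. As a minimal character-level sketch (the exact alignment and CER figure depend on the inputs, so no output is shown here):

```python
import jiwer

# process_characters returns a CharacterOutput; the same visualization
# utilities accept it and report CER instead of WER in the summary
char_out = jiwer.process_characters(
    ["short one here"],
    ["shoe order one"],
)

print(jiwer.visualize_alignment(char_out))
```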

collect_error_counts

collect_error_counts(output)

Retrieve three dictionaries that count how often each word or character was substituted, inserted, or deleted. The substitution dictionary is keyed by 2-tuples (from, to); the other two dictionaries are keyed by the inserted or deleted words or characters.

Parameters:

    output (Union[WordOutput, CharacterOutput]): The processed output of reference and hypothesis pair(s). Required.

Returns:

    Tuple[dict, dict, dict]: A three-tuple of dictionaries, in the order substitutions, insertions, deletions.

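A minimal usage sketch (it assumes collect_error_counts is re-exported from the top-level jiwer namespace like the other utilities on this page; otherwise import it from jiwer.alignment). The expected counts follow from the visualize_error_counts example further down.

```python
import jiwer

out = jiwer.process_words(
    ["short one here", "quite a bit of longer sentence"],
    ["shoe order one", "quite bit of an even longest sentence here"],
)

substitutions, insertions, deletions = jiwer.collect_error_counts(out)

# substitution keys are (reference, hypothesis) 2-tuples,
# e.g. {('short', 'order'): 1, ('longer', 'longest'): 1}
print(dict(substitutions))

# insertion/deletion keys are the inserted/deleted words themselves,
# e.g. {'shoe': 1, 'an even': 1, 'here': 1} and {'here': 1, 'a': 1}
print(dict(insertions))
print(dict(deletions))
```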
Source code in src/jiwer/alignment.py, lines 228-262:
def collect_error_counts(output: Union[WordOutput, CharacterOutput]):
    """
    Retrieve three dictionaries, which count the frequency of how often
    each word or character was substituted, inserted, or deleted.
    The substitution dictionary has, as keys, a 2-tuple (from, to).
    The other two dictionaries have the inserted/deleted words or characters as keys.

    Args:
        output: The processed output of reference and hypothesis pair(s).

    Returns:
        (Tuple[dict, dict, dict]): A three-tuple of dictionaries, in the order substitutions, insertions, deletions.
    """
    substitutions = defaultdict(lambda: 0)
    insertions = defaultdict(lambda: 0)
    deletions = defaultdict(lambda: 0)

    for idx, sentence_chunks in enumerate(output.alignments):
        ref = output.references[idx]
        hyp = output.hypotheses[idx]
        sep = " " if isinstance(output, WordOutput) else ""

        for chunk in sentence_chunks:
            if chunk.type == "insert":
                inserted = sep.join(hyp[chunk.hyp_start_idx : chunk.hyp_end_idx])
                insertions[inserted] += 1
            if chunk.type == "delete":
                deleted = sep.join(ref[chunk.ref_start_idx : chunk.ref_end_idx])
                deletions[deleted] += 1
            if chunk.type == "substitute":
                replaced = sep.join(ref[chunk.ref_start_idx : chunk.ref_end_idx])
                by = sep.join(hyp[chunk.hyp_start_idx : chunk.hyp_end_idx])
                substitutions[(replaced, by)] += 1

    return substitutions, insertions, deletions

visualize_alignment

visualize_alignment(
    output,
    show_measures=True,
    skip_correct=True,
    line_width=None,
)

Visualize the output of jiwer.process_words and jiwer.process_characters. The visualization shows the alignment between each processed reference and hypothesis pair. If show_measures=True, the output string will also contain all measures in the output.

Parameters:

    output (Union[WordOutput, CharacterOutput]): The processed output of reference and hypothesis pair(s). Required.
    show_measures (bool): If enabled, the visualization will include measures like the WER or CER. Default: True.
    skip_correct (bool): If enabled, the visualization will exclude correct reference and hypothesis pairs. Default: True.
    line_width (Optional[int]): If set, the method tries, on a best-effort basis, to split sentences into multiple lines if they exceed the given width. Default: None.

Returns:

    str: The visualization as a string.

Example

This code snippet

import jiwer

out = jiwer.process_words(
    ["short one here", "quite a bit of longer sentence"],
    ["shoe order one", "quite bit of an even longest sentence here"],
)

print(jiwer.visualize_alignment(out))

will produce this visualization:

=== SENTENCE 1 ===

REF:    # short one here
HYP: shoe order one    *
        I     S        D

=== SENTENCE 2 ===

REF: quite a bit of  #    #  longer sentence    #
HYP: quite * bit of an even longest sentence here
           D         I    I       S             I

=== SUMMARY ===
number of sentences: 2
substitutions=2 deletions=2 insertions=4 hits=5

mer=61.54%
wil=74.75%
wip=25.25%
wer=88.89%

When show_measures=False, only the alignment will be printed:

=== SENTENCE 1 ===

REF:    # short one here
HYP: shoe order one    *
        I     S        D

=== SENTENCE 2 ===

REF: quite a bit of  #    #  longer sentence    #
HYP: quite * bit of an even longest sentence here
           D         I    I       S             I

When setting line_width=80, the following output will be split into multiple lines:

=== SENTENCE 1 ===

REF: This is a very  long sentence that is *** much longer than the previous one
HYP: This is a very loong sentence that is not much longer than the previous one
                        S                    I
REF: or the one before that
HYP: or *** one before that
          D
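For reference, the two variants above correspond to calls along these lines (the long sentences in the line_width example are illustrative and not produced by the out object from the first snippet):

```python
import jiwer

out = jiwer.process_words(
    ["short one here", "quite a bit of longer sentence"],
    ["shoe order one", "quite bit of an even longest sentence here"],
)

# alignment only, without the summary block of measures
print(jiwer.visualize_alignment(out, show_measures=False))

# wrap alignments at roughly 80 characters per line (best effort)
print(jiwer.visualize_alignment(out, line_width=80))
```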
Source code in src/jiwer/alignment.py, lines 31-159:
def visualize_alignment(
    output: Union[WordOutput, CharacterOutput],
    show_measures: bool = True,
    skip_correct: bool = True,
    line_width: Optional[int] = None,
) -> str:
    """
    Visualize the output of [jiwer.process_words][process.process_words] and
    [jiwer.process_characters][process.process_characters]. The visualization
    shows the alignment between each processed reference and hypothesis pair.
    If `show_measures=True`, the output string will also contain all measures in the
    output.

    Args:
        output: The processed output of reference and hypothesis pair(s).
        show_measures: If enabled, the visualization will include measures like the WER
                       or CER
        skip_correct: If enabled, the visualization will exclude correct reference and hypothesis pairs
        line_width: If set, try, on a best-effort basis, to split sentences into multiple lines if they exceed the width.

    Returns:
        (str): The visualization as a string

    Example:
        This code snippet

        ```python
        import jiwer

        out = jiwer.process_words(
            ["short one here", "quite a bit of longer sentence"],
            ["shoe order one", "quite bit of an even longest sentence here"],
        )

        print(jiwer.visualize_alignment(out))
        ```

        will produce this visualization:

        ```txt
        === SENTENCE 1 ===

        REF:    # short one here
        HYP: shoe order one    *
                I     S        D

        === SENTENCE 2 ===

        REF: quite a bit of  #    #  longer sentence    #
        HYP: quite * bit of an even longest sentence here
                   D         I    I       S             I

        === SUMMARY ===
        number of sentences: 2
        substitutions=2 deletions=2 insertions=4 hits=5

        mer=61.54%
        wil=74.75%
        wip=25.25%
        wer=88.89%
        ```

        When `show_measures=False`, only the alignment will be printed:

        ```txt
        === SENTENCE 1 ===

        REF:    # short one here
        HYP: shoe order one    *
                I     S        D

        === SENTENCE 2 ===

        REF: quite a bit of  #    #  longer sentence    #
        HYP: quite * bit of an even longest sentence here
                   D         I    I       S             I
        ```

        When setting `line_width=80`, the following output will be split into multiple lines:

        ```txt
        === SENTENCE 1 ===

        REF: This is a very  long sentence that is *** much longer than the previous one
        HYP: This is a very loong sentence that is not much longer than the previous one
                                S                    I
        REF: or the one before that
        HYP: or *** one before that
                  D
        ```
    """
    references = output.references
    hypothesis = output.hypotheses
    alignment = output.alignments
    is_cer = isinstance(output, CharacterOutput)

    final_str = ""
    for idx, (gt, hp, chunks) in enumerate(zip(references, hypothesis, alignment)):
        if skip_correct and (
            len(chunks) == 0 or (len(chunks) == 1 and chunks[0].type == "equal")
        ):
            continue

        final_str += f"=== SENTENCE {idx+1} ===\n\n"
        final_str += _construct_comparison_string(
            gt, hp, chunks, include_space_seperator=not is_cer, line_width=line_width
        )
        final_str += "\n"

    if show_measures:
        final_str += "=== SUMMARY ===\n"
        final_str += f"number of sentences: {len(alignment)}\n"
        final_str += f"substitutions={output.substitutions} "
        final_str += f"deletions={output.deletions} "
        final_str += f"insertions={output.insertions} "
        final_str += f"hits={output.hits}\n"

        if is_cer:
            final_str += f"\ncer={output.cer*100:.2f}%\n"
        else:
            final_str += f"\nmer={output.mer*100:.2f}%"
            final_str += f"\nwil={output.wil*100:.2f}%"
            final_str += f"\nwip={output.wip*100:.2f}%"
            final_str += f"\nwer={output.wer*100:.2f}%\n"
    else:
        # remove last newline
        final_str = final_str[:-1]

    return final_str

visualize_error_counts

visualize_error_counts(
    output,
    show_substitutions=True,
    show_insertions=True,
    show_deletions=True,
    top_k=None,
)

Visualize which words (or characters), and how often, were substituted, inserted, or deleted.

Parameters:

    output (Union[WordOutput, CharacterOutput]): The processed output of reference and hypothesis pair(s). Required.
    show_substitutions (bool): If true, visualize substitution errors. Default: True.
    show_insertions (bool): If true, visualize insertion errors. Default: True.
    show_deletions (bool): If true, visualize deletion errors. Default: True.
    top_k (Optional[int]): If set, only visualize the k most frequent errors. Default: None.

Returns:

    str: A string which visualizes the words/characters and their frequencies.

Example

The code snippet

import jiwer

out = jiwer.process_words(
    ["short one here", "quite a bit of longer sentence"],
    ["shoe order one", "quite bit of an even longest sentence here"],
)
print(jiwer.visualize_error_counts(out))

will print the following:

=== SUBSTITUTIONS ===
short   --> order   = 1x
longer  --> longest = 1x

=== INSERTIONS ===
shoe    = 1x
an even = 1x
here    = 1x

=== DELETIONS ===
here = 1x
a    = 1x
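The show_* flags and top_k can be combined to narrow the report; a small sketch reusing the same data:

```python
import jiwer

out = jiwer.process_words(
    ["short one here", "quite a bit of longer sentence"],
    ["shoe order one", "quite bit of an even longest sentence here"],
)

# only the single most frequent error per category
print(jiwer.visualize_error_counts(out, top_k=1))

# restrict the report to substitution errors
print(jiwer.visualize_error_counts(out, show_insertions=False, show_deletions=False))
```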
Source code in src/jiwer/alignment.py, lines 265-368:
def visualize_error_counts(
    output: Union[WordOutput, CharacterOutput],
    show_substitutions: bool = True,
    show_insertions: bool = True,
    show_deletions: bool = True,
    top_k: Optional[int] = None,
):
    """
    Visualize which words (or characters), and how often, were substituted, inserted, or deleted.

    Args:
        output: The processed output of reference and hypothesis pair(s).
        show_substitutions: If true, visualize substitution errors.
        show_insertions: If true, visualize insertion errors.
        show_deletions: If true, visualize deletion errors.
        top_k: If set, only visualize the k most frequent errors.

    Returns:
         (str): A string which visualizes the words/characters and their frequencies.

    Example:
        The code snippet
        ```python3
        import jiwer

        out = jiwer.process_words(
            ["short one here", "quite a bit of longer sentence"],
            ["shoe order one", "quite bit of an even longest sentence here"],
        )
        print(jiwer.visualize_error_counts(out))
        ```

        will print the following:

        ```txt
        === SUBSTITUTIONS ===
        short   --> order   = 1x
        longer  --> longest = 1x

        === INSERTIONS ===
        shoe    = 1x
        an even = 1x
        here    = 1x

        === DELETIONS ===
        here = 1x
        a    = 1x
        ```
    """
    s, i, d = collect_error_counts(output)

    def build_list(errors: dict):
        if len(errors) == 0:
            return "none"

        keys = [k for k in errors.keys()]
        keys = sorted(keys, reverse=True, key=lambda k: errors[k])

        if top_k is not None:
            keys = keys[:top_k]

        # we get the maximum length of all words to nicely pad output
        ln = max(len(k) if isinstance(k, str) else max(len(e) for e in k) for k in keys)

        # here we construct the string
        build = ""

        for count, (k, v) in enumerate(
            sorted(errors.items(), key=lambda tpl: tpl[1], reverse=True)
        ):
            if top_k is not None and count >= top_k:
                break

            if isinstance(k, tuple):
                build += f"{k[0]: <{ln}} --> {k[1]:<{ln}} = {v}x\n"
            else:
                build += f"{k:<{ln}} = {v}x\n"

        return build

    output = ""

    if show_substitutions:
        if output != "":
            output += "\n"
        output += "=== SUBSTITUTIONS ===\n"
        output += build_list(s)

    if show_insertions:
        if output != "":
            output += "\n"
        output += "=== INSERTIONS ===\n"
        output += build_list(i)

    if show_deletions:
        if output != "":
            output += "\n"
        output += "=== DELETIONS ===\n"
        output += build_list(d)

    if output[-1:] == "\n":
        output = output[:-1]

    return output