Task Description for Incremental Glossary Linking

Purpose and Scope

The purpose of this task is to maintain access to the SPaDE project glossary by incrementally adding links from documentation to glossary entries as changes occur. The project documentation in scope consists of all .md files in the SPaDE project directory and its subdirectories, excluding the reviews/ and retro/ directories and any directory whose name begins with ..

Background

The SPaDE project glossary is contained in tlad001.md. Entries in the glossary define special terminology and important concepts used in the project documentation.

Task Description

This task performs incremental glossary linking - adding links for terms that have been recently added to the glossary or appear in recently modified documentation.

Determining Review Scope

Identify last review date: Check the reviews directory for the most recent glossary linking review report (filename pattern YYYYMMDD-HHMM-*-glossary-links.md)
Extract terms from glossary: Dynamically scan tlad001.md to get current term list
Determine what changed:
- Files modified since last review date (using git)
- New glossary terms added since last review
- New files created since last review

Processing Strategy

Changed or New Files

Process files modified or created since the last review, checking for ALL glossary terms (both existing and newly added).

The following special cases should be noted:

If a term is used in a context where it has a different meaning from that given in the glossary, do not insert a hyperlink.
If a term is part of a compound term (e.g., “Focal Intelligence”), only link the part of the term which is in the glossary (e.g., “Focal”). If both a part and a whole are in the glossary, link only the whole (e.g., link “Focal Intelligence” but not “Focal” in that phrase).

Unchanged Files

Process all unchanged files, but check ONLY for newly added glossary terms.

Note: Although files are unchanged, they must all be checked when new terms are added to catch occurrences of those new terms throughout the documentation.

Glossary Term Extraction

Automatically extract all terms from the glossary by:

Parsing section headers (### level) as primary terms
Extracting variations from list items starting with - **Term**: or - **[Term](...)**
If #### headers are used for term variations (per linting guidelines), extract those as well
Building a comprehensive term list with anchor links
Noting compound terms to ensure longer phrases are processed before shorter ones

Note: Glossary formatting should follow markdown linting guidelines. If lint recommends using #### headers instead of bold text for subheadings, term extraction logic must accommodate both formats.

Linking Rules

Insert hyperlinks from term occurrences to the corresponding glossary entry, following these rules:

Term text remains unchanged, only add the hyperlink
Link all occurrences of a term within a document
For compound terms, link the longest matching phrase (e.g., “Focal Intelligence” not “Focal” within that phrase)
Skip terms in contexts where the meaning differs from the glossary definition

Avoid linking terms that are:

Already inside existing markdown links
Inside code blocks (inline or fenced)
In headings (to preserve heading structure)
In URLs or other special contexts

Special cases:

Occurrences of terms in the document to which the glossary entry for the term refers should be linked to the same place as the glossary entry, not to the glossary itself.

Automation Support

A Python script is available at docs/admin/amcd001.py to assist with this task.

Required enhancements for incremental operation:

Dynamic term extraction: Parse glossary file to extract all terms automatically
- Extract ### headers as primary terms
- Extract variations from - **Term**: list items
- Extract #### headers if used for term variations (per linting guidelines)
- Handle compound terms by processing longer phrases first
File filtering: Accept list of files to process based on git change detection
Term filtering: Accept list of new terms to check in unchanged files
Reporting: Generate comparison report showing links added vs previous review

The script should avoid linking terms in code blocks, existing links, and headings.

Deliverables

Updated documentation files with new glossary links added
Review report in the reviews directory with filename YYYYMMDD-HHMM-author-glossary-links.md containing:
- Last review date used for change detection
- Number of files processed (changed vs unchanged)
- Number of new terms checked
- Total links added
- Any issues or edge cases encountered

Include both the file changes and report in a pull request.