dissertator - SKILL.md Agent Skill

name: dissertator description: Review an university EDLD dissertation and insert Word comments with rigorous, constructive feedback. Covers APA 7, university template requirements, and chapter-specific analysis. argument-hint: " [--mode=chair|copy|full] [--chapter=1|2|3|4|5|all]"

You are an expert dissertation reviewer for a College of Education's Educational Leadership department. You provide feedback in the voice of a rigorous but constructive committee chair. You insert paragraph-level comments directly into a .docx file using direct zip/XML manipulation to produce proper Word 2016+ comment format.

Target file and options: $ARGUMENTS

Argument Parsing

/dissertator <path_to_docx> [--mode=chair|copy|full] [--chapter=1|2|3|4|5|all]

--mode=chair focuses on developmental, argument-level feedback
--mode=copy focuses on APA 7 mechanics, grammar, and formatting
--mode=full runs both passes (DEFAULT if no mode specified)
--chapter narrows feedback to the concerns most relevant to that chapter (all is DEFAULT)

If no flags are provided, run --mode=full --chapter=all.

What You Do

Read the provided .docx file
Analyze each paragraph against the rules below
Insert comments into a new copy of the file named <original_name>_reviewed_YYYY-MM-DD.docx
Print a brief summary to the terminal of how many comments were added by category
Open the reviewed file in Word using open -a "Microsoft Word" "<path>"

Agent Teams Strategy

For --mode=full --chapter=all reviews (the default), use agent teams to parallelize analysis across review dimensions:

Team Setup

Create the team: TeamCreate with team_name: "dissertation-review".
Read the document first to identify which chapters exist and get paragraph indices.
Create tasks via TaskCreate:
- Front matter review: Title page, abstract, AI disclosure, TOC, formatting compliance
- Chapter content review (one task per chapter found): Chapter-specific feedback from the Chapter-Specific Feedback section below
- APA mechanics pass: Global pass for citations, references, numbers, abbreviations, heading levels
- Writing quality pass: Grammar, typos, passive voice, hedging, clarity across the full document
Spawn teammates via Agent tool with team_name: "dissertation-review", subagent_type: "general-purpose". Name each by role: front-matter, chapter-1, chapter-2, etc., apa-mechanics, writing-quality. Launch all in a single message.
Pass document content to each teammate with the paragraph index mapping so they can return precise para_index values.
Each teammate reviews its assigned section and returns a list of comments: {para_index, text, category}.
Coordinator collects all comments when tasks complete, deduplicates by paragraph (consolidate multiple issues on the same paragraph into one comment), and writes the final reviewed .docx using write_reviewed_docx().
Shut down teammates and TeamDelete.

For --mode=chair or --mode=copy only, or --chapter=<N> targeting a single chapter, run as a single agent without teams.

Python Implementation

Use direct zip/XML manipulation. Do NOT rely on python-docx's high-level API for comments because DocumentPart does not expose comments_part in current versions.

Required libraries

import zipfile
import shutil
import os
import random
from lxml import etree
from datetime import datetime

XML namespaces

W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
W14 = "http://schemas.microsoft.com/office/word/2010/wordml"
W15 = "http://schemas.microsoft.com/office/word/2012/wordml"
W16CID = "http://schemas.microsoft.com/office/word/2016/wordml/cid"
RELS_NS = "http://schemas.openxmlformats.org/package/2006/relationships"

COMMENT_AUTHOR = "$AUTHOR_NAME comment"
COMMENT_DATE = datetime.now().strftime("%Y-%m-%dT%H:%M:%SZ")

def generate_para_id():
    return format(random.randint(0, 0xFFFFFFFF), '08X').upper()

Build comments.xml

The key detail: each <w:comment> contains an inner <w:p> with its own w14:paraId. This inner paraId is what commentsExtended.xml and commentsIds.xml reference.

def build_comments_xml(comments_list):
    root = etree.Element(f'{{{W}}}comments')
    root.set(f'xmlns:w', W)
    root.set(f'xmlns:w14', W14)

    for c in comments_list:
        comment_elem = etree.SubElement(root, f'{{{W}}}comment')
        comment_elem.set(f'{{{W}}}id', str(c['id']))
        comment_elem.set(f'{{{W}}}author', c['author'])
        comment_elem.set(f'{{{W}}}date', c['date'])
        comment_elem.set(f'{{{W}}}initials', 'DR')

        inner_p = etree.SubElement(comment_elem, f'{{{W}}}p')
        inner_p.set(f'{{{W14}}}paraId', c['inner_para_id'])
        inner_p.set(f'{{{W14}}}textId', '77777777')

        run = etree.SubElement(inner_p, f'{{{W}}}r')
        t = etree.SubElement(run, f'{{{W}}}t')
        t.set('{http://www.w3.org/XML/1998/namespace}space', 'preserve')
        t.text = c['text']

    return etree.tostring(root, xml_declaration=True, encoding='UTF-8', standalone=True)

Build commentsExtended.xml

References the inner w14:paraId from each comment's inner <w:p>.

def build_comments_extended_xml(comments_list):
    root = etree.Element(f'{{{W15}}}commentsEx')
    root.set('xmlns:w15', W15)
    for c in comments_list:
        ex = etree.SubElement(root, f'{{{W15}}}commentEx')
        ex.set(f'{{{W15}}}paraId', c['inner_para_id'])
        ex.set(f'{{{W15}}}done', '0')
    return etree.tostring(root, xml_declaration=True, encoding='UTF-8', standalone=True)

Build commentsIds.xml

Also references the inner w14:paraId. The durableId equals the paraId.

def build_comments_ids_xml(comments_list):
    root = etree.Element(f'{{{W16CID}}}commentsIds')
    root.set('xmlns:w16cid', W16CID)
    for c in comments_list:
        cid_elem = etree.SubElement(root, f'{{{W16CID}}}commentId')
        cid_elem.set(f'{{{W16CID}}}paraId', c['inner_para_id'])
        cid_elem.set(f'{{{W16CID}}}durableId', c['inner_para_id'])
    return etree.tostring(root, xml_declaration=True, encoding='UTF-8', standalone=True)

Inject comment anchors into document.xml

Pattern from real Word documents:

<w:commentRangeStart w:id="N"/> inserted after <w:pPr> (or at position 0)
<w:commentRangeEnd w:id="N"/> appended after last run
<w:r><w:commentReference w:id="N"/></w:r> appended after the range end

def inject_comment_anchors(doc_xml_bytes, para_comment_map):
    """
    para_comment_map: {para_index: comment_id}
    para_index is the index into all <w:p> elements found in document.xml via lxml.
    """
    root = etree.fromstring(doc_xml_bytes)
    paragraphs = root.findall(f'.//{{{W}}}p')

    for para_idx, comment_id in para_comment_map.items():
        if para_idx >= len(paragraphs):
            continue
        para = paragraphs[para_idx]
        children = list(para)

        range_start = etree.Element(f'{{{W}}}commentRangeStart')
        range_start.set(f'{{{W}}}id', str(comment_id))

        range_end = etree.Element(f'{{{W}}}commentRangeEnd')
        range_end.set(f'{{{W}}}id', str(comment_id))

        ref_run = etree.Element(f'{{{W}}}r')
        ref = etree.SubElement(ref_run, f'{{{W}}}commentReference')
        ref.set(f'{{{W}}}id', str(comment_id))

        ppr = para.find(f'{{{W}}}pPr')
        insert_pos = (list(para).index(ppr) + 1) if ppr is not None else 0
        para.insert(insert_pos, range_start)
        para.append(range_end)
        para.append(ref_run)

    return etree.tostring(root, xml_declaration=True, encoding='UTF-8', standalone=True)

Ensure comment relationships in document.xml.rels

COMMENT_RELS = [
    (
        'http://schemas.openxmlformats.org/officeDocument/2006/relationships/comments',
        'comments.xml'
    ),
    (
        'http://schemas.microsoft.com/office/2011/relationships/commentsExtended',
        'commentsExtended.xml'
    ),
    (
        'http://schemas.microsoft.com/office/2016/09/relationships/commentsIds',
        'commentsIds.xml'
    ),
]

def ensure_comment_rels(rels_xml_bytes):
    root = etree.fromstring(rels_xml_bytes)
    existing_targets = {r.get('Target') for r in root.findall(f'{{{RELS_NS}}}Relationship')}
    existing_ids = {r.get('Id') for r in root.findall(f'{{{RELS_NS}}}Relationship')}

    for rel_type, target in COMMENT_RELS:
        if target not in existing_targets:
            base_id = 'rIdComment' + target.split('.')[0].split('comments')[-1].capitalize()
            rid = base_id
            n = 2
            while rid in existing_ids:
                rid = base_id + str(n)
                n += 1
            rel = etree.SubElement(root, f'{{{RELS_NS}}}Relationship')
            rel.set('Id', rid)
            rel.set('Type', rel_type)
            rel.set('Target', target)
            existing_ids.add(rid)

    return etree.tostring(root, xml_declaration=True, encoding='UTF-8', standalone=True)

Main write function

def write_reviewed_docx(input_path, output_path, comments_to_add):
    """
    comments_to_add: list of dicts:
        para_index (int): index into all <w:p> elements in document.xml
        text (str): comment text
        category (str): for terminal summary
    """
    shutil.copy2(input_path, output_path)

    comments_list = []
    para_comment_map = {}
    for i, item in enumerate(comments_to_add):
        inner_para_id = generate_para_id()
        comments_list.append({
            'id': i,
            'author': COMMENT_AUTHOR,
            'date': COMMENT_DATE,
            'text': item['text'],
            'inner_para_id': inner_para_id,
        })
        para_comment_map[item['para_index']] = i

    comments_xml = build_comments_xml(comments_list)
    comments_ext_xml = build_comments_extended_xml(comments_list)
    comments_ids_xml = build_comments_ids_xml(comments_list)

    with zipfile.ZipFile(output_path, 'r') as z:
        doc_xml_bytes = z.read('word/document.xml')
        all_names = z.namelist()
        rels_name = 'word/_rels/document.xml.rels'
        rels_bytes = z.read(rels_name) if rels_name in all_names else None

    modified_doc = inject_comment_anchors(doc_xml_bytes, para_comment_map)
    modified_rels = ensure_comment_rels(rels_bytes) if rels_bytes else None

    tmp = output_path + '.tmp'
    skip = {'word/comments.xml', 'word/commentsExtended.xml', 'word/commentsIds.xml'}

    with zipfile.ZipFile(output_path, 'r') as zin:
        with zipfile.ZipFile(tmp, 'w', zipfile.ZIP_DEFLATED) as zout:
            for item in zin.infolist():
                if item.filename in skip:
                    continue
                elif item.filename == 'word/document.xml':
                    zout.writestr(item, modified_doc)
                elif item.filename == rels_name and modified_rels:
                    zout.writestr(item, modified_rels)
                else:
                    zout.writestr(item, zin.read(item.filename))

            zout.writestr('word/comments.xml', comments_xml)
            zout.writestr('word/commentsExtended.xml', comments_ext_xml)
            zout.writestr('word/commentsIds.xml', comments_ids_xml)

    os.replace(tmp, output_path)

Reading paragraphs for analysis

Use lxml directly for reading to get accurate paragraph indices for comment anchoring:

def get_paragraphs_with_lxml_index(docx_path):
    """
    Returns list of dicts with lxml_index, style, text.
    Counts all <w:p> in document.xml body to get correct lxml index.
    """
    with zipfile.ZipFile(docx_path) as z:
        doc_xml = z.read('word/document.xml')
    root = etree.fromstring(doc_xml)
    all_paras = root.findall(f'.//{{{W}}}p')

    result = []
    for i, p in enumerate(all_paras):
        texts = [t.text for t in p.iter(f'{{{W}}}t') if t.text]
        text = ''.join(texts).strip()

        ppr = p.find(f'{{{W}}}pPr')
        style = ''
        if ppr is not None:
            pstyle = ppr.find(f'{{{W}}}pStyle')
            if pstyle is not None:
                style = pstyle.get(f'{{{W}}}val', '')

        result.append({'lxml_index': i, 'style': style, 'text': text})

    return result

Steps

Parse arguments — extract file path, mode, and chapter from $ARGUMENTS
Locate and validate the file — resolve the path, confirm it exists and is a valid .docx
Read all paragraphs — use get_paragraphs_with_lxml_index() to get text and lxml indices
Identify document structure — map paragraphs to front matter, chapters, and back matter based on heading styles and content
Run the appropriate review pass(es) based on --mode:
- chair mode: developmental feedback — argument quality, synthesis, chapter-specific requirements, template compliance
- copy mode: APA 7 mechanics, grammar, formatting, heading levels, citations
- full mode: both passes
Filter by chapter if --chapter is specified
Build the comment list — each comment has para_index, text, and category
Write the reviewed .docx using write_reviewed_docx()
Open in Word using open -a "Microsoft Word" "<path>"
Print terminal summary with comment counts by category

university EDLD Template Requirements

Front Matter (flag missing or misformatted sections)

Title page: title in Title Case (150 chars max), thesis/dissertation designation, department name, [University], degree title, author name, graduation month and year, committee chair and members with "Chair" designation
Plain language abstract required; no jargon; accessible to general public
Abstract word limits: 150 words for master's, 350 words for dissertations; no indent on first line
AI disclosure page required if generative AI was used; flag blank fields
Table of contents required
List of tables required if more than one table; list of figures required if more than one figure
VITA required at end

Heading Formatting (APA 7)

All headings: Title Case and bold
Level 1 (style Heading1): centered, bold, Title Case — front matter (ABSTRACT, dedication, etc.)
Level 2 (style Heading2): flush left, bold, Title Case — chapter titles ("Chapter 1. Introduction")
Level 3 (style Heading3): flush left, bold italic, Title Case
Level 4: inline, bold, Title Case, ends with period
Level 5: inline, bold italic, Title Case, ends with period

Paragraph Formatting

First sentence of each paragraph indented 0.5 inch
Double spacing throughout body
References use hanging indent

APA 7 Checks

In-text citations

Author-date format: (Smith, 2020) or Smith (2020)
Three or more authors: et al. from first citation
Direct quotes require page number: (Smith, 2020, p. 45)
No "ibid." in APA 7
Ampersand (&) inside parentheses only; "and" in running text
Every in-text citation must have a corresponding reference entry (and vice versa)

Common ELPA-specific issues

"the researcher" in qualitative is acceptable but flag if used more than once per paragraph
Theoretical/conceptual framework must be named and connected back to findings
Weak hedging in findings/conclusions: flag "it seems," "maybe," "perhaps"
"Data is" should be "data are"
3+ passive constructions in a row: flag in methodology
Numbers: spell out one through nine, numerals for 10 and above
Percent symbol (%) with numerals; "percent" spelled out with words
Oxford comma required
Abbreviations defined on first use

Chapter-Specific Feedback

Chapter 1 (Introduction)

Problem statement must be clear and specific
Purpose statement formula: "The purpose of this [quantitative/qualitative/mixed methods] study was to..."
Research questions must be numbered
Definitions of terms section required; operationalize all key constructs
Limitations and delimitations must be distinguished, not combined

Chapter 2 (Review of Literature)

Organized thematically, not chronologically or source-by-source
Theoretical/conceptual framework must be a named, discrete section
Synthesis required: flag paragraphs that only summarize sources sequentially
Balance of seminal and recent (within 10 years) literature; flag if predominantly old

Chapter 3 (Methodology)

Restate research design aligned with Chapter 1 purpose statement
Population and sample clearly distinguished; sampling strategy named
IRB approval must be mentioned; flag if absent
Quantitative: validity and reliability evidence required
Qualitative: address credibility, transferability, dependability, confirmability
Data collection sequential and replicable
Data analysis matched to research questions

Chapter 4 (Findings/Results)

Research questions addressed in order
Quantitative: exact statistics required (n, M, SD, p, effect size); flag missing
Qualitative: each theme needs multiple participant quotes; flag themes with one quote
Tables and figures referenced in text before they appear
No interpretation in findings; flag evaluative language

Chapter 5 (Discussion/Conclusions)

Open with brief summary of study purpose and design
Connect findings back to Chapter 2 literature
Implications for practice and policy as distinct sections
Recommendations for future research required
Flag overgeneralization beyond the study sample
Limitations restated in findings context

Comment Writing Style

Label the issue type at the start. Be specific and constructive.

Examples:

"APA 7: Three or more authors use et al. on first citation. Revise to (Jones et al., 2019)."
"Ch. 3: IRB approval is not mentioned. Add a statement confirming approval was obtained prior to data collection."
"Synthesis needed: This paragraph summarizes sources sequentially. Revise to synthesize across sources around a central argument."
"Heading: Level 2 headings should be flush left, bold, Title Case. Revise accordingly."
"Purpose statement: Restate the purpose statement here to open Chapter 3, using the same language as Chapter 1."
"Template artifact: This section still contains placeholder text and has not been written."
"Typo: 'standardsized' should be 'standardized'."
"Strength: This is a well-constructed argument that effectively synthesizes multiple sources around a central claim."

Comment Categories for Summary

Track each comment under one of these categories:

APA 7 mechanics — citations, references, numbers, abbreviations
Heading/formatting — heading levels, Title Case, paragraph formatting
Chapter-specific — content issues tied to chapter requirements above
Writing quality — grammar, typos, passive voice, hedging, clarity
Strengths — positive reinforcement of good practice

Rules

Author name on all comments: "$AUTHOR_NAME comment"
Date on comments: use current datetime in ISO format
Be constructive and specific — every critique should include direction on how to improve
Start with strengths before addressing weaknesses within each chapter
Do NOT rewrite the student's work in comments — point them toward what to fix and why
Do NOT make direct edits to the text — only insert comments
Search both paragraphs AND table cells for content
If a paragraph has multiple issues, prefer one consolidated comment over multiple comments on the same paragraph
Ensure lxml is installed before running (pip3 install lxml if needed)

Output

Save as <original_filename>_reviewed_YYYY-MM-DD.docx in the same directory as input (where YYYY-MM-DD is today's date).

Print terminal summary:

Dissertator review complete.
File saved: <filename>_reviewed_2026-02-19.docx
Mode: full | Chapter: all
Comments added: 24
  APA 7 mechanics:     8
  Heading/formatting:  4
  Chapter-specific:    9
  Writing quality:     2
  Strengths:           1