Research project page

MemSlides: A Hierarchical Memory Driven Agent Framework for Personalized Slide Generation with Multi-turn Local Revision

MemSlides separates user profile memory, working memory, and tool memory so personalized slide agents can preserve stable preferences, carry revision constraints across rounds, and execute local edits more reliably.

MemSlides demonstration video showing profile memory, working memory, and localized revision in the slide-generation workflow.
Profile-Aware Generation

Same prompt. Same paper. Twenty-four profile-shaped slides.

MemSlides uses the same Attention Is All You Need source and the same generation request, then lets each user profile reshape the narrative, emphasis, and visual language.

24 generated slides
Abstract

Personalization in slide authoring requires memory at more than one lifetime.

The study evaluates first-pass personalization, multi-turn revision behavior, and localized modify reliability through controlled comparisons and qualitative cases.

Personalized presentation generation requires more than conditioning on a current prompt or template. Agents must preserve stable user preferences across tasks, retain newly introduced constraints during multi-turn revision, and carry out local edits reliably. MemSlides addresses this with a hierarchical memory design that separates long-term memory from working memory, and further divides long-term memory into user profile memory and tool memory. User profile memory supports round-0 personalization, working memory keeps active preferences and session constraints available across revision rounds, and tool memory stores reusable execution experience for localized editing. Scoped slide-local revision then applies updates to the smallest affected region rather than repeatedly regenerating the full deck.

Motivation

Why personalized slide agents need profile memory, working memory, and local edit control.

Personalized decks mix long-lived stylistic preferences, session-local revision constraints, and tool-level editing experience. Treating all of them as one flat prompt makes first-pass generation weaker and localized revision less reliable.

Round-0 personalization

Profiles should shape the first draft.

User profile memory stores intent-conditioned preferences across theme, content, visual, layout, template, and general dimensions, then routes compatible items into the active job.

Multi-turn carryover

Feedback often reveals preferences late.

Working memory keeps temporary deck-local preferences alive across later rounds, so a style or constraint introduced once can guide subsequent edits and inserted slides.

Localized editing

Small requests should not rewrite the deck.

MemSlides resolves the target surface, applies scoped patches, and verifies coverage before finalize, reducing drift in content that was already aligned.

Framework

Two memory scales connect persistent personalization with session-local revision.

The paper separates memory first by lifetime. Long-term memory persists across jobs and contains user profile memory plus tool memory; working memory is the current-job state that generation and revision actually read.

Scale 1 Persistent across jobs

Long-term memory

Stores reusable signals that should survive beyond one deck authoring session.

User Profile Memory Intent-aware presentation preferences: theme, content, visual, layout, template, and general habits.
Tool Memory Reusable execution experience from tool calls, errors, successful local edits, and verification outcomes.
route / retrieve into
Scale 2 Active within the current job

Working memory

Holds the active subset used by the agent now: routed profile items, temporary feedback preferences, edit state, carryover instructions, and repair focus for the current deck.

Overview of MemSlides hierarchical memory workflow
The overview figure follows the same two-scale structure: long-term memory contains user profile and tool memory, while working memory carries the active state into generation and Modify Exec.
Tool memory granularity

Two reuse granularities for edit execution.

Round-scope task experience Job-level lessons buffered through working memory and updated after modify rounds.
Operation-scope tool chain Reasoning, tool-call, and observation fragments retrieved before similar operations.
Plan Act Guard

Localized modify execution makes revision a checked state.

MemSlides converts feedback into an execution contract, chooses the smallest effective editing surface, and blocks premature completion when required coverage has not been verified.

Localized modify execution pipeline
Working memory supplies active preferences and edit state to a Plan-Act-Guard loop that scopes, edits, and verifies localized changes.
01 Plan

Build an execution contract.

Infer scope, target slides, active rules, selector hints, and whether coverage is required before editing.

02 Act

Edit the bounded surface.

Prefer batch CSS, semantic styling, or local patches over whole-deck regeneration; reserve full rewrite for new slides or controlled recovery.

03 Guard

Verify before finalize.

Use inspection, coverage checks, snapshot hashes, and repair focus to stop the agent from declaring completion too early.

Evidence

Profile memory improves persona alignment while tool memory improves edit reliability.

The evaluation covers first-pass personalization, ordinary deck quality, and diagnostic matched-pair modify behavior.

+2.73 Average content gain over SlideTailor in the shared multi-profile comparison suite.
0.963 Closed-loop completion in modify pairs after tool-memory injection.
2 scales Persistent profile and tool memory route into working memory for the active deck session.
Question Signal Selected result
Does profile memory help? Persona-alignment judgments All-column wins over both baselines on GLM-5 and Gemini 3.1 Pro; strongest averaged gains over SlideTailor are Content +2.73, Structure +2.95, Visual +2.79, Specificity +3.08.
Does quality remain competitive? DeepPresenter-style quality metrics GPT-5 Avg. 4.17 for MemSlides, best among compared systems in the shared three-profile suite.
Does tool memory improve editing? Diagnostic modify pairs 0.963 closed-loop completion and 0.534 strict verify with memory injection; first correct edit reduced from 609.5s to 242.5s overall.
Implementation Note

The codebase mirrors the paper's memory and revision decomposition without becoming the page's main story.

The MemSlides source tree separates generation, revision, runtime state, templates, tools, web sessions, and memory stores so the research concepts stay inspectable in engineering terms.

pipelines/

Generation and revision

Coordinates Researcher, DeckDesigner, export, profile execution plans, and localized revision loops.

memory/

Profile and tool memory

Stores user profiles, atomic preferences, episodes, tool chains, routing, extraction, and consolidation.

runtime/

Session state

Tracks deck execution state, page contracts, progress prompts, and local workflow context.

tools/

Guarded slide operations

Implements write, inspect, patch, template, document, asset, memory, and verifier tools.