Research project page

MemSlides: A Hierarchical Memory Driven Agent Framework for Personalized Slide Generation with Multi-turn Local Revision

MemSlides separates user profile memory, working memory, and tool memory so personalized slide agents can preserve stable preferences, carry revision constraints across rounds, and execute local edits more reliably.

Paper Code Demo Website Watch Video View Framework See Results

MemSlides demonstration video showing profile memory, working memory, and localized revision in the slide-generation workflow.

Profile-Aware Generation

Same prompt. Same paper. Twenty-four profile-shaped slides.

MemSlides uses the same Attention Is All You Need source and the same generation request, then lets each user profile reshape the narrative, emphasis, and visual language.

24 generated slides

Product01 / Decision frame

Same prompt, product strategy profile

Product02 / Parallelization claim

Same prompt, product strategy profile

Product03 / Training cost

Same prompt, product strategy profile

Product04 / Quality evidence

Same prompt, product strategy profile

Product05 / Adoption tradeoff

Same prompt, product strategy profile

Product06 / Risk boundary

Same prompt, product strategy profile

Product07 / Recommendation

Same prompt, product strategy profile

Product08 / Executive summary

Same prompt, product strategy profile

Research01 / Learning target

Same prompt, research seminar profile

Research02 / Architecture map

Same prompt, research seminar profile

Research03 / Attention mechanism

Same prompt, research seminar profile

Research04 / Training setup

Same prompt, research seminar profile

Research05 / Evidence table

Same prompt, research seminar profile

Research06 / Representation view

Same prompt, research seminar profile

Research07 / Boundary and cost

Same prompt, research seminar profile

Research08 / Research takeaway

Same prompt, research seminar profile

Visual01 / Core idea

Same prompt, visual teaching profile

Visual02 / Token interaction

Same prompt, visual teaching profile

Visual03 / Multi-head attention

Same prompt, visual teaching profile

Visual04 / Encoder decoder

Same prompt, visual teaching profile

Visual05 / Positional encoding

Same prompt, visual teaching profile

Visual06 / Training signal

Same prompt, visual teaching profile

Visual07 / Check question

Same prompt, visual teaching profile

Visual08 / Lesson recap

Same prompt, visual teaching profile

Abstract

Personalization in slide authoring requires memory at more than one lifetime.

The study evaluates first-pass personalization, multi-turn revision behavior, and localized modify reliability through controlled comparisons and qualitative cases.

Personalized presentation generation requires more than conditioning on a current prompt or template. Agents must preserve stable user preferences across tasks, retain newly introduced constraints during multi-turn revision, and carry out local edits reliably. MemSlides addresses this with a hierarchical memory design that separates long-term memory from working memory, and further divides long-term memory into user profile memory and tool memory. User profile memory supports round-0 personalization, working memory keeps active preferences and session constraints available across revision rounds, and tool memory stores reusable execution experience for localized editing. Scoped slide-local revision then applies updates to the smallest affected region rather than repeatedly regenerating the full deck.

Motivation

Why personalized slide agents need profile memory, working memory, and local edit control.

Personalized decks mix long-lived stylistic preferences, session-local revision constraints, and tool-level editing experience. Treating all of them as one flat prompt makes first-pass generation weaker and localized revision less reliable.

Round-0 personalization

Profiles should shape the first draft.

User profile memory stores intent-conditioned preferences across theme, content, visual, layout, template, and general dimensions, then routes compatible items into the active job.

Multi-turn carryover

Feedback often reveals preferences late.

Working memory keeps temporary deck-local preferences alive across later rounds, so a style or constraint introduced once can guide subsequent edits and inserted slides.

Localized editing

Small requests should not rewrite the deck.

MemSlides resolves the target surface, applies scoped patches, and verifies coverage before finalize, reducing drift in content that was already aligned.

Framework

Two memory scales connect persistent personalization with session-local revision.

The paper separates memory first by lifetime. Long-term memory persists across jobs and contains user profile memory plus tool memory; working memory is the current-job state that generation and revision actually read.

Scale 1 Persistent across jobs

Long-term memory

Stores reusable signals that should survive beyond one deck authoring session.

User Profile Memory Intent-aware presentation preferences: theme, content, visual, layout, template, and general habits.

Tool Memory Reusable execution experience from tool calls, errors, successful local edits, and verification outcomes.

route / retrieve into

Scale 2 Active within the current job

Working memory

Holds the active subset used by the agent now: routed profile items, temporary feedback preferences, edit state, carryover instructions, and repair focus for the current deck.

Overview of MemSlides hierarchical memory workflow — The overview figure follows the same two-scale structure: long-term memory contains user profile and tool memory, while working memory carries the active state into generation and Modify Exec.

Tool memory granularity

Two reuse granularities for edit execution.

Round-scope task experience Job-level lessons buffered through working memory and updated after modify rounds.

Operation-scope tool chain Reasoning, tool-call, and observation fragments retrieved before similar operations.

Plan Act Guard

Localized modify execution makes revision a checked state.

MemSlides converts feedback into an execution contract, chooses the smallest effective editing surface, and blocks premature completion when required coverage has not been verified.

Localized modify execution pipeline — Working memory supplies active preferences and edit state to a Plan-Act-Guard loop that scopes, edits, and verifies localized changes.

01 Plan

Build an execution contract.

Infer scope, target slides, active rules, selector hints, and whether coverage is required before editing.

02 Act

Edit the bounded surface.

Prefer batch CSS, semantic styling, or local patches over whole-deck regeneration; reserve full rewrite for new slides or controlled recovery.

03 Guard

Verify before finalize.

Use inspection, coverage checks, snapshot hashes, and repair focus to stop the agent from declaring completion too early.

Evidence

Profile memory improves persona alignment while tool memory improves edit reliability.

The evaluation covers first-pass personalization, ordinary deck quality, and diagnostic matched-pair modify behavior.

+2.73 Average content gain over SlideTailor in the shared multi-profile comparison suite.

0.963 Closed-loop completion in modify pairs after tool-memory injection.

2 scales Persistent profile and tool memory route into working memory for the active deck session.

Question	Signal	Selected result
Does profile memory help?	Persona-alignment judgments	All-column wins over both baselines on GLM-5 and Gemini 3.1 Pro; strongest averaged gains over SlideTailor are Content +2.73, Structure +2.95, Visual +2.79, Specificity +3.08.
Does quality remain competitive?	DeepPresenter-style quality metrics	GPT-5 Avg. 4.17 for MemSlides, best among compared systems in the shared three-profile suite.
Does tool memory improve editing?	Diagnostic modify pairs	0.963 closed-loop completion and 0.534 strict verify with memory injection; first correct edit reduced from 609.5s to 242.5s overall.

Qualitative Cases

Qualitative evidence highlights both memory behavior and localized revision behavior.

These cases show how the system consolidates reusable preferences, preserves session-local constraints, and updates only the slide regions that actually need change.

Memory behavior

Profile extraction, consolidation, and working-memory carryover determine what the agent should remember across jobs and across rounds.

Profile lifecycle

Intent-conditioned profile memory is selected, reconciled with the current request, used as active memory, and consolidated after the job.

Cross-job consolidation

Repeated local feedback cues become reusable organization patterns for later jobs.

Localized revision behavior

These examples show how MemSlides preserves already aligned material while applying small targeted updates or template-constrained edits.

Localized modify comparison

Localized modification

Targeted patches preserve already aligned content while satisfying the requested local change.

Template generation showcase

Template-aware generation

Task-time templates act as deck-local design constraints rather than generic profile preferences.

Implementation Note

The codebase mirrors the paper's memory and revision decomposition without becoming the page's main story.

The MemSlides source tree separates generation, revision, runtime state, templates, tools, web sessions, and memory stores so the research concepts stay inspectable in engineering terms.

pipelines/

Generation and revision

Coordinates Researcher, DeckDesigner, export, profile execution plans, and localized revision loops.

memory/

Profile and tool memory

Stores user profiles, atomic preferences, episodes, tool chains, routing, extraction, and consolidation.

runtime/

Session state

Tracks deck execution state, page contracts, progress prompts, and local workflow context.

tools/

Guarded slide operations

Implements write, inspect, patch, template, document, asset, memory, and verifier tools.