Trust Your Memory: Verifiable Control of Smart Homes through Reinforcement Learning with Multi-dimensional Rewards
arXiv cs.AI / 4/14/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that while LLM-based smart home assistants can handle real-time device control, reliably performing memory-driven device control remains difficult to evaluate and optimize.
- It identifies limitations of existing benchmarks (they typically test either immediate control or generic memory retrieval) as well as RL training methods that provide only outcome-based supervision.
- The authors propose using reinforcement learning with multi-dimensional rewards to deliver more intermediate feedback for fine-grained memory operations such as add/update/delete/use.
- To support this, they release two resources: MemHomeLife, created from anonymized real-world long-term user interaction logs, and MemHome, a benchmark specifically for systematic evaluation of memory-driven device control.
- The work targets better assessment and training of memory management behaviors in smart home scenarios, aiming to reduce local failures and improve overall fine-grained performance.


