OmniMem is the Tuya in-house AI memory system designed for multi-device IoT scenarios. It addresses key engineering challenges in long-term memory management for AI agents: cross-session memory loss, fragmented memory across devices, high recall latency, and insufficient accuracy.
The system supports a dual-channel architecture for short-term memory (session level) and long-term memory (user level). It delivers millisecond-scale recall latency and more than 75% memory accuracy (Pro edition). Through a unified memory layer, OmniMem enables seamless memory sharing across multiple agents and devices.
OmniMem stands out in three aspects:

For developers building persistent AI memory capabilities in multi-device IoT scenarios, OmniMem provides a complete solution from out-of-the-box deployment to advanced customization.
| Scenario | Memory capability | Key technologies |
|---|---|---|
| Companion toys | • Long-term preference memory • Growth records • Personalized conversations |
• Long-term memory • Emotional weighting |
| Smart home | • Device state memory • User habit learning • Scene linkage |
• Cross-agent memory graph • Real-time updates |
| Travel assistant | • Cross-timeframe context integration • Travel preference accumulation |
• Temporal parsing • Long-term memory |
| Personalized recommendations | • Dynamic user profiles • Preference change tracking |
• Full-channel memory • Dynamic updates |
| Direction | Current status | Planned goal |
|---|---|---|
| Memory emotional weight model | Completed | Weight memories by emotional intensity to optimize recall ranking and natural expression. |
| Seamless cross-device migration | Planned | Package memory data → unbind device identity → sync to new device, with voiceprint-associated migration. |
| Multimodal memory fusion | In development | Map cross-modal semantics to associate image and video frame features with text memories. |
| Memory security and compliance | Ongoing iteration | Support device-side encrypted storage, memory data ownership, and GDPR-compliant data deletion. |
OmniMem uses a five-layer pipeline: Input Layer → Preprocessing Layer → Processing Layer → Management Layer → Storage Layer.
┌─────────────────────────────────────────────────────────────────┐
│ Input Layer │
│ Multimodal data collection (voice, text, events) │
└──────────────────────────────┬──────────────────────────────────┘
▼
┌─────────────────────────────────────────────────────────────────┐
│ Preprocessing Layer │
│ Data cleansing → PII filtering → Interference signal filtering │
└──────────────────────────────┬──────────────────────────────────┘
▼
┌────────────────────────┬─────┴─────┬────────────────────────────┐
│ Short-term Memory │ │ Long-term Memory │
│ │ │ │
│ Semantic extraction │ │ Entity extraction │
│ → Summarization │ │ → Relationship mapping │
│ → Lightweight summary │ │→ Standardized transcription│
│ │ │ → Weight tagging │
└────────────────────────┴─────┬─────┴────────────────────────────┘
▼
┌─────────────────────────────────────────────────────────────────┐
│ Management Layer │
│ Real-time incremental updates + Offline optimization │
│ + Periodic memory decay │
└──────────────────────────────┬──────────────────────────────────┘
▼
┌─────────────────────────────────────────────────────────────────┐
│ Storage Layer │
│ Structured memory graph with multidimensional storage │
│ (entity, relation, attribute) │
└─────────────────────────────────────────────────────────────────┘
| Layer | Responsibilities | Key technologies |
|---|---|---|
| Input layer | Collects and standardizes data from multiple sources. | • Multi-protocol adaptation • Streaming data ingestion |
| Preprocessing layer | Ensures data quality and compliance filtering. | • PII detection and masking • Noise filtering algorithm |
| Processing layer | Performs semantic understanding and structured memory extraction. | • Named entity recognition (NER) • Relationship graph construction • Sentiment analysis |
| Management layer | Manages the memory lifecycle. | • Incremental updates • Temporal decay • Conflict resolution |
| Storage layer | Stores and retrieves memory efficiently. | • Vector indexing • Graph database • Hierarchical caching |
Based on evaluations using public datasets, the comparison between OmniMem and mainstream open-source memory solutions is as follows:
| Metric | OmniMem Pro | OmniMem Standard | Mem0 | MemGPT |
|---|---|---|---|---|
| Memory accuracy | > 75% | > 65% | ~55% | ~50% |
| Long-term memory recall latency | < 100 ms | < 100 ms | Seconds | Seconds |
| Memory dimension | Dual-channel (short-term and long-term) | Dual-channel (short-term and long-term) | Single-channel | Single-channel |
| Cross-agent memory sharing | Native support | Native support | Not supported | Not supported |
| Dynamic update mechanism | Real-time incremental updates and offline integration | Real-time incremental updates | Full overwrite | Full overwrite |
How temporal information is handled directly affects memory validity. OmniMem implements a time semantic parsing engine that covers complex scenarios:
last week, National Day, and early next month.OmniMem prevents task execution results from polluting long-term memory. For example, after the user issues the command "Set the air conditioner to 26°C", the successful execution of that command should not be retained as long-term memory or influence future interactions.
Technical solution:
To eliminate memory silos across multiple agents, OmniMem builds a unified structured memory graph.
┌───────────┐ ┌────────────┐ ┌─────────────┐
│ Agent A │ │ Agent B │ │ Agent C │
│ (Toy) │ │ (Speaker) │ │ (Appliance) │
└─────┬─────┘ └──────┬─────┘ └──────┬──────┘
│ │ │
└──────────────────┼──────────────────┘
▼
┌──────────────────────────────┐
│ Unified Memory Graph │
│ │
│ User Entity ←→ Preference │
│ ↕ ↕ │
│ Event Node ←→ Context Node │
│ ↕ ↕ │
│ Device State ←→ Time Axis │
└──────────────────────────────┘
OmniMem combines real-time responsiveness with long-term consistency through a hybrid update strategy.
| Mode | Trigger | Processing logic | Use cases |
|---|---|---|---|
| Real-time incremental updates | During conversation | • Hot path writing • Asynchronous indexing |
• Immediate preference change • New event recording |
| Offline optimization and integration | Scheduled tasks or off-peak periods | • Redundancy merging • Expired memory cleanup • Graph optimization |
• Memory store compaction • Global consistency maintenance |
On the Tuya AI Developer Platform, enable the agent memory capabilities through configuration without additional development.
Log in and go to My Agent, then open the agent development page.

Go to Model Configuration > Memory > Long-term Memory, and enable the following memory capabilities based on your scenario:
OmniMem automatically collects, processes, stores, and recalls memories. You do not need to implement the underlying pipeline.
Is this page helpful?
YesFeedbackIs this page helpful?
YesFeedback