Research · February 2026

Conversations shouldn't be a single thread

We introduce the Conversation Tree Architecture - a hierarchical framework that organizes LLM conversations into isolated, context-aware nodes with principled mechanisms for branching, merging, and flow.

Pranav Hemanth · Sampriti Saha

Definitions

Conversational Unit

The atomic element of a conversation - a human prompt paired with a model response. The minimal unit of meaning from which all higher-order structures are assembled.
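As a data type, a conversational unit can be sketched minimally; the field names below are illustrative, not part of the proposal:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ConversationalUnit:
    """One human prompt paired with one model response."""
    prompt: str
    response: str

# A node's context window is then an ordered sequence of these units.
unit = ConversationalUnit(prompt="Define a branch.",
                          response="A deliberate split from an existing conversation.")
```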

Branch

A deliberate split from an existing conversation, initiated by the user to explore an alternative direction without contaminating the parent context.

Downstream Context Flow

The selective passage of information from a parent node to a newly created child node at the moment of branching.

Upstream Context Merge

When a child node is deleted, relevant content from its context window is incorporated back into the parent.

Cross-Node Context Passing

The broader class of operations by which context migrates across the tree - encompassing upstream, downstream, and any lateral transfer between sibling or cousin nodes.

Volatile Node

A transient branch that exists only for the duration of a session. At session end, its context window is either merged upstream or purged entirely.

The Problem

The fundamental limitation of current LLM conversation interfaces lies in their treatment of context as a single, shared, append-only resource. When a user wishes to explore a tangential question, debug an edge case, or pursue an alternative framing of a problem, they are forced to do so within the same context window that governs the main thread. Over time, this produces conversations that are topically diffuse, difficult to navigate, and increasingly prone to model responses that fail to reflect the user's true intent.

We characterize this failure mode as logical context poisoning: the progressive corruption of conversational coherence caused not by erroneous model outputs per se, but by the structural mismanagement of context.

[Interactive figure: context quality over the course of a conversation, comparing a linear conversation (current) with a conversation tree (proposed).]

The Architecture

The Conversation Tree Architecture organizes LLM conversations as hierarchical tree structures in which each node is a self-contained conversational unit with its own isolated context window, and edges represent structured context-passing relationships between nodes.
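A minimal sketch of the node-and-edge model, assuming Python; the names `Node` and `branch` are ours, and the real architecture would define richer invariants:

```python
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class Node:
    """A self-contained conversational node with its own isolated context window."""
    context: list                       # ordered (prompt, response) pairs
    parent: Optional["Node"] = None
    children: list = field(default_factory=list)
    volatile: bool = False

def branch(parent: Node, seed_context: list, volatile: bool = False) -> Node:
    """Downstream context flow: the child starts from a selected slice
    of the parent's context rather than inheriting all of it."""
    child = Node(context=list(seed_context), parent=parent, volatile=volatile)
    parent.children.append(child)
    return child

root = Node(context=[("design the API", "sketch v1"), ("name the modules", "tree, flow")])
child = branch(root, root.context[-1:])   # pass only the most recent unit downstream
```

Because each node copies its seed context, edits in the child never leak back into the parent window; isolation is structural, not a convention.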

[Interactive figure: an example conversation tree with normal threads and volatile branches.]

Context Flow

Intelligent downstream flow is essential to avoid pre-loading a branch with irrelevant parent context that would itself become a source of poisoning. Upstream context merging is the inverse operation: when a child node is deleted, relevant content from its context window is incorporated back into the parent.

Downstream Context Flow

Selective passage of information from a parent to a child at branch time. What is relevant? Full context, summary, or selected units?

[Diagram: parent node context flowing selectively into a new child node.]
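One possible answer to the selection question is keyword-based relevance filtering over the parent's units; this is an illustrative placeholder (the function name and scoring rule are ours), since the actual selection mechanism is an open question of this proposal:

```python
def downstream_select(parent_context, topic_keywords, max_units=4):
    """Pick parent (prompt, response) units that mention the branch topic,
    preferring recent units, capped so the child starts with a small window."""
    picked = []
    for unit in reversed(parent_context):           # newest first
        text = " ".join(unit).lower()
        if any(k.lower() in text for k in topic_keywords):
            picked.append(unit)
        if len(picked) == max_units:
            break
    return list(reversed(picked))                   # restore chronological order

parent = [
    ("plan the schema", "tables: users, posts"),
    ("what about auth?", "use OAuth"),
    ("index the posts table", "add a B-tree index"),
]
seed = downstream_select(parent, ["schema", "index"])
```

Summarization or full-context inheritance would slot in at the same point: the policy is a pluggable function from parent context to seed context.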

Upstream Context Merge

Distillation of a child node's contents back into the parent when deleted. Where does it insert? At what granularity?

[Diagram: a deleting child node whose distilled content merges back into the parent.]
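A hedged sketch of the merge operation, using naive prompt concatenation in place of real distillation; the condensation step itself is exactly what the research leaves open:

```python
def upstream_merge(parent_context, child_context, position="append"):
    """Fold a deleted child's context back into the parent as one condensed unit."""
    # Placeholder distillation: join the child's prompts and keep its final
    # response. A real system would apply relevance filtering or an LLM summary.
    summary_prompt = "[merged branch] " + "; ".join(p for p, _ in child_context)
    summary_response = child_context[-1][1] if child_context else ""
    merged_unit = (summary_prompt, summary_response)
    if position == "append":
        return parent_context + [merged_unit]
    # A "chronological" policy would instead insert at the original branch point.
    return [merged_unit] + parent_context

parent = [("design", "v1")]
child = [("debug case A", "fixed"), ("debug case B", "also fixed")]
merged = upstream_merge(parent, child)
```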

Volatile Nodes

Volatile nodes are a distinct class of branch nodes defined by their transient lifecycle. A volatile node exists only for the duration of a session and carries no persistent state.

When a volatile node is deleted or its session ends, its local context window is either merged upstream into the parent according to the upstream context merging rules or purged entirely, in which case its content is lost. Volatile nodes are particularly useful for exploratory or experimental conversational threads.

[Diagram: a volatile branch attached to its parent node.]
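The volatile lifecycle reduces to a single decision at session end. A sketch of that decision point, with a placeholder one-line condensation standing in for a real merge:

```python
def end_session(parent_context, volatile_context, keep=False):
    """At session end, a volatile node is merged upstream or purged entirely."""
    if keep:
        # Merge upstream: condense the volatile window into one parent unit.
        summary = ("[volatile merge] " + "; ".join(p for p, _ in volatile_context),
                   volatile_context[-1][1] if volatile_context else "")
        return parent_context + [summary]
    return list(parent_context)   # purge: the volatile content is simply lost

parent = [("main thread", "ok")]
scratch = [("try idea X", "it fails")]
```

Either way the parent's window stays clean: the dead end never entered it, and only a deliberate `keep` lets a distilled trace of the experiment survive.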

Significance

The problems addressed by this proposal are not merely theoretical inconveniences. Context poisoning is a real and recurring phenomenon that degrades the utility of LLM tools precisely in the high-stakes, high-complexity scenarios where they are most needed.

01 The Researcher

Maintains a primary analytical thread while spinning off volatile branches to explore citations, definitions, or counterarguments.

02 The Software Engineer

Maintains a high-level design conversation while branching into debugging sessions that are later summarized and merged back.

03 The Writer

Preserves a narrative planning thread while experimenting with alternative scenes or tones in isolated branches.

Beyond individual utility, the architecture provides a conceptual framework for future work on multi-agent LLM systems, where context management across multiple cooperating agents is an open and pressing research problem. The node-and-flow abstraction generalizes naturally.

Research Goals

01

Formalize the Architecture

Define the computational model — data structures, operations, and invariants governing node creation, context flow, and deletion.

02

Intelligent Downstream Passing

Develop algorithms for selecting and compressing parent context prior to branching, maximizing child performance while minimizing irrelevant carryover.

03

Upstream Merge Strategies

Investigate tradeoffs among condensation granularity, relevance filtering, and insertion position (chronologically at the branch point vs. appended at the end of the parent context).

04

Empirical Evaluation

Compare the proposed architecture against baseline linear interfaces, measuring output quality, task completion, and user-reported coherence.

Timeline

Phase 1

  • Formalize theoretical model
  • Implement prototype interface
  • Evaluate downstream context passing on benchmarks

Phase 2

  • Upstream merge algorithm development
  • Comprehensive empirical evaluation
  • Prepare thesis document
  • Research presentation
