可持久化线段树

可持久化线段树（又称函数式线段树）是一种可持久化数据结构（英语：Persistent data structure）。这种数据结构在普通线段树的基础之上支持查询某个历史版本，同时时间复杂度与线段树是同级，空间复杂度相较而言更高。^[1]^[2]在中国信息学奥林匹克竞赛中，由于引入者黄嘉泰姓名的缩写与前中共中央总书记、国家主席胡锦涛(H.J.T.)相同，因此这种数据结构也可被称为总书记树或主席树。^{[来源请求]}

原理

与大部分可持久化数据结构相似，可持久化线段树在更新时尽可能与之前某一个旧版本共用一部分结点，从而节省空间。

例如，有一颗维护着区间和的线段树，现在以这颗线段树作为基础，将下标为 $5$ 的元素数值减去一，同时另存为一个新的版本。按照一般线段树中使用的思路，当位于根结点时我们应该递归操作它的右子树，如果是暴力实现的可持久化线段树，那么则需要再次递归将一整颗左子树复制一遍然后保存指针，但是复制的这一整颗子树是完全一模一样的，因此可以先只复制一个根结点，然后将它的左子树直接指向原先版本根结点的左子树，代表上一个版本和这一个新版本这颗子树保存的信息是完全一样的，然后再按照相似的方法，递归地处理下标 $5$ 存在的右子树。

性能分析

静态可持久化线段树在每次更新版本时，总是最大程度上减少结点的复制，这样不仅减少了时间的开销，更避免了不必要的空间浪费。与线段树相同，可持久化线段树由于在每一次更新版本时没有访问到不必要的结点，所以每一次查询和修改（即建立一个新的版本）时间复杂度为 $O(\log N)$ , 在这个过程中，会同时建立 $O(\log N)$ 个新的结点。如果总操作数量为 $m$ , 那么可持久化线段树可以以 $O(N+M\log N)$ 的时空复杂度解决问题。

实践

静态区间第k大数值

这类问题需要求解在一个长度为 $n$ 的数列中，第 $i$ 个数为 $a_{i}$ . 现在给定一些形如 $(l,r,k)$ 的三元组作为询问，设计程序计算数列第 $l~r$ 这些元素中出现次数排在第 $k$ 位的是多少。

利用可持久化线段树，维护区间 $(l,r)$ 代表区间 $[l,r]$ 中的元素出现了多少次，以此作为原始版本 $S_{0}$ ，此后每次建立一个新版本 $S_{i}$ ，代表去掉原数列中 $a_{0}~a_{i-1}$ 的元素之后建立的线段树，维护目标与上述相同。具体过程可以每次将 $a_{i}$ 的出现次数减一，并保存此时生成的新版本。

参考程序

C++

#include<bits/stdc++.h>

constexpr auto MAXN = (int)2e5 + 500;

std::map<int, int> val, mp;

struct Node {
	int fr, to, sum;
	Node *lft, *rgt;
	Node& Copy(Node *targ) {
		fr = targ->fr; to = targ->to; sum = targ->sum;
		lft = targ->lft; rgt = targ->rgt;
		return *this;
	}
};
Node* NewNode() {
	Node* ret = (Node*)malloc(sizeof(Node));
	ret->lft = ret->rgt = nullptr;
	return ret;
}
std::vector<Node*> version;

int num[MAXN], cnt[MAXN], orig[MAXN], fr[MAXN], to[MAXN], rank[MAXN];
std::queue<Node*> que, add;

signed main(void)
{
	int totNums, totQuery, cnt;

	//Read
	scanf("%d%d", &totNums, &totQuery);
	for (int i = 0; i < totNums; i++)scanf("%d", num + i);
	for (int i = 0; i < totQuery; i++)scanf("%d%d%d", fr + i, to + i, rank + i);

	for (int i = 0; i < totNums; i++) orig[i] = num[i];
	std::sort(num, num + totNums);
	cnt = 0; int tmp;
	mp[val[0] = *num] = cnt++;
	for (int i = 1; i < totNums; i++)
		if (num[i] != num[i - 1])
			tmp = cnt,mp[val[tmp] = num[i]] = cnt++;	
	for (int i = 0; i < totNums; i++)::cnt[num[i] = mp[orig[i]]]++;

	//Build
	Node *a, *b, *t;
	for (int i = 0; i < cnt; i++) {
		t = NewNode(); t->fr = t->to = i;
		t->sum = ::cnt[i];
		que.push(t);
	}
	for (; que.size() >= 2; std::swap(que, add)) {
		while (que.size() >= 2) {
			a = que.front(); que.pop(); b = que.front(); que.pop();
			t = NewNode(); t->fr = a->fr; t->to = b->to; t->sum = a->sum + b->sum;
			t->lft = a; t->rgt = b;
			add.push(t);
		}
		if (!que.empty()) { add.push(que.front()); que.pop(); }
	}version.push_back(que.front());
	//New Versions
	for (int del = 0; del < totNums; del++) {
		const int &target = num[del];
		t = a = NewNode(); b = version.back();
		while (true) {
			a->Copy(b); a->sum--;
			if (a->fr == target && a->to == target)break;
			if (a->lft->fr <= target && target <= a->lft->to) {
				a->lft = NewNode(); a = a->lft; b = b->lft;
			} else {
				a->rgt = NewNode(); a = a->rgt; b = b->rgt;
			}
		}version.push_back(t);
	}

	//Query
	int rnk;
	for (int i = 0; i < totQuery; i++)
	{
		a = version[--fr[i]]; b = version[to[i]]; rnk = rank[i];
		while (true) {
			if (a->lft == nullptr) { printf("%d\n", val[a->fr]); break; }

			if (a->lft->sum - b->lft->sum >= rnk) {
				a = a->lft; b = b->lft;
			}
			else {
				rnk -= a->lft->sum - b->lft->sum;
				a = a->rgt; b = b->rgt;
			}
		}
	}
	
	//system("pause");
	return 0;
}

参见

参考文献

^ 李煜东. 算法竞赛进阶指南. 中原出版传媒集团·河南电子音像出版社. 2018-1: P255–257. ISBN 978-7-83009-313-6. 请检查|date=中的日期值 (帮助)
^ Antti Laaksonen. Guide to Competitive Programming: Learning and Improving Algorithms Through Contests 1st edition. Springer. 2017: P250–251. ISBN 978-3-319-72546-8 （英语）.

[lyd-1] 李煜东. 算法竞赛进阶指南. 中原出版传媒集团·河南电子音像出版社. 2018-1: P255–257. ISBN 978-7-83009-313-6. 请检查|date=中的日期值 (帮助)

[antti-2] Antti Laaksonen. Guide to Competitive Programming: Learning and Improving Algorithms Through Contests 1st edition. Springer. 2017: P250–251. ISBN 978-3-319-72546-8 （英语）.

[1]

[2]

查论编计算机科学中的树
二叉树	二叉查找树笛卡尔树 MVP树 Top tree（英语：Top tree） T树线索二叉树
自平衡二叉查找树	AA树 AVL树左倾红黑树红黑树替罪羊树伸展树树堆加权平衡树
B树	B+树 B*树 B^x树 UB树 2-3树 2-3-4树 (a,b)-树（英语：(a,b)-tree）跳舞树（英语：Dancing tree） H树
堆	二叉堆二项堆斐波那契堆左偏树配对堆斜堆范恩德蟒蛇树（英语：Van Emde Boas tree）
Trie	后缀树基数树三叉查找树 X-快速前缀树 Y-快速前缀树 AC自动机
二叉空间分割（BSP）树	四叉树八叉树 k-d树隐式k-d树 VP树
非二叉树	指数树（英语：Exponential tree）融合树（英语：Fusion tree） PQ树（英语：PQ tree） SPQR树（英语：SPQR tree）
空间数据分割树	R树 R*树 R+树 X树 M树线段树 (储存区间) 线段树 (区间查询) 可持久化线段树希尔伯特R树优先R树
其他树	散列日历散列树手指树（英语：Finger tree）顺序统计树度量树（英语：Metric tree）覆盖树（英语：Cover tree） BK树二重连锁树（英语：Doubly chained tree） iDistance（英语：iDistance） Link-cut tree（英语：Link-cut tree） Log-structured merge-tree（英语：Log-structured merge-tree）树状数组哈希树

查论编数据结构
类型	集合容器
抽象类型	关联数组多重关连数组（英语：Multimap）串列前向串列堆栈队列双端队列优先队列双端优先队列集合多重集并查集可持久化数据结构线段树
数组	字串位数组环形缓冲器动态数组哈希表哈希数组树（英语：Hashed array tree）稀疏矩阵
链（英语：Linked data structure）	关联表（英语：Association list）链表跳跃列表松散链表（英语：Unrolled linked list）异或链表
树	线段树自平衡二叉查找树 B树二叉树 AA树 AVL树红黑树平衡树伸展树二叉查找树堆二叉堆左偏树二项堆斐波那契堆 R树 R*树 R+树希尔伯特R树（英语：Hilbert R-tree）希尔伯特前缀树哈希树
图	有向图有向无环图二元决策图无向图确定性非循环有限自动机（英语：Deterministic acyclic finite state automaton）
数据结构术语列表