What is the difference between set and unordered_set in C++?

Date: 2023-10-07


Problem description

I came across this good question, which is similar but not at all the same, since it is about Java, whose hash-table implementations differ by virtue of having synchronized accessors/mutators: What are the differences between a HashMap and a Hashtable in Java?

So what is the difference between the C++ implementations of set and unordered_set? The question can of course be extended to map vs. unordered_map, and so on for other C++ containers.

Here is my initial assessment:

set: While the standard doesn't explicitly require a tree, the time-complexity constraints on its find/insert operations mean that in practice it is always implemented as one, usually a red-black tree (as seen in GCC 4.8), which is height-balanced. Because the tree is height-balanced, find() has predictable time complexity.

Pros: Compact (compared to the other data structures discussed here)

Cons: Access time complexity is O(lg n)

unordered_set: While the standard doesn't explicitly require a hash table, the time-complexity constraints on its find/insert operations mean it will always be implemented as one.

Pros:

Faster (promises average-case O(1) for search)
Basic primitives are somewhat easier to make thread-safe than with a tree-based data structure

Cons:

Lookup is not guaranteed to be O(1). The theoretical worst case is O(n).
Not as compact as a tree (for practical purposes the load factor is never 1).

Note: The O(1) for a hash table comes from the assumption that there are no collisions. Even with a load factor of 0.5, roughly every second insertion leads to a collision. The number of operations required to access an element grows with the load factor of the hash table: the more we reduce the number of operations, the sparser the table has to be. When the stored elements are of a size comparable to a pointer, the overhead is quite significant.

Did I miss any difference between map/set, for performance analysis, that one should know about?
