.. index:: pair: splay trees; design .. _design-splay: Splay trees =========== .. mps:prefix:: design.mps.splay Introduction ------------ :mps:tag:`intro` This document explains the design of impl.c.splay, an implementation of Splay Trees, including its interface and implementation. :mps:tag:`readership` This document is intended for any MM developer. :mps:tag:`source` The primary sources for this design are [ST85]_ and [Sleator96]_. As CBS is a client, design.mps.cbs_. As PoolMVFF is an indirect client, design.mps.poolmvff_. Also, as PoolMVT is an indirect client, design.mps.poolmvt_. .. _design.mps.cbs: cbs .. _design.mps.poolmvt: poolmvt .. _design.mps.poolmvff: poolmvff :mps:tag:`background` The following background documents influence the design: guide.impl.c.adt(0). Overview -------- :mps:tag:`overview` Splay trees are a form of binary tree where each access brings the accessed element (or the nearest element) to the root of the tree. The restructuring of the tree caused by the access gives excellent amortised performance, as the splay tree adapts its shape to usage patterns. Unused nodes have essentially no time overhead. Definitions ----------- :mps:tag:`def.splay-tree` A *splay tree* is a self-adjusting binary tree as described in [ST85]_ and [Sleator96]_. :mps:tag:`def.node` A *node* is used in the typical data structure sense to mean an element of a tree (see also :mps:ref:`.type.tree`). :mps:tag:`def.key` A *key* is a value associated with each node; the keys are totally ordered by a client provided comparator. :mps:tag:`def.comparator` A *comparator* is a function that compares keys to determine their ordering (see also :mps:ref:`.type.tree.compare.method`). :mps:tag:`def.successor` Node *N*\ :subscript:`2` is the *successor* of node *N*\ :subscript:`1` if *N*\ :subscript:`1` and *N*\ :subscript:`2` are both in the same tree, and the key of *N*\ :subscript:`2` immediately follows the key of *N*\ :subscript:`1` in the ordering of all keys for the tree. :mps:tag:`def.left-child` Each node *N* contains a *left child*, which is a (possibly empty) sub-tree of nodes. The key of *N* is ordered after the keys of all nodes in this sub-tree. :mps:tag:`def.right-child` Each node *N* contains a *right child*, which is a (possibly empty) sub-tree of nodes. The key of *N* is ordered before the keys of all nodes in this sub-tree. :mps:tag:`def.neighbour` The *left neighbour* of a key *K* is the node *N* with the largest key that compares less than *K* in the total order. The *right neighbour* of a key *K* is the node *N* with the smaller key that compares greater than *K* in the total order. A node is a *neighbour* of a key if it is either the left or right neighbour of the key. :mps:tag:`def.first` A node is the *first* node in a set of nodes if its key compares less than the keys of all other nodes in the set. :mps:tag:`def.last` A node is the *last* node in a set of nodes if its key compares greater than the keys of all other nodes in the set. :mps:tag:`def.client-property` A *client property* is a value that the client may associate with each node in addition to the key (a block size, for example). This splay tree implementation provides support for efficiently finding the first or last nodes with suitably large client property values. See also :mps:ref:`.prop` below. Requirements ------------ :mps:tag:`req` These requirements are drawn from those implied by design.mps.poolmvt_, design.mps.poolmvff_, design.mps.cbs_, and general inferred MPS requirements. :mps:tag:`req.order` Must maintain a set of abstract keys which is totally ordered for a comparator. :mps:tag:`req.fast` Common operations must have low amortized cost. :mps:tag:`req.add` Must be able to add new nodes. This is a common operation. :mps:tag:`req.remove` Must be able to remove nodes. This is a common operation. :mps:tag:`req.locate` Must be able to locate a node, given a key. This is a common operation. :mps:tag:`req.neighbours` Must be able to locate the neighbouring nodes of a key (see :mps:ref:`.def.neighbour`). This is a common operation. :mps:tag:`req.iterate` Must be able to iterate over all nodes in key order with reasonable efficiency. :mps:tag:`req.protocol` Must support detection of protocol violations. :mps:tag:`req.debug` Must support debugging of clients. :mps:tag:`req.stack` Must do all non-debugging operations with stack usage bounded by a constant size. :mps:tag:`req.adapt` Must adapt to regularities in usage pattern, for better performance. :mps:tag:`req.property` Must permit a client to associate a client property (such as a size) with each node in the tree. :mps:tag:`req.property.change` Must permit a client to dynamically reassign client properties to nodes in the tree. This is a common operation. :mps:tag:`req.property.find` Must support rapid finding of the first and last nodes which have a suitably large value for their client property. This is a common operation. :mps:tag:`req.root` Must be able to find the root of a splay tree (if one exists). Generic binary tree interface ----------------------------- Types ..... .. c:type:: struct TreeStruct *Tree :mps:tag:`type.tree` ``Tree`` is the type of a node in a binary tree. ``Tree`` contains no fields to store the key associated with the node, or the client property. Again, it is intended that the :c:type:`TreeStruct` can be embedded in another structure, and that this is how the association will be made (see :mps:ref:`.usage.client-node` for an example). No convenience functions are provided for allocation or deallocation. .. c:type:: void *TreeKey :mps:tag:`type.treekey` ``TreeKey`` is the type of a key associated with a node in a binary tree. It is an alias for ``void *`` but expresses the intention. .. c:type:: TreeKey (*TreeKeyMethod)(Tree tree) :mps:tag:`type.tree.key.method` A function of type ``TreeKey`` returns the key associated with a node in a binary tree. (Since there is no space in a :c:type:`TreeStruct` to store a key, it is expected that the :c:type:`TreeStruct` is embedded in another structure from which the key can be extracted.) .. c:type:: Compare (*TreeCompare)(Tree tree, TreeKey key) :mps:tag:`type.tree.compare.method` A function of type ``TreeCompare`` is required to compare ``key`` with the key the client associates with that splay tree node ``tree``, and return the appropriate Compare value (see :mps:ref:`.usage.compare` for an example). The function compares a key with a node, rather than a pair of keys or nodes as might seem more obvious. This is because the details of the mapping between nodes and keys is left to the client (see :mps:ref:`.type.tree`), and the splaying operations compare keys with nodes (see :mps:ref:`.impl.splay`). .. c:type:: Res (*TreeDescribeMethod)(Tree tree, mps_lib_FILE *stream) :mps:tag:`type.tree.describe.method` A function of type :c:type:`TreeDescribeMethod` is required to write (via :c:func:`WriteF()`) a client-oriented representation of the splay node. The output should be non-empty, short, and without newline characters. This is provided for debugging only. Functions ......... .. c:function:: Bool TreeCheck(Tree tree) :mps:tag:`function.tree.check` This is a check function for the ``Tree`` type (see guide.impl.c.adt.method.check and design.mps.check_). .. _design.mps.check: check Splay tree interface -------------------- Types ..... .. c:type:: struct SplayTreeStruct *SplayTree :mps:tag:`type.splay.tree` :c:type:`SplayTree` is the type of the main object at the root of the splay tree. It is intended that the :c:type:`SplayTreeStruct` can be embedded in another structure (see :mps:ref:`.usage.client-tree` for an example). No convenience functions are provided for allocation or deallocation. .. c:type:: Bool (*SplayTestNodeMethod)(SplayTree splay, Tree tree, void *closureP, Size closureS) :mps:tag:`type.splay.test.node.method` A function of type :c:type:`SplayTestNodeMethod` required to determine whether the node itself meets some client determined property (see :mps:ref:`.prop` and :mps:ref:`.usage.test.node` for an example). Parameters ``closureP`` and ``closureS`` describe the environment for the function (see :mps:ref:`.function.splay.find.first` and :mps:ref:`.function.splay.find.last`). .. c:type:: Bool (*SplayTestTreeMethod)(SplayTree splay, Tree tree, void *closureP, Size closureS) :mps:tag:`type.splay.test.tree.method` A function of type :c:type:`SplayTestTreeMethod` is required to determine whether any of the nodes in the sub-tree rooted at the given node meet some client determined property (see :mps:ref:`.prop` and :mps:ref:`.usage.test.tree` for an example). In particular, it must be a precise (not conservative) indication of whether there are any nodes in the sub-tree for which the ``testNode`` method (see :mps:ref:`.type.splay.test.node.method`) would return :c:macro:`TRUE`. Parameters ``closureP`` and ``closureS`` describe the environment for the function (see :mps:ref:`.function.splay.find.first` and :mps:ref:`.function.splay.find.last`). .. c:type:: void (*SplayUpdateNodeMethod)(SplayTree splay, Tree tree) :mps:tag:`type.splay.update.node.method` A function of type :c:type:`SplayUpdateNodeMethod` is required to update any client data structures associated with a node to maintain some client determined property (see :mps:ref:`.prop`) given that the children of the node have changed. (See :mps:ref:`.usage.callback` for an example) Functions ......... :mps:tag:`function.no-thread` The interface functions are not designed to be either thread-safe or re-entrant. Clients of the interface are responsible for synchronization, and for ensuring that client-provided methods invoked by the splay module (:mps:ref:`.type.tree.compare.method`, :mps:ref:`.type.tree.key.method`, :mps:ref:`.type.splay.test.node.method`, :mps:ref:`.type.splay.test.tree.method`, :mps:ref:`.type.splay.update.node.method`) do not call functions of the splay module. .. c:function:: Bool SplayTreeCheck(SplayTree splay) :mps:tag:`function.splay.tree.check` This is a check function for the :c:type:`SplayTree` type (see guide.impl.c.adt.method.check and design.mps.check_). .. c:function:: void SplayTreeInit(SplayTree splay, TreeCompareMethod compare, TreeKeyMethod nodeKey, SplayUpdateNodeMethod updateNode) :mps:tag:`function.splay.tree.init` This function initialises a :c:type:`SplayTree` (see guide.impl.c.adt.method.init). The ``nodeKey`` function extracts a key from a tree node, and the ``compare`` function defines a total ordering on keys of nodes (see :mps:ref:`.req.order`). The effect of supplying a compare method that does not implement a total ordering is undefined. The ``updateNode`` method is used to keep client properties up to date when the tree structure changes; the value ``SplayTrivUpdate`` may be used for this method if there is no need to maintain client properties. (See :mps:ref:`.usage.initialization` for an example use). .. c:function:: void SplayTreeFinish(SplayTree splay) :mps:tag:`function.splay.tree.finish` This function clears the fields of a :c:type:`SplayTree` (see guide.impl.c.adt.method.finish). Note that it does not attempt to finish or deallocate any associated ``Tree`` objects; clients wishing to destroy a non-empty :c:type:`SplayTree` must first explicitly descend the tree and call :c:func:`TreeFinish()` on each node from the bottom up. .. c:function:: Bool SplayTreeInsert(SplayTree splay, Tree tree, void *key) :mps:tag:`function.splay.tree.insert` This function is used to insert into a splay tree a new node which is associated with the supplied key (see :mps:ref:`.req.add`). It first splays the tree at the key. If an attempt is made to insert a node that compares ``CompareEQUAL`` to an existing node in the tree, then :c:macro:`FALSE` will be returned and the node will not be inserted. (See :mps:ref:`.usage.insert` for an example use). .. c:function:: Bool SplayTreeDelete(SplayTree splay, Tree tree, void *key) :mps:tag:`function.splay.tree.delete` This function is used to delete from a splay tree a node which is associated with the supplied key (see :mps:ref:`.req.remove`). If the tree does not contain the given node, or the given node does not compare ``CompareEQUAL`` with the given key, then :c:macro:`FALSE` will be returned, and the node will not be deleted. The function first splays the tree at the given key. (See :mps:ref:`.usage.delete` for an example use). .. c:function:: Bool SplayTreeFind(Tree *nodeReturn, SplayTree splay, TreeKey key) :mps:tag:`function.splay.tree.find` Search the splay tree for a node that compares ``CompareEQUAL`` to the given key (see :mps:ref:`.req.locate`), and splay the tree at the key. Return :c:macro:`FALSE` if there is no such node in the tree, otherwise set ``*nodeReturn`` to the node and return :c:macro:`TRUE`. .. c:function:: Bool SplayTreeNeighbours(Tree *leftReturn, Tree *rightReturn, SplayTree splay, TreeKey key) :mps:tag:`function.splay.tree.neighbours` Search a splay tree for the two nodes that are the neighbours of the given key (see :mps:ref:`.req.neighbours`). Splay the tree at the key. If any node in the tree compares ``CompareEQUAL`` with the given key, return :c:macro:`FALSE`. Otherwise return :c:macro:`TRUE`, set ``*leftReturn`` to the left neighbour of the key (or ``TreeEMPTY`` if the key has no left neighbour), and set ``*rightReturn`` to the right neighbour of the key (or ``TreeEMPTY`` if the key has no right neighbour). See :mps:ref:`.usage.insert` for an example of use. .. c:function:: Tree SplayTreeFirst(SplayTree splay) :mps:tag:`function.splay.tree.first` If the tree has no nodes, return ``TreeEMPTY``. Otherwise, splay the tree at the first node, and return that node (see :mps:ref:`.req.iterate`). .. c:function:: Tree SplayTreeNext(SplayTree splay, TreeKey key) :mps:tag:`function.splay.tree.next` If the tree contains a right neighbour for ``key``, splay the tree at that node and return it. Otherwise return ``TreeEMPTY``. See :mps:ref:`.req.iterate`. .. c:function:: Res SplayTreeDescribe(SplayTree splay, mps_lib_FILE *stream, Count depth, TreeDescribeMethod nodeDescribe) :mps:tag:`function.splay.tree.describe` This function prints (using :c:func:`WriteF()`) to the stream a textual representation of the given splay tree, using ``nodeDescribe`` to print client-oriented representations of the nodes (see :mps:ref:`.req.debug`). Provided for debugging only. .. c:function:: Bool SplayFindFirst(Tree *nodeReturn, SplayTree splay, SplayTestNodeMethod testNode, SplayTestTreeMethod testTree, void *closureP, Size closureS) :mps:tag:`function.splay.find.first` Find the first node in the tree that satisfies some client property, as determined by the ``testNode`` and ``testTree`` methods (see :mps:ref:`.req.property.find`). ``closureP`` and ``closureS`` are arbitrary values, and are passed to the ``testNode`` and ``testTree`` methods which may use the values as closure environments. If there is no satisfactory node, return :c:macro:`FALSE`; otherwise set ``*nodeReturn`` to the node and return :c:macro:`TRUE`. See :mps:ref:`.usage.delete` for an example. .. c:function:: Bool SplayFindLast(Tree *nodeReturn, SplayTree splay, SplayTestNodeMethod testNode, SplayTestTreeMethod testTree, void *closureP, Size closureS) :mps:tag:`function.splay.find.last` As :c:func:`SplayFindFirst()`, but find the last node in the tree that satisfies the client property. .. c:function:: void SplayNodeRefresh(SplayTree splay, Tree tree, TreeKey key) :mps:tag:`function.splay.node.refresh` Call the ``updateNode`` method on the given node, and on any other nodes that may require updating. The client key for the node must also be supplied; the function splays the tree at this key. (See :mps:ref:`.usage.insert` for an example use). This function must be called whenever the client property (see :mps:ref:`.prop`) at a node changes (see :mps:ref:`.req.property.change`). .. c:function:: void SplayNodeUpdate(SplayTree splay, Tree node) :mps:tag:`function.splay.node.update` Call the ``updateNode`` method on the given node, but leave other nodes unchanged. This may be called when a new node is created, to get the client property off the ground. Client-determined properties ---------------------------- :mps:tag:`prop` To support :mps:ref:`.req.property.find`, this splay tree implementation provides additional features to permit clients to cache maximum (or minimum) values of client properties for all the nodes in a subtree. The splay tree implementation uses the cached values as part of :c:func:`SplayFindFirst()` and :c:func:`SplayFindLast()` via the ``testNode`` and ``testTree`` methods. The client is free to choose how to represent the client property, and how to compute and store the cached value. :mps:tag:`prop.update` The cached values depend upon the topology of the tree, which may vary as a result of operations on the tree. The client is given the opportunity to compute new cache values whenever necessary, via the ``updateNode`` method (see :mps:ref:`.function.splay.tree.init`). This happens whenever the tree is restructured. The client may use the :c:func:`SplayNodeRefresh()` method to indicate that the client attributes at a node have changed (see :mps:ref:`.req.property.change`). A call to :c:func:`SplayNodeRefresh()` splays the tree at the specified node, which may provoke calls to the ``updateNode`` method as a result of the tree restructuring. The ``updateNode`` method will also be called whenever a new splay node is inserted into the tree. :mps:tag:`prop.example` For example, if implementing an address-ordered tree of free blocks using a splay tree, a client might choose to use the base address of each block as the key for each node, and the size of each block as the client property. The client can then maintain as a cached value in each node the size of the largest block in the subtree rooted at that node. This will permit a fast search for the first or last block of at least a given size. See :mps:ref:`.usage.callback` for an example ``updateNode`` method for such a client. :mps:tag:`prop.ops` The splay operations must cause client properties for nodes to be updated in the following circumstances (see :mps:ref:`.impl` for details): :mps:tag:`prop.ops.rotate` rotate left, rotate right -- We need to update the value at the original root, and the new root, in that order. :mps:tag:`prop.ops.link` link left, link right -- We know that the line of right descent from the root of the left tree and the line of left descent from the root of the right tree will both need to be updated. This is performed at the assembly stage. (We could update these chains every time we do a link left or link right instead, but this would be less efficient) :mps:tag:`prop.ops.assemble` assemble -- This operation also invalidates the lines of right and left descent of the left and right trees respectively which need to be updated (see below). It also invalidates the root which must be updated last. :mps:tag:`prop.ops.assemble.reverse` To correct the chains of the left and right trees without requiring stack or high complexity, we use a judicious amount of pointer reversal. :mps:tag:`prop.ops.assemble.traverse` During the assembly, after the root's children have been transplanted, we correct the chains of the left and right trees. For the left tree, we traverse the right child line, reversing pointers, until we reach the node that was the last node prior to the transplantation of the root's children. Then we update from that node back to the left tree's root, restoring pointers. Updating the right tree is the same, mutatis mutandis. Usage ----- :mps:tag:`usage` Here's a simple example of a client which uses a splay tree to implement an address ordered tree of free blocks. The significant client usages of the splay tree interface might look as follows:- :mps:tag:`usage.client-tree` Tree structure to embed a :c:type:`SplayTree` (see :mps:ref:`.type.splay.tree`):: typedef struct FreeTreeStruct { SplayTreeStruct splayTree; /* Embedded splay tree */ /* no obvious client fields for this simple example */ } FreeTreeStruct; :mps:tag:`usage.client-node` Node structure to embed a ``Tree`` (see :mps:ref:`.type.tree`):: typedef struct FreeBlockStruct { TreeStruct treeStruct; /* embedded splay node */ Addr base; /* base address of block is also the key */ Size size; /* size of block is also the client property */ Size maxSize; /* cached value for maximum size in subtree */ } FreeBlockStruct; :mps:tag:`usage.callback` ``updateNode`` callback method (see :mps:ref:`.type.splay.update.node.method`):: void FreeBlockUpdateNode(SplayTree splay, Tree tree) { /* Compute the maximum size of any block in this subtree. */ /* The value to cache is the maximum of the size of this block, */ /* the cached value for the left subtree (if any) and the cached */ /* value of the right subtree (if any) */ FreeBlock freeNode = FreeBlockOfTree(tree); Size maxSize = freeNode.size; if (TreeHasLeft(tree)) { FreeBlock leftNode = FreeBlockOfTree(TreeLeft(tree)); if(leftNode.maxSize > maxSize) maxSize = leftNode->maxSize; } if (TreeHasRight(tree)) { FreeBlock rightNode = FreeBlockOfTree(TreeRight(tree)); if(rightNode.maxSize > maxSize) maxSize = rightNode->maxSize; } freeNode->maxSize = maxSize; } :mps:tag:`usage.compare` Comparison function (see :mps:ref:`.type.tree.compare.method`):: Compare FreeBlockCompare(Tree tree, TreeKey key) { Addr base1, base2, limit2; FreeBlock freeNode = FreeBlockOfTree(tree); base1 = (Addr)key; base2 = freeNode->base; limit2 = AddrAdd(base2, freeNode->size); if (base1 < base2) return CompareLESS; else if (base1 >= limit2) return CompareGREATER; else return CompareEQUAL; } :mps:tag:`usage.test.tree` Test tree function (see :mps:ref:`.type.splay.test.tree.method`):: Bool FreeBlockTestTree(SplayTree splay, Tree tree void *closureP, Size closureS) { /* Closure environment has wanted size as value of closureS. */ /* Look at the cached value for the node to see if any */ /* blocks in the subtree are big enough. */ Size size = closureS; FreeBlock freeNode = FreeBlockOfTree(tree); return freeNode->maxSize >= size; } :mps:tag:`usage.test.node` Test node function (see :mps:ref:`.type.splay.test.node.method`):: Bool FreeBlockTestNode(SplayTree splay, Tree tree void *closureP, Size closureS) { /* Closure environment has wanted size as value of closureS. */ /* Look at the size of the node to see if is big enough. */ Size size = closureS; FreeBlock freeNode = FreeBlockOfTree(tree); return freeNode->size >= size; } :mps:tag:`usage.initialization` Client's initialization function (see :mps:ref:`.function.splay.tree.init`):: void FreeTreeInit(FreeTree freeTree) { /* Initialize the embedded splay tree. */ SplayTreeInit(&freeTree->splayTree, FreeBlockCompare, FreeBlockUpdateNode); } :mps:tag:`usage.insert` Client function to add a new free block into the tree, merging it with an existing block if possible:: void FreeTreeInsert(FreeTree freeTree, Addr base, Addr limit) { SplayTree splayTree = &freeTree->splayTree; Tree leftNeighbour, rightNeighbour; TreeKey key = base; /* use the base of the block as the key */ Res res; /* Look for any neighbouring blocks. (.function.splay.tree.neighbours) */ res = SplayTreeNeighbours(&leftNeighbour, &rightNeighbour, splayTree, key); AVER(res == ResOK); /* this client doesn't duplicate free blocks */ /* Look to see if the neighbours are contiguous. */ if (leftNeighbour != TreeEMPTY && FreeBlockLimitOfSplayNode(leftNeighbour) == base) { /* Inserted block is contiguous with left neighbour, so merge it. */ /* The client housekeeping is left as an exercise to the reader. */ /* This changes the size of a block, which is the client */ /* property of the splay node. See :mps:ref:`.function.splay.node.refresh` */ SplayNodeRefresh(splayTree, leftNeighbour, key); } else if (rightNeighbour != TreeEMPTY && FreeBlockBaseOfSplayNode(rightNeighbour) == limit) { /* Inserted block is contiguous with right neighbour, so merge it. */ /* The client housekeeping is left as an exercise to the reader. */ /* This changes the size of a block, which is the client */ /* property of the splay node. See :mps:ref:`.function.splay.node.refresh` */ SplayNodeRefresh(splayTree, rightNeighbour, key); } else { /* Not contiguous - so insert a new node */ FreeBlock newBlock = (FreeBlock)allocate(sizeof(FreeBlockStruct)); Tree newTree = &newBlock->treeStruct; newBlock->base = base; newBlock->size = AddrOffset(base, limit); TreeInit(newTree); /* :mps:ref:`.function.tree.init` */ SplayNodeUpdate(splayTree, newTree); /* :mps:ref:`.function.splay.node.update` */ /* :mps:ref:`.function.splay.tree.insert` */ res = SplayTreeInsert(splayTree, newTree, key); AVER(res == ResOK); /* this client doesn't duplicate free blocks */ } } :mps:tag:`usage.delete` Client function to allocate the first block of a given size in address order. For simplicity, this allocates the entire block:: Bool FreeTreeAllocate(Addr *baseReturn, Size *sizeReturn, FreeTree freeTree, Size size) { SplayTree splayTree = &freeTree->splayTree; Tree splayNode; Bool found; /* look for the first node of at least the given size. */ /* closureP parameter is not used. See `.function.splay.find.first.`_ */ found = SplayFindFirst(&splayNode, splayTree, FreeBlockTestNode, FreeBlockTestTree, NULL, size); if (found) { FreeBlock freeNode = FreeBlockOfTree(splayNode); Void *key = (void *)freeNode->base; /* use base of block as the key */ Res res; /* allocate the block */ *baseReturn = freeNode->base; *sizeReturn = freeNode->size; /* :mps:ref:`.function.splay.tree.delete` */ res = SplayTreeDelete(splayTree, splayNode, key); AVER(res == ResOK); /* Must be possible to delete node */ /* Delete the block */ deallocate(freeNode, (sizeof(FreeBlockStruct)); return TRUE; } else { /* No suitable block */ return FALSE; } } Implementation -------------- :mps:tag:`impl` For more details of how splay trees work, see [ST85]_. For more details of how to implement operations on splay trees, see [Sleator96]_. Here we describe the operations involved. Top-down splaying ................. :mps:tag:`impl.top-down` The method chosen to implement the splaying operation is called "top-down splay". This is described as "procedure top-down splay" in [ST85]_, but the implementation here additionally permits attempts to access items which are not known to be in the tree. Top-down splaying is particularly efficient for the common case where the location of the node in a tree is not known at the start of an operation. Tree restructuring happens as the tree is descended, whilst looking for the node. :mps:tag:`impl.splay` The key to the operation of the splay tree is the internal function :c:func:`SplaySplay()`. It searches the tree for a node with a given key. In the process, it brings the found node, or an arbitrary neighbour if not found, to the root of the tree. This "bring-to-root" operation is performed top-down during the search, and it is not the simplest possible bring-to-root operation, but the resulting tree is well-balanced, and will give good amortised cost for future calls to :c:func:`SplaySplay()`. See [ST85]_. :mps:tag:`impl.splay.how` To perform this top-down splay, the tree is broken into three parts, a left tree, a middle tree and a right tree. We store the left tree and right tree in the right and left children respectively of a "sides" node to eliminate some boundary conditions. The initial condition is that the middle tree is the entire splay tree, and the left and right trees are empty. We also keep pointers to the last node in the left tree, and the first node in the right tree. Note that, at all times, the three trees are each validly ordered, and they form a partition with the ordering left, middle, right. The splay is then performed by comparing the middle tree with the following six cases, and performing the indicated operations, until none apply. :mps:tag:`impl.splay.cases` Note that figure 3 of [ST85]_ describes only 3 cases: *zig*, *zig-zig* and *zig-zag*. The additional cases described here are the symmetric variants which are respectively called *zag*, *zag-zag* and *zag-zig*. In the descriptions of these cases, ``root`` is the root of the middle tree; ``node->left`` is the left child of ``node``; ``node->right`` is the right child of ``node``. The comparison operators (``<``, ``>``, ``==``) are defined to compare a key and a node in the obvious way by comparing the supplied key with the node's associated key. :mps:tag:`impl.splay.zig` The "zig" case is where ``key < root``, and either: - ``key == root->left``; - ``key < root->left && root->left->left == NULL``; or - ``key > root->left && root->left->right == NULL``. The operation for the zig case is: link right (see :mps:ref:`.impl.link.right`). :mps:tag:`impl.splay.zag` The "zag" case is where ``key > root``, and either: - ``key == root->right``; - ``key < root->right && root->right->left == NULL``; or - ``key > root->right && root->right->right == NULL``. The operation for the zag case is: link left (see :mps:ref:`.impl.link.left`). :mps:tag:`impl.splay.zig.zig` The "zig-zig" case is where - ``key < root && key < root->left && root->left->left != NULL``. The operation for the zig-zig case is: rotate right (see :mps:ref:`.impl.rotate.right`) followed by link right (see :mps:ref:`.impl.link.right`). :mps:tag:`impl.splay.zig.zag` The "zig-zag" case is where - ``key < root && key > root->left && root->left->right != NULL``. The operation for the zig-zag case is: link right (see :mps:ref:`.impl.link.right`) followed by link left (see :mps:ref:`.impl.link.left`). :mps:tag:`impl.splay.zag.zig` The "zag-zig" case is where - ``key > root && key < root->right && root->right->left != NULL``. The operation for the zag-zig case is: link left (see :mps:ref:`.impl.link.left`) followed by link right (see :mps:ref:`.impl.link.right`). :mps:tag:`impl.splay.zag.zag` The "zag-zag" case is where - ``key > root && key > root->right && root->right->right != NULL``. The operation for the zag-zag case is: rotate left (see :mps:ref:`.impl.rotate.left`) followed by link left (see :mps:ref:`.impl.link.left`). :mps:tag:`impl.splay.terminal.null` A special terminal case is when - ``root == NULL``. This can only happen at the beginning, and cannot arise from the operations above. In this case, the splay operation must return :c:macro:`NULL`, and "not found". :mps:tag:`impl.splay.terminal.found` One typical terminal case is when - ``key == root``. This case is tested for at the beginning, in which case "found" is returned immediately. If this case happens as a result of other operations, the splay operation is complete, the three trees are assembled (see :mps:ref:`.impl.assemble`), and "found" is returned. :mps:tag:`impl.splay.terminal.not-found` The other typical terminal cases are: - ``key < root && root->left == NULL``; and - ``key > root && root->right == NULL``. In these cases, the splay operation is complete, the three trees are assembled (see :mps:ref:`.impl.assemble`), and "not found" is returned. :mps:tag:`impl.rotate.left` The "rotate left" operation (see [ST85]_ figure 1) rearranges the middle tree as follows (where any of sub-trees A, B and C may be empty): .. figure:: splay-rotate-left.svg :align: center :alt: Diagram: the rotate left operation. :mps:tag:`impl.rotate.right` The "rotate right" operation (see [ST85]_ figure 1) rearranges the middle tree as follows (where any of sub-trees A, B and C may be empty): .. figure:: splay-rotate-right.svg :align: center :alt: Diagram: the rotate right operation. :mps:tag:`impl.link.left` The "link left" operation (see [ST85]_ figure 11a for symmetric variant) rearranges the left and middle trees as follows (where any of sub-trees A, B, L and R may be empty): .. figure:: splay-link-left.svg :align: center :alt: Diagram: the link left operation. The last node of the left tree is now x. :mps:tag:`impl.link.right` The "link right" operation (see [ST85]_ figure 11a) rearranges the middle and right trees as follows (where any of sub-trees A, B, L and R may be empty): .. figure:: splay-link-right.svg :align: center :alt: Diagram: the link left operation. The first node of the right tree is now x. :mps:tag:`impl.assemble` The "assemble" operation (see [ST85]_ figure 12) merges the left and right trees with the middle tree as follows (where any of sub-trees A, B, L and R may be empty): .. figure:: splay-assemble.svg :align: center :alt: Diagram: the assemble operation. Top-level operations .................... :mps:tag:`impl.insert` :c:func:`SplayTreeInsert()`: (See [Sleator96]_, chapter 4, function insert). If the tree has no nodes, [how does it smell?] add the inserted node and we're done; otherwise splay the tree around the supplied key. If the splay successfully found a matching node, return failure. Otherwise, add the inserted node as a new root, with the old (newly splayed, but non-matching) root as its left or right child as appropriate, and the opposite child of the old root as the other child of the new root. :mps:tag:`impl.delete` :c:func:`SplayTreeDelete()`: (See [Sleator96]_, chapter 4, function delete). Splay the tree around the supplied key. Check that the newly splayed root is the same node as given by the caller, and that it matches the key; return failure if not. If the given node (now at the root) has fewer than two children, replace it (as root), with the non-null child or null. Otherwise, set the root of the tree to be the left child (arbitrarily) of the node to be deleted, and splay around the same key. The new root will be the last node in the sub-tree and will have a null right child; this is set to be the right child of the node to be deleted. :mps:tag:`impl.search` :c:func:`SplayTreeSearch()`: Splay the node around the supplied key. If the splay found a matching node, return it; otherwise return failure. :mps:tag:`impl.neighbours` :c:func:`SplayTreeNeighbours()`: Splay the tree around the supplied key. If the splay found a matching node, return failure. Otherwise, determine whether the (non-matching) found node is the left or right neighbour of the key (by comparison with the key). Set the tree root to be the right or left child of that first neighbour respectively, and again splay the tree around the supplied key. The new root will be the second neighbour, and will have a null left or right child respectively. Set this null child to be the first neighbour. Return the two neighbours. :mps:tag:`impl.neighbours.note` Note that it would be possible to implement :c:func:`SplayTreeNeighbours()` with only one splay, and then a normal binary tree search for the left or right neighbour of the root. This would be a cheaper operation, but would give poorer amortised cost if the call to :c:func:`SplayTreeNeighbours()` typically precedes a call to :c:func:`SplayTreeInsert()` (which is expected to be a common usage pattern - see :mps:ref:`.usage.insert`). It's also possible to implement :c:func:`SplayTreeNeighbours()` by simply keeping track of both neighbours during a single splay. This has about the same cost as a single splay, and hence about the same amortised cost if the call to :c:func:`SplayTreeNeighbours()` typically precedes a call to :c:func:`SplayTreeInsert()`. :mps:tag:`impl.next` :c:func:`SplayTreeNext()`: Splay the tree around the supplied ``oldKey``. During iteration the "old node" found is probably already at the root, in which case this will be a null operation with little cost. If this old node has no right child, return :c:macro:`NULL`. Otherwise, split the tree into a right tree (which contains just the right child of the old node) and a left tree (which contains the old node, its left child and no right child). The next node is the first node in the right tree. Find this by splaying the right tree around ``oldKey`` (which is known to compare ``CompareLESS`` than any keys in the right tree). Rejoin the full tree, using the right tree as the root and setting the left child of root to be the left tree. Return the root of this tree. Testing ------- :mps:tag:`test` There is no plan to test splay trees directly. It is believed that the testing described in design.mps.cbs.test will be sufficient to test this implementation. Error Handling -------------- :mps:tag:`error` This module detects and reports most common classes of protocol error. The cases it doesn't handle will result in undefined behaviour and probably cause an :c:macro:`AVER` to fire. These are: :mps:tag:`error.bad-pointer` Passing an invalid pointer in place of a :c:type:`SplayTree` or ``Tree``. :mps:tag:`error.bad-compare` Initialising a :c:type:`SplayTree` with a compare function that is not a valid compare function, or which doesn't implement a total ordering on splay nodes. :mps:tag:`error.bad-describe` Passing an invalid describe method to :c:func:`SplayTreeDescribe()`. :mps:tag:`error.out-of-stack` Stack exhaustion under :c:func:`SplayTreeDescribe()`. Future ------ :mps:tag:`future.parent` The iterator could be made more efficient (in an amortized sense) if it didn't splay at each node. To implement this (whilst meeting :mps:ref:`.req.stack`) we really need parent pointers from the nodes. We could use the (first-child, right-sibling/parent) trick described in [ST85]_ to implement this, at a slight cost to all other tree operations, and an increase in code complexity. [ST85]_ doesn't describe how to distinguish the first-child between left-child and right-child, and the right-sibling/parent between right-sibling and parent. One could either use the comparator to make these distinctions, or steal some bits from the pointers. References ---------- .. [ST85] "Self-Adjusting Binary Search Trees"; Daniel Dominic Sleator, Robert Endre Tarjan; AT&T Bell Laboratories, Murray Hill, NJ; 1985-07; Journal of the ACM, Vol. 32, Num. 3, pp. 652-686, July 1985; . .. [Sleator96] "Splay Trees"; Daniel Dominic Sleator; CMU, 22/02/96; CMU 15-211; .