Showing posts with label tree. Show all posts

Saturday, June 28, 2025

Issues with a Naive Fibonacci

Let’s look in detail at a naïve—that is, simple and elegant, but computationally expensive (you’ll get O(2^n))—way to implement Fibonacci. This, perhaps, was the way you learned to implement it when you first came across the idea of recursive programming.


Fibonacci is defined very simply:

  • Fib(0) is 0
  • Fib(1) is 1
  • Fib(n) is the sum of Fib(n-1) and Fib(n-2) for all n >= 2

This last step is what makes it recursive—and, unfortunately, what makes the elegant recursive solution so expensive. Let’s look at why.

The following is as literal (and correct) a translation of the 3 bullet points above as possible:

public static int fibonacci(int n) {
    if (n == 0 || n == 1) {
        return n;
    }
    return fibonacci(n - 1) + fibonacci(n - 2);
}

There’s a problem with this code, if not in theory then in practice: it repeats a lot of work unnecessarily.

Suppose we need fib(10). Then we need

  • fib(9)
    • for which we need fib(8)
      • for which we need fib(7)
        • for which we need fib(6)
        • and fib(5)
      • and fib(6)
        • for which we need fib(5)
        • and fib(4)
    • and fib(7)
  • fib(8)
    • for which we need fib(7)
    • and fib(6)

The tree of calls isn’t even complete—this is why Fibonacci takes so long, O(2^n)—and look at how many times the same subproblems, fib(7), fib(6), fib(5), and so on, get recomputed. And realize that if, for example, I need fib(100), I need to do that whole tree’s worth of calculation before I can find the answer, even if fib(100) is just an intermediate goal.

Of course, fib(100) is quite an extreme example, but it does its job: it shows just how much work you repeat due to the exponential nature of the problem.

Let’s actually look at a tree showing the sequence of calls for fib(7) to see just how much work is repeated. This sets us up, in the next article, for a better (if less obvious and elegant) way to calculate Fibonacci that saves orders of magnitude of time and memory:

[Diagram: the tree of recursive calls for fib(7)]

Fib(7), by definition, requires fib(6) and fib(5). But here’s the problem: the traditional approach is to calculate fib(6) and throw away the intermediate results—including fib(5), an answer we need anyway, which was already computed as part of fib(6) but which we foolishly threw away.

We get the O(2^n) complexity from the fact that the naïve formula splits each call into two subproblems and calculates each one independently; we see that in the branching of the tree. 2^n grows very, very quickly, so while this approach always remains correct, its efficiency degrades just as quickly, and the method we’ll propose in the next article takes over as the better way to approach this problem.
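One way to make the repetition concrete is to instrument the naïve function with a call counter. A minimal sketch (the class name and the `calls` field are ours, purely for illustration):

```java
// The same naïve fibonacci, instrumented to count every invocation,
// including all the repeated ones.
public class FibCounter {
    static long calls = 0;

    public static int fibonacci(int n) {
        calls++;  // count this call, repeat or not
        if (n == 0 || n == 1) {
            return n;
        }
        return fibonacci(n - 1) + fibonacci(n - 2);
    }

    public static void main(String[] args) {
        for (int n : new int[] {10, 20, 30}) {
            calls = 0;
            int result = fibonacci(n);
            System.out.println("fib(" + n + ") = " + result + " in " + calls + " calls");
        }
    }
}
```

Running this, fib(10) takes 177 calls while fib(30) takes over 2.6 million: exponential growth in miniature.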

Monday, June 23, 2025

Red-Black Trees

There exists a special kind of binary search tree called a red-black tree that implements certain extra rules to ensure it stays balanced. Balancing is the process of repositioning nodes to decrease the height of a tree, thereby preventing it from degenerating into a list. Red-black trees, because they impose these conditions to self-balance, perform better, and their performance degrades much more slowly.


Normal binary trees have 4 important relations:

  • You: wherever you are
  • Your parent: the node you are connected to that is one level closer to the root than you
  • Your child(ren): the node(s) you are connected to that are one level further from the root than you
  • Your sibling(s): any other node(s) that share your parent

Red-black tree rules force us to care about 2 more relations:

  • Your uncle: The sibling of your parent
  • Your grandparent: the parent of your parent

The following coloring rules apply:

  • The root is black
  • Null nodes are black by default
  • Red nodes’ children are not red
  • Every path from a given node down to any of its descendant null (leaf) nodes passes through the same number of black nodes


Both inserting and deleting nodes might cause violations of these rules to occur. Here are possible violations and how to fix them:

If you insert as red:

  • And your parent is black
    • You’re fine!
  • And your parent is red
    • Keep reading and check on your uncle
    • And your uncle is red
      • Color your parent black
      • Color your uncle black
      • Color your grandparent red
      • Make sure that doesn’t cause any problems, and if there are any, fix them
    • And your uncle is black
      • If you and your parent are on opposite sides (e.g., you’re a right child of a left child)
        • Rotate left, i.e., counter-clockwise, about your parent (this puts you and your parent on the same side)
      • If you and your parent are on the same side (e.g., you’re a left child of a left child)
        • Rotate right, i.e., clockwise, about your grandparent and recolor
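The red-uncle branch above can be sketched directly in code. The `RBNode` class and method name here are hypothetical, just to show the recoloring step:

```java
// A sketch of the "red uncle" case: parent and uncle become black,
// grandparent becomes red. RBNode and recolorForRedUncle are our own names.
class RBNode {
    boolean red = true;   // new nodes are inserted red
    RBNode parent, left, right;
}

class RBInsertFixup {
    // Assumes node's parent and uncle are both red (the case described above).
    static void recolorForRedUncle(RBNode node) {
        RBNode parent = node.parent;
        RBNode grandparent = parent.parent;
        RBNode uncle = (parent == grandparent.left) ? grandparent.right
                                                    : grandparent.left;
        parent.red = false;       // color your parent black
        uncle.red = false;        // color your uncle black
        grandparent.red = true;   // color your grandparent red
        // The grandparent may now violate a rule itself;
        // the caller repeats the check from there.
    }
}
```

Note that this step may push the violation two levels up, which is why the rules say to "keep checking" afterwards.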


A left rotation looks like:

Before Rotation:

    x
     \
      y
     / \
    a   b

After Left Rotation:

      y
     / \
    x   b
     \
      a

And has these steps:

  • Let y be x’s right child.
  • Move y’s left subtree to be x’s right subtree.
  • Point x’s parent at y instead of x (and set y’s parent accordingly).
  • Set y’s left child to x.
  • Update x’s parent to y.


And a right rotation looks like this:


Before Right Rotation:


      x
     /
    y
   / \
  a   b

After Right Rotation:

    y
   / \
  a   x
     /
    b


and has these steps:

  • Let y be x’s left child.
  • Move y’s right subtree to be x’s left subtree.
  • Point x’s parent at y instead of x (and set y’s parent accordingly).
  • Set y’s right child to x.
  • Update x’s parent to y.
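Putting the two step lists together, here is a sketch of both rotations on a bare-bones node type. The `Node` class and field names are ours, not from any particular library:

```java
// Minimal node with parent links, as the rotation steps require.
class Node {
    int key;
    Node left, right, parent;
    Node(int key) { this.key = key; }
}

class Rotations {
    // Rotate left (counter-clockwise) about x: x's right child y takes x's place.
    static Node rotateLeft(Node root, Node x) {
        Node y = x.right;                 // let y be x's right child
        x.right = y.left;                 // y's left subtree becomes x's right subtree
        if (y.left != null) y.left.parent = x;
        y.parent = x.parent;              // point x's old parent at y
        if (x.parent == null) root = y;
        else if (x == x.parent.left) x.parent.left = y;
        else x.parent.right = y;
        y.left = x;                       // set y's left child to x
        x.parent = y;                     // update x's parent to y
        return root;                      // root may have changed
    }

    // Rotate right (clockwise) about x: the mirror image of rotateLeft.
    static Node rotateRight(Node root, Node x) {
        Node y = x.left;
        x.left = y.right;
        if (y.right != null) y.right.parent = x;
        y.parent = x.parent;
        if (x.parent == null) root = y;
        else if (x == x.parent.right) x.parent.right = y;
        else x.parent.left = y;
        y.right = x;
        x.parent = y;
        return root;
    }
}
```

Returning the (possibly new) root keeps the caller honest: a rotation about the root replaces the root.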

After inserting as red, you might need to recolor:

  • If your parent and uncle are red, make them black and your grandparent red
  • If the new node’s uncle is black and the new node and its parent are on opposite sides (right child of a left child, or left child of a right child), rotate about the parent to straighten the path
  • If the new node’s uncle is black and the new node and its parent are on the same side (right of a right, or left of a left), rotate about the grandparent and recolor the parent and grandparent

After deleting:

  • If deleting leaves a black node replacing a black node (a “double black”), this can cause problems
    • If you have a red sibling
      • Recolor your parent and sibling, and rotate; this reduces to one of the cases below
    • If your black sibling has black children
      • Recolor the sibling red; now your parent has the problem. Fix it appropriately.
    • If your black sibling has a red child
      • Rotate and recolor


Sunday, June 22, 2025

Depth- and Breadth-First Searches

Suppose now you’re looking for a particular value in a graph. (We’ll demonstrate with a tree, but the graph does not need to be one.) You essentially have two ways to go about this search (for example, to solve a maze).

Let’s say you have 10 paths before you, and each is 10 steps deep. You could go 10 steps down path #1, then 10 steps down #2, then 10 steps down #3, and so on, until you find what you want or exhaust all options; or you could go 1 step down paths 1-10, then a second step down paths 1-10, then a third step down paths 1-10, and so on, until, again, you either find what you want or exhaust all options.

If what you’re looking for is in the graph—an exit from a maze, for example—both of these methods will find the answer. Both are fundamental graph algorithms.

The algorithm by which you go as deep as you can, until you can’t anymore, is called a depth-first search: it prioritizes maximum depth on a path, before moving to another path. The alternative is the breadth-first search, by which you explore as many paths as possible at a certain level, before going to the next level for any of them.

Here's the pseudocode for depth-first:

procedure DFS(node, visited):
    mark node as visited
    for each neighbor in node.neighbors:
        if neighbor is not visited:
            DFS(neighbor, visited)

Note that DFS() is recursive and uses a stack (implicitly, the call stack).

And here’s the BFS pseudocode, which, this time, is iterative and uses a queue:

procedure BFS(Graph G, start_node):
    create an empty queue Q
    mark start_node as visited
    enqueue start_node into Q

    while Q is not empty:
        node = dequeue Q
        for each neighbor in node.neighbors:
            if neighbor is not visited:
                mark neighbor as visited
                enqueue neighbor into Q
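Translating both pieces of pseudocode into Java might look like the sketch below. The adjacency-list representation (a map from node to neighbor list) is an assumption; any way of looking up neighbors works:

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Queue;
import java.util.Set;

class GraphSearch {
    // Depth-first: recursion rides the call stack to go as deep as possible first.
    static void dfs(Map<Integer, List<Integer>> graph, int node,
                    Set<Integer> visited, List<Integer> order) {
        visited.add(node);                 // mark node as visited
        order.add(node);                   // record the visit order
        for (int neighbor : graph.getOrDefault(node, List.of())) {
            if (!visited.contains(neighbor)) {
                dfs(graph, neighbor, visited, order);
            }
        }
    }

    // Breadth-first: an explicit queue finishes each level before the next.
    static List<Integer> bfs(Map<Integer, List<Integer>> graph, int start) {
        List<Integer> order = new ArrayList<>();
        Set<Integer> visited = new HashSet<>();
        Queue<Integer> queue = new ArrayDeque<>();
        visited.add(start);
        queue.add(start);
        while (!queue.isEmpty()) {
            int node = queue.remove();
            order.add(node);
            for (int neighbor : graph.getOrDefault(node, List.of())) {
                if (!visited.contains(neighbor)) {
                    visited.add(neighbor);  // mark before enqueueing, per the pseudocode
                    queue.add(neighbor);
                }
            }
        }
        return order;
    }
}
```

On the same graph, the two methods visit the same nodes but in a different order, which is exactly the depth-versus-breadth distinction above.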

Saturday, June 21, 2025

Full, Complete, Balanced, and Degenerate Trees

We’ve looked at general binary trees: connected graphs without cycles, with a single root, in which each node has 0, 1, or 2 children.

Let’s now look at 3 subcategories of binary trees that impose more rules, each one getting closer to a “proper” binary tree by a stricter application of one of those rules.

  • First, full: a general binary tree in theory can have nodes with 1 child. A full tree is one in which no one-child nodes exist; every node has either 2 children or none
  • Next, complete: a general binary tree can have gaps. A complete tree is one in which every level except (maybe) the last is completely filled, and the nodes in the last level sit as far to the left as possible
  • Then, balanced: a balanced tree is one in which, looking at the left and right subtrees (those rooted at the children of the global root), the heights of those two subtrees differ by 0 or 1
  • In fact, it might be good to add a fourth category as an anti-pattern: the degenerate tree. Suppose for a moment that from the root, one always moves in the same direction when adding a child: add the root, move right, add a child; move right, add a child; and so on. This tree is no better than a linked list (and it effectively is one, called a tree by mistake)

Try to draw these four groupings of trees.
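The balanced property as defined above can be checked with a short height comparison. A sketch, with our own `BalanceCheck` and `Node` names (stricter definitions, such as AVL’s, apply the same check at every node rather than only at the root):

```java
// Checks the "balanced" definition above: the heights of the root's
// left and right subtrees differ by at most 1.
class BalanceCheck {
    static class Node {
        Node left, right;
    }

    // Height counted in nodes; an empty tree has height 0.
    static int height(Node node) {
        if (node == null) return 0;
        return 1 + Math.max(height(node.left), height(node.right));
    }

    static boolean isBalancedAtRoot(Node root) {
        if (root == null) return true;
        return Math.abs(height(root.left) - height(root.right)) <= 1;
    }
}
```

A degenerate, list-like tree fails this check as soon as it is three nodes deep on one side, which is exactly why it behaves like a linked list.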

Switch

 Other than if/if-else/if-else if-else and the ternary operator, there is yet another common and important conditional expression in Java th...