In A Tale of Two Puzzles, I argued that there is a difference between solving a puzzle and doing mathematics. This is particularly evident when the puzzle has artificial constraints, or is purely a why question1. To make my point clear, I'm going to approach a puzzle mathematically. I'm going to pretend that I've come across this in my research somehow and need to solve it to move on to the next bit.
1Where you're given the answer and asked to explain the “neat" reason why it is so.
2 The Puzzle
This is a famous puzzle (my book attributes it to Langley). It's an angle problem, which is supposed to be solvable without using trigonometry. There are a few equivalent ways of specifying the puzzle, and the unknown angle might be a different one (there are four which if you can work out one then the other three follow easily), I'll start with the formulation as in Figure 1.
You can fill in various angles quite easily, but none of them get you directly to .
There is clearly some cunning construction that gives you the angle. However, after having stared at this diagram for some time2 it hasn't occurred to me. So I'm going to “do the math" and just calculate the thing.
2We're talking years here.
Actually, I'm not even going to do that. I'm going to measure it. Or rather, my computer is going to. The program I'm using to draw these diagrams (which, contrary to standard practice, are drawn to scale) can measure them. Figure 2 is an accurate drawing.
So, now I'm done. I've solved the problem. Well, not quite. If this were a genuine problem I'd come across where I didn't know the answer then I wouldn't know for sure if that was exact or just a coincidence of rounding. As it's from a puzzle, I can be confident that the answer is going to be a “nice" answer and so am confident that is correct.
As a mathematician, I'm instantly curious as to the coincidence of the two angles. I want to know if there is a reason for that or if it genuinely is a coincidence.
3 Do The Experiment
There's an obvious hypothesis: that the found angle is equal to the right-hand base angle. I must admit that I doubt this one (because it should also depend on the and ), but it's worth a test. The way to test this is to vary the , keeping the other two fixed.
Varying only the , we see that is not usually equal to the base angle. We do see some angles where is “nice", such as on the base giving as , but there is no obvious pattern. Table 1 gives the list of “nice" pairs. Varying only the gives Table 2. For the top angle, only gives a “nice" angle for .
Obviously, one could explore more by varying the three together to find other triples that work. But this does not appear to be a fruitful line of enquiry. It might be illuminating to draw some graphs of against the base angles to spot the connecting function, but there are other avenues to explore first.
4 Spot The Coincidences
Let's fill in all the angles, as in Figure 3. There are a lot of angles that agree in that diagram. Although the four difficult angles (the , , , and on that midline) can also be found elsewhere, there are no obvious connections between them.
However, one thing that the overcrowded Figure 3 shows is that there are quite a few isosceles triangles here. There is the outer one, but then also one along the left-hand side with “base" angles and another at the bottom with “base" angles . Maybe the key is keeping these triangles isosceles. This gives us only one degree of freedom, which can be chosen to be the angle at the top of the outer triangle. Once this is fixed, the other two are determined by the isosceles requirements.
The top angles that give “nice" angles for are given in Table 3. There aren't many! This suggests that the secret is more complicated than that the outer triangle is isosceles.
Another route from one angle to another is to find two parts of the diagram that are similar (in the proper mathematical sense). There are two similar triangles in the diagram. The top triangle and bottom left triangles are both . There are also two triangles.
Both of these use the same angles so let's take our diagram and super-impose it on itself so that the two angles agree. We then have a choice as to how big to make the diagram. We can either align the angles or the angles. The drawing is slightly easier to do if we align the angles so we'll do that. The result is in Figure 4 with the original diagram in black and the overlay in red.
Well, now, there's a thing. The red version of the diagram is drawn so that the and angles coincide, but it turns out that that angles also coincide.
Looking at this diagram, I find myself wanting to repeat the superposition construction.
This gives a sequence of triangles that appear to be converging to the point where the cross line meets the left-hand edge of the original triangle.
Let's try to give a concrete description of this process. Start with a non-equilateral triangle and pick two unequal angles. Construct a scaled copy of the triangle inside with the following properties:
The vertex with the smaller of the chosen angles in the copy is at the vertex with the larger chosen angle in the original.
The edge between the chosen angles on the smaller triangle lies along the edge between the chosen angles on the original.
The triangle is as large as it can be.
Then we iterate the process to produce Figure 7.
It's a straightforward argument to show that the third vertices of each triangle form two families lying along two straight lines which intersect at the convergence point3. Moreover, these two lines meet the edge of the original triangle at the same angle (albeit, one from above and one from below).
3I'm strongly resisting the temptation to write “Banach's fixed point theorem" here. Incidentally, did you know that if you place a map of your city on your dining room table then there is a unique point on the map which is in exactly the right place?
In order to show that this is truly what is going in our original diagram, we need to show that the lines genuinely match. That is, in Figure 4 we aligned the angles and observed that the angles part way down the side seemed to match as well. We need to show that this apparent match is actual. One way to do that is to look at ratios of side lengths. Let's label a few vertices.
We need to show that the ratio is the same as . It will be slightly easier to work with than in which case we'll try to show that:
We'll use the sine rule to get at the various lengths. As the whole question is about angles, the overall size of the diagram doesn't matter so we're free to set the length of one side. As in Figure 9 we'll set the base to .
The sine rule then says that:
Thus the sum we want to examine is:
Thus the apparent coincidence is genuine.
This is now enough to prove that the angle is . The superposition argument says that as the lines and are the two joining lines from the superposition. Since we know that angle , this shows that .
Taking the superposition line then the problem is backwards. In the iteration set-up, one would start with the triangle, apply the construction, and then work out the angle of the new lines which would include the angle going to the third vertex of the original triangle (that's angle in Figure 4). In the iteration, that angle is not known at the outset. So really one should start knowing that the line is from the iteration process and work out angle , rather than knowing angle and working out that the line is from the iteration.
So let's solve this problem in general. We need to generalise our diagram to Figure 10. In this diagram, , , and are specified at the outset as the angles of the triangle. We want to find out angle .
The sine rule then says that:
Since we know that this is the line from the iteration, we know that
Substituting, we have:
Using a fair amount of standard trigonometry, we eventually arrive at:
As a check, let's put and , then:
To see how special and are here, let's examine the steps we used in the above simplification.
We cancelled a with so used .
We used .
These together mean that and so , . So this particular simplification route requires these exact values of , , and .
6 A Polygonal Approach
At this point, I have measured the angle and found it to be , and then found a proof that the angle is which includes a general construction that can be applied to almost any triangle. The construction itself might not be all that exciting, but it does provide a context for the angle. I am extremely doubtful that this is in any way related to the intended solution, though, so while the problem may be solved I don't feel that I'm completely done with it.
Looking for more patterns, I noticed that if the line containing the unknown angles is continued down to the horizontal then it makes an isosceles triangle with base angle as in Figure 11.
My first thought was that there was a hexagon here somewhere. That proves not to be the case, as although the three marked sides in Figure 11 are the same size, the angles are not all . But once one starts thinking about polygons, one starts seeing them everywhere and indeed, there are polygons that can be drawn related to this figure. In particular, the angle at the apex of the outer triangle is the central angle in an –sided polygon. Drawing the triangle adjacent to itself makes two sectors from this polygon. We'll draw them mirrored; the reason for this is that in Figure 3 the line up from the base at “bounces" off the side of the triangle as if in a mirror, so drawing the triangle mirrored next to itself allows us to think of that line continuing straight rather than bouncing. The result is in Figure 12.
Let's simplify that a little and only draw one copy of each line, as in Figure 13. This seems to say that if we draw a line at from one vertex of an –agon and at from the next vertex then they will intercept on a line from the centre to the next vertex after that. The really interesting part is what happens if we continue those lines beyond that intersection point. Both eventually reach other vertices of the –agon: the line reaches the th vertex from its starting point and the line the th from where it began. In Figure 14 we draw the full –agon with just the chords to reveal the structure.
This again looks like quite a coincidence. Experimenting with Geogebra shows that it occurs more often than I would expect. Often enough that it's worth doing a proper search for occurrences to find patterns.
It turns out that there are patterns (the search took me down a considerable side route that was very interesting, but not directly relevant to this problem). The most common pattern occurs when one of the chords makes a angle at the centre, which is exactly what happens here. The other chord has to satisfy properties which boil down to and (see Figure 14 for the relevance of the numbers , , and ).
7 Wot No Trig?
The above methods have been interesting to follow but the resulting proofs of the angles are a little backwards. Neither begins with the angles as given but rather start from some other assumption, in both cases about how the inner lines interact, and works backwards to show that all the angles are as stated.
In lieu of anything better, then, the next strategy is to try to work forwards to calculate the missing angle from the given ones. Although the original problem says “No trig allowed", when just solving the problem then artificially limiting oneself is plain dumb. Of course, it can be insightful to then say “How much of that did I really need?", and that is a very common thing for mathematicians to care about. But it really is only afterwards that they do that.
The other reason that I dislike the “No trig allowed" sign is that it can so easily be circumvented, but at the cost of tediousness. Trigonometry is simply about ratios and so every statement using sines and cosines can be reformulated as a statement about similar triangles or other geometric shapes. So if I had a trigonometric argument, I could unpick it to remove the trigonometry by recasting it in more “elementary" terms.4
4I'm young enough to have taken GCSE maths, and it was the system in my school to cover the GCSE syllabus a year early but not to take the exam until the regular time – the intervening year was spent doing an course called “Additional Maths". The GCSE syllabus had no calculus but the Additional Maths one did. I spent some time in one of my GCSE papers wondering if I was allowed to use calculus to answer a particular question.
So let's ignore the “No trig" sign and merrily proceed. As this is meant to be a “How did I do that?" sort of post with details as to how I got there, I'll admit to spending quite some time with the sine rule trying to get one of the unknown angles in a place where I could calculate it. I was trying to avoid the cosine rule (as it introduces more terms), but I did end up briefly trying to put it in. Rather than detail all the blind alleys I went down, I'll just point to Figure 15 as a picture of my working. Somehow, I don't think that that page is going to win any awards for neatness of working or precision of layout.
Eventually, I decided to work out the gradient of the line which meant finding the coordinates of its endpoints. Again I'll set the base length to which means that length is also , see Figure 16.
We use the sine rule to get length , labelled in Figure 16, as:
With these, we can write down the coordinates of points and .
Hence the gradient is:
We can simplify this as follows:
So clearly and would be useful to compute.
Thus the numerator simplifies to:
For the denominator, we start with:
This isn't quite as nice as the above, but it becomes so when we put in a bit more of the denominator.
Thus the denominator in full is:
Thus the gradient is:
Thus the line is at to the horizontal, and since is at to the horizontal, this gives .
Incidentally, the one step in the above that is truly from “left field" is throwing in a factor of . This is where knowing the answer, and having explored the problem with other strategies, helps: we have to get or in there somewhere so as it wasn't there at the outset we have to find some place to put it in.
It's also another example of a habit of mathematicians of doing nothing in an interesting way. That is, we multiply something by – which accomplishes nothing – but in an interesting way – by writing .
8 Introducing Triangles
The previous section has an intriguing implication. It's not hard to work out from the above calculation that the line has length . A formula of the type occurs when working out the base length of an isosceles triangle with apex angle . So this suggests that the line is the base of an isosceles triangle with apex angle and other side length . If the apex angle is then the base angle is . The angle is , so that means that one of the sides of the isosceles triangle lies along , though is too long itself. So let's mark a new point at the apex of this (upside down) isosceles triangle. This is in Figure 17.
As this is meant to be a write-up of a process rather than a conclusion, I should admit here to spending a bit of time staring at a parallelogram before realising that the right thing to do was something else. If one draws a line through parallel to and marks where that intersects with the line , say , then the resulting shape is a parallelogram. I thought it might be possible to work with the new line instead of , but in the end found an alternative route which didn't involve this new point.
In Figure 17, as well as the angles in the triangle I've marked angle . The reason for doing this is that it is half of angle . As the triangle is isosceles, this means that a circle centred at with radius will pass through both and making a chord of this circle. That angle is half of and they are on the same side of means that is also on this circle, whence in particular the length of is .
And now all angles are calculable. As has length , is an equilateral triangle. Also is an isosceles triangle, meaning that angle is (since we know that is ).
If you'll pardon the pun, the point seems fairly central to the whole problem.
9 Solving the Puzzle
In fact, the point finally provides us with what we want. In all the above, although we've found ways to calculate one, and hence all, of the missing angles, our routes have been backwards. We've found some construction that gives us a line across the triangle that we can compute the angle for and then shown that that line was our original one. With none of the constructions have we been able to start from the original diagram and show that the angle is what it is by “simple" means.
But the addition of the point provides this. So first we'd better “find" it in our original diagram5.
5There's a Calvin and Hobbes cartoon where Calvin uses a treasure map to find some treasure. When Hobbes expresses admiration for the fact that the map worked, Calvin lets on that he buried the treasure in the first place. That is exactly what we're doing here: we're going to “dig up" the point , knowing exactly where it is all along.
A common theme in all of this has been the importance of the angle , but it hasn't always been the in the original diagram that's been the key. Our equilateral triangle gives us another (in fact three of them), so let's use this to “find" . We'll choose the point so that has length , as in Figure 19.
This means that triangle is isosceles and so angle is . Since is , this leaves as . We already know that is so is an isosceles triangle with at the apex. Hence has length .
Futhermore, angle is and triangle is isosceles, whence equilateral. So the line also has length . Drawing a circle centred on therefore passes through , , and .
At this point, we have reconstructed Figure 18 without calculating any of the angles inside triangle . As we do know angle , we can use the fact that is a chord of this circle to see that angle is and proceed from there to calculate all the missing angles.
This is now a direct calculation of the missing angles using only knowledge of isosceles triangles and that the angle subtended6 at the centre of a circle is twice that at the circumference. I deem these facts sufficiently “basic" for this to qualify as a solution to the original puzzle.
6I've never come across this word in any other context.
10 Generalising the Puzzle
Now that we have an acceptable solution, we can see where we went wrong with all our attempts to generalise the puzzle at the start. It is clear from the solution that the key is all the lines of length , and all of those emanate either from or from . So the real starting place of this puzzle is the points and and two circles as in Figure 20.
From this, we create the obvious equilateral triangle which is the kernel of the puzzle. We also need one more point which we'll choose to be the point . This must lie on the circle centred at and be a little further round than , as in Figure 21.
To create the puzzle, we extend the lines and until they meet. The line will intersect the circle centred at and we mark this point , as in Figure 22.
The one choice to make in this is the position of . That is determined by the angle , so let us label that . From that one angle we can calculate all of the others.
Setting yields the original puzzle. Also, insisting that the outer triangle be isosceles forces the original puzzle since it implies that from which we get .
Alternative “nice" puzzles would be with right-angled triangles. There are two for this, one with which is in Figure 24 and one with which is in Figure 25. The solutions for these are and respectively.
The range for is from to . At , the lines and are parallel and so the construction fails.
It is perhaps worth noting that since the range of values for the solution, , is between and (with being not possible), then is the “nicest" solution as the only multiple of that is possible.
11 And Finally …
Having finally “solved" the puzzle, it's time to look up the puzzle. It has a page on Wikipedia. My solution matches the official one, and the generalisation given there is closely related to the polygonal approach that I found7. There's no mention of the superposition approach, so I'm claiming that one!
7which, via searching, led me to the article The Number of Intersection Points made by the Diagonals of a Regular Polygon by Björn Poonen and Michael Rubinstein. This is now on my list of articles to read.
In the end, the point was to solve this problem as a mathematician and not as a puzzler. By doing so, I was able to use the puzzle as a launching point for an exploration of a region of mathematics that I might not have encountered otherwise. I spent quite some time in that exploration, and enjoyed it thoroughly.
So when next you are faced with a mathematical puzzle, look at it as a mathematician, not as a puzzler. Here's my guide to keep you on a mathematical track:
Ignore any artificial constraints.
The problem is yours to use as you like, not someone else's to impose on you.
Get to the answer by any means possible and then work backwards as well as forward.
It's always easiest to travel if you can see both where you started and where you want to end up. It's also easier to explore off the beaten track if you have beacons to find your way home again.
Follow up all the “What if …?" thoughts that occur to you.
Follow up all the things that provoke the reation “Whoa, I didn't expect that!"
Above all, when you find a more elegant path or neater argument, don't feel bad for not spotting it before. If you hadn't put in all the work beforehand you wouldn't appreciate just how elegant the new path actually is.