This post is a follow-up to Hillel Wayne’s newsletter issue, You can cheat a test suite with a big enough polynomial [1], which you should definitely read first.

His post touches on a well-known mathematical problem: interpolation, and more specifically polynomial interpolation. It turns out there is a great tool for that: the Lagrange polynomial.

Lagrange interpolation

Given inputs $x_{1}, \dots, x_{n} \in \mathbb{R}$ and outputs $y_{1}, \dots, y_{n} \in \mathbb{R}$, we want to build a polynomial $P$ such that $\forall i \in \{1, \dots, n\}, P(x_{i}) = y_{i}$.

The Lagrange interpolating polynomial is built as follows:

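$$P(x) = \sum_{i=1}^{n} y_{i} \delta_{i}(x) \quad \text{with} \quad \delta_{i}(x) = \begin{cases} 0 & \text{if } x = x_{j},\ j \neq i \\ 1 & \text{if } x = x_{i} \end{cases}$$

This way, it is clear that the property holds:

$$P(x_{i}) = \sum_{j=1}^{n} y_{j} \delta_{j}(x_{i}) = y_{i} \underbrace{\delta_{i}(x_{i})}_{1} + \sum_{j \neq i} y_{j} \underbrace{\delta_{j}(x_{i})}_{0} = y_{i}$$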

Okay, but how do we define $\delta_{i}(\cdot)$ as a polynomial?
Fortunately, we know the roots of this polynomial: every $x_{j}$ with $j \neq i$. And we also know how to normalize it ($\delta_{i}(x_{i}) = 1$). The following meets these two requirements¹:

$$\delta_{i}(x) = \prod_{j \neq i} \frac{x - x_{j}}{x_{i} - x_{j}}$$
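As a minimal sketch of the formula above (not the code used later in this post), here is the one-dimensional construction in Python. The name `lagrange_1d` and the sample points are assumptions made for illustration only.

```python
from typing import Callable, Sequence


def lagrange_1d(xs: Sequence[float], ys: Sequence[float]) -> Callable[[float], float]:
    """Return a function P with P(xs[i]) == ys[i], built from the Lagrange basis.

    Hypothetical helper: it evaluates P pointwise instead of expanding coefficients.
    """
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    if len(set(xs)) != len(xs):
        raise ValueError("the x_i must be pairwise distinct")

    def delta(i: int, x: float) -> float:
        # delta_i(x) = prod_{j != i} (x - x_j) / (x_i - x_j)
        result = 1.0
        for j, xj in enumerate(xs):
            if j != i:
                result *= (x - xj) / (xs[i] - xj)
        return result

    def p(x: float) -> float:
        # P(x) = sum_i y_i * delta_i(x)
        return sum(y * delta(i, x) for i, y in enumerate(ys))

    return p


# Three samples of x^2 (made-up data): the degree-2 interpolant is x^2 itself.
p = lagrange_1d([0.0, 1.0, 2.0], [0.0, 1.0, 4.0])
print(p(1.0))  # 1.0
print(p(3.0))  # 9.0
```

Evaluating pointwise keeps the sketch short; expanding $P$ into coefficients would need polynomial arithmetic on top of this.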

Thus, we can entirely compute our Lagrange interpolating polynomial! But, wait… Hillel’s newsletter was considering functions of multiple variables.

Higher dimensions

Using straightforward Lagrange interpolation

Given input vectors $x_{1}, \dots, x_{n} \in \mathbb{R}^{d}$ and output scalars² $y_{1}, \dots, y_{n} \in \mathbb{R}$, we want to build a polynomial $P$ such that $\forall i \in \{1, \dots, n\}, P(x_{i}) = y_{i}$. Beware, the $x_{i}$ are now vectors, so $P$ is a multivariate polynomial. Using the idea of Lagrange interpolation, we can come up with:

$$P(x) = \sum_{i=1}^{n} y_{i} \delta_{i}(x) = \sum_{i=1}^{n} y_{i} \prod_{j \neq i} \frac{\| x - x_{j} \|}{\| x_{i} - x_{j} \|} \quad \text{with} \quad \delta_{i}(x) = \begin{cases} 0 & \text{if } x = x_{j},\ j \neq i \\ 1 & \text{if } x = x_{i} \end{cases}$$

where $\| \cdot \|$ denotes a norm for vectors.

Simple, right? …right?
The thing is, we no longer have a polynomial: a norm such as the Euclidean one involves a square root, so the factors are not polynomial in the coordinates of $x$. Alright, let’s apply a quick fix and square the norms:

$$P(x) = \sum_{i=1}^{n} y_{i} \prod_{j \neq i} \frac{\| x - x_{j} \|^{2}}{\| x_{i} - x_{j} \|^{2}} = \sum_{i=1}^{n} y_{i} \prod_{j \neq i} \left( \frac{1}{\| x_{i} - x_{j} \|^{2}} \sum_{k=1}^{d} \left( x^{(k)} - x_{j}^{(k)} \right)^{2} \right)$$

where $\| \cdot \|$ denotes the Euclidean norm.

Ta-daaa! We somehow won, but at what cost? Well, this is now a multivariate polynomial, but we lost an important property along the way: it is not of minimal degree. Each of the $n - 1$ factors in a product has degree 2, so $P$ can reach degree $2 (n - 1)$.
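To make the degree issue concrete, here is a small worked case. The two points and the coordinate names $(u, v)$ are assumptions chosen for illustration: take $d = 2$, $n = 2$, $x_{1} = (0, 0)$ and $x_{2} = (1, 0)$, so that $\| x_{1} - x_{2} \|^{2} = 1$. The squared-norm formula gives:

$$P(u, v) = y_{1} \left( (u - 1)^{2} + v^{2} \right) + y_{2} \left( u^{2} + v^{2} \right)$$

One can check that $P(0, 0) = y_{1}$ and $P(1, 0) = y_{2}$, but $P$ has degree 2 unless $y_{1} + y_{2} = 0$. Meanwhile, the degree-1 polynomial $Q(u, v) = y_{1} + (y_{2} - y_{1}) u$ goes through the same two points.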

A more sophisticated Lagrange interpolation

Fortunately, we are not the first ones to work on this problem: see the paper by Kamron Saniee [2] for a cleaner treatment, which I won’t attempt today.

Experimental results

Now, time to code and compute some polynomials!

You can check the source code here. The code itself is ugly as hell, but it does work (and will be forgotten forever after this hopefully).

I used the same set of inputs as in Hillel’s post:

inputs = [(1, 2, 3), (4, 2, 2), (1, 1, 1), (3, 5, 4)]
outputs = [max(g) for g in inputs]
 
p = lagrange(inputs, outputs)
 
print(p.eval(inputs[0]))
# Should be 3, outputs 2.9999999999999254
 
print(p.eval(inputs[1]))
# Should be 4, outputs 3.9999999999998908
 
print(p.eval(inputs[2]))
# Should be 1, outputs 0.9999999999999887
 
print(p.eval(inputs[3]))
# Should be 5, outputs 4.999999999999659
 
print(p)
# This one I'm not writing here

Yay! Apart from floating point errors, we’re good! So, we found a polynomial that agrees with the max function on this 4-test-long test suite.
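As a side check on the floating point remark, the squared-norm formula can also be evaluated with exact rationals. This is only a sketch and not the code linked above: `sq_dist` and `lagrange_sq_norm` are hypothetical names, and the polynomial is evaluated pointwise rather than expanded.

```python
from fractions import Fraction
from typing import Callable, Sequence, Tuple

Point = Tuple[int, ...]


def sq_dist(a: Sequence, b: Sequence) -> Fraction:
    """Squared Euclidean distance, kept exact with Fraction."""
    return sum(((Fraction(ak) - Fraction(bk)) ** 2 for ak, bk in zip(a, b)), Fraction(0))


def lagrange_sq_norm(points: Sequence[Point], values: Sequence[int]) -> Callable[[Sequence], Fraction]:
    """Hypothetical helper: P(x) = sum_i y_i * prod_{j != i} |x - x_j|^2 / |x_i - x_j|^2."""
    if len(set(points)) != len(points):
        raise ValueError("the input points must be pairwise distinct")

    def p(x: Sequence) -> Fraction:
        total = Fraction(0)
        for i, (xi, yi) in enumerate(zip(points, values)):
            term = Fraction(yi)
            for j, xj in enumerate(points):
                if j != i:
                    term *= sq_dist(x, xj) / sq_dist(xi, xj)
            total += term
        return total

    return p


inputs = [(1, 2, 3), (4, 2, 2), (1, 1, 1), (3, 5, 4)]
outputs = [max(g) for g in inputs]
p = lagrange_sq_norm(inputs, outputs)

# On the test suite the match is exact, with no rounding error.
print([p(x) == y for x, y in zip(inputs, outputs)])  # [True, True, True, True]

# Off the test suite the polynomial has nothing to do with max:
# this prints a value of about 32.3, whereas max((0, 0, 0)) is 0.
print(float(p((0, 0, 0))))
```

The value at the origin is the constant term of the expanded polynomial, which is a quick way to see how far the "cheat" is from the real function.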

What is this polynomial, you may ask?

The polynomial:

$P (x, y, z) = 0.008389738340477258 x^{6} + 0.02516921502143177 x^{4} y^{2} - 0.12339965453265947 x^{4} z + 1.0990083807817799 x^{3} y + 17.965412321668477 x^{2} + 2.2095988740323715 x^{2} z^{2} - 0.26042351736933017 x^{2} y^{3} - 0.26042351736933017 x^{2} y z^{2} - 0.24679930906531894 x^{2} z^{3} + 16.31967244578082 x y - 0.11513274902437466 x y^{4} + 1.0566822340221358 x y^{2} z - 0.11513274902437466 x z^{4} + 18.336720619282197 y^{2} - 31.246256797389808 z + 2.288229799756893 y^{2} z^{2} + 16.67558057705841 yz + 0.008389738340477258 y^{6} - 0.12339965453265947 y^{4} z + 0.02516921502143177 y^{2} z^{4} + 1.1105981703026038 y z^{3} - 0.11513274902437466 x^{5} - 0.1302117586846651 x^{4} y - 5.371275030388332 x^{3} - 0.23026549804874932 x^{3} z^{2} + 2.224676604183994 x^{2} y^{2} - 6.79466572836031 x^{2} z + 0.05033843004286354 x^{2} y^{2} z^{2} + 1.1105981703026038 x^{2} yz - 29.6431245601689 x - 6.869129294350971 x z^{2} + 1.0990083807817799 x y^{3} + 1.0990083807817799 x y z^{2} + 1.0566822340221358 x z^{3} - 30.092303755357946 y + 1.1516537649542578 y^{4} - 6.954174397031541 y^{2} z + 1.136576034802636 z^{4} - 0.1302117586846651 y^{5} - 0.26042351736933017 y^{3} z^{2} - 0.24679930906531894 y^{2} z^{3} + 0.008389738340477258 z^{6} + 1.0730228392297358 x^{4} + 0.02516921502143177 x^{4} z^{2} - 0.23026549804874932 x^{3} y^{2} + 1.0566822340221358 x^{3} z - 6.773748320644872 x^{2} y + 0.02516921502143177 x^{2} y^{4} - 0.24679930906531894 x^{2} y^{2} z + 0.02516921502143177 x^{2} z^{4} - 6.889314823107927 x y^{2} + 16.74706928539441 x z - 0.23026549804874932 x y^{2} z^{2} - 4.180942997888811 x yz + 32.30273175100761 + 18.66529844539697 z^{2} - 5.509788241315336 y^{3} - 6.913071460559145 y z^{2} - 5.565139786322052 z^{3} + 0.02516921502143177 y^{4} z^{2} + 1.1105981703026038 y^{3} z - 0.1302117586846651 y z^{4} - 0.12339965453265947 z^{5}$

We are pretty far from minimal degree here. But this just adds more chaos to the gag I guess.

References

[1] Wayne, Hillel. You can cheat a test suite with a big enough polynomial. Newsletter.
[2] Saniee, Kamron (2008). A Simple Expression for Multivariate Lagrange Interpolation. SIAM Undergraduate Research Online, 1. doi:10.1137/08S010025.

Footnotes

¹ Of course, for Lagrange interpolation the $x_{i}$ must be pairwise distinct.
² Here we consider scalar outputs, but vector outputs work just as well: interpolate each component separately.

Raimmy's blog

Cheating test suites and multivariate interpolating polynomial
