New AI Tackles Math Problems That Take Millions of Steps


Researchers have developed an artificially intelligent system that does the exact opposite of living in the moment. But it doesn’t just think a few steps ahead—it thinks millions of steps ahead.

A team led by mathematician Sergei Gukov from the California Institute of Technology (Caltech) has created a new type of machine-learning algorithm designed to solve math problems that necessitate an extremely long series of steps. Like a really long series of steps; we’re talking a million steps or more.

Specifically, the AI was able to make progress on a complex problem called the Andrews–Curtis conjecture, which has stumped mathematicians for decades. The conjecture basically asks: Can certain math puzzles always be solved using a set of allowed moves, like rearranging or undoing steps?

To that end, the new Caltech program sought to “find long sequences of steps that are rare and hard to find,” Ali Shehper, first author of the study and a mathematician at Rutgers University, said in a Caltech statement. “It’s like trying to find your way through a maze the size of Earth. These are very long paths that you have to test out, and there’s only one path that works.”

In a preprint study posted on arXiv last August and updated on Tuesday, Shehper and his colleagues detail how they used their newly developed AI to solve families of problems related to the Andrews–Curtis conjecture, which involves abstract algebra. To be clear, they didn’t solve the conjecture itself. While that might seem anticlimactic, the researchers did disprove ongoing potential counterexamples to the conjecture. While disproving counterexamples doesn’t necessarily make the original conjecture true, it does bolster it.

“Ruling out some of the counterexamples gives us confidence in the validity of the original conjecture and helps build our intuition about the main problem,” Shehper explained. “It gives us new ways to think about it.” Gukov compared the math problems to the Rubik’s Cube.

“Can you take this scrambled, complicated Rubik’s Cube and get it back to its original state? You have to test out these very long sequences of moves, and you won’t know if you are on the right path until the very end,” he explained.

So how does the AI do it? Basically, by thinking outside of the box. Following a reinforcement learning approach, the researchers trained the AI by first feeding it easy math problems followed by increasingly difficult tasks. “It tries various moves and gets rewarded for solving the problems,” said Shehper. “We encourage the program to do more of the same while still keeping some level of curiosity. In the end, it develops new strategies that are better than what humans can do. That’s the magic of reinforcement learning.”

The algorithm ultimately learned to generate long sequences of unexpected moves, which the researchers termed “super moves.” In contrast, ChatGPT’s output is much more boring.

“If you ask ChatGPT to write a letter, it will come up with something typical. It’s unlikely to come up with anything unique and highly original. It’s a good parrot,” said Gukov. “Our program is good at coming up with outliers.”

I can think of at least one outlier event that would be really convenient for an AI to predict: financial crashes. But while current machine learning programs haven’t achieved this level of prognostic sophistication, the researchers speculate that their methods could one day contribute to that sort of intelligent forecasting.

“Basically, our program knows how to learn to learn,” Gukov explained. “It’s thinking outside the box.” He added that the team had made significant “improvements in an area of math that was decades old.” What’s more, Gukov and his colleagues have prioritized approaches that do not need large amounts of computing power, making their work accessible to other academics with small-scale computers.

Though the practical applications of this achievement might not be evident in our day-to-day lives, their work joins a host of other researchers optimizing machine-learning algorithms to solve humanity’s problems (not to destroy our civilization).


Leave a Reply

Your email address will not be published. Required fields are marked *