A First Guess is Rarely the Final Answer: Learning to Search in the Travelling Salesperson Problem

arXiv cs.LG / 4/9/2026


Key Points

  • The paper argues that many neural TSP solvers are trained to output only a single tour, even though real workflows typically add test-time sampling and local search on top, which motivates learning the search process itself.
  • It identifies a core limitation in existing learned-improvement methods: they often reuse components designed for single-solution prediction rather than being built around the mechanics and state of local search.
  • The proposed NICO-TSP framework learns a 2-opt improvement policy by representing the current tour with edge tokens aligned to the 2-opt neighborhood, scoring candidate moves directly without tour positional encodings.
  • NICO-TSP is trained in two stages—imitation learning on short-horizon optimal improvement trajectories, followed by critic-free group-based reinforcement learning on longer rollouts.
  • In compute- and wall-clock-matched evaluations, NICO-TSP achieves more consistent and markedly more step-efficient improvements, generalizes better to larger out-of-distribution TSP instances, and can replace classical local search or act as a test-time refinement module for constructive solvers.
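The 2-opt neighborhood the key points refer to is easy to make concrete: a move deletes two edges of the tour and reconnects the endpoints by reversing the segment between them. Below is a minimal illustrative sketch of classical best-improvement 2-opt descent (function names and structure are mine, not the paper's):

```python
import math

def tour_length(tour, coords):
    """Total Euclidean length of a closed tour given as a node order."""
    n = len(tour)
    return sum(math.dist(coords[tour[i]], coords[tour[(i + 1) % n]])
               for i in range(n))

def two_opt_delta(tour, coords, i, j):
    """Change in tour length from removing edges (tour[i], tour[i+1]) and
    (tour[j], tour[j+1]) and reconnecting by reversing the segment between
    them. Negative means the move improves the tour."""
    n = len(tour)
    a, b = tour[i], tour[(i + 1) % n]
    c, d = tour[j], tour[(j + 1) % n]
    return (math.dist(coords[a], coords[c]) + math.dist(coords[b], coords[d])
            - math.dist(coords[a], coords[b]) - math.dist(coords[c], coords[d]))

def apply_two_opt(tour, i, j):
    """Apply the move by reversing the segment tour[i+1 .. j]."""
    return tour[:i + 1] + tour[i + 1:j + 1][::-1] + tour[j + 1:]

def greedy_two_opt(tour, coords, max_steps=100):
    """Best-improvement descent: one 2-opt move per search step, i.e. the
    stepwise improvement trajectory a learned policy would also produce."""
    n = len(tour)
    for _ in range(max_steps):
        best = min(((two_opt_delta(tour, coords, i, j), i, j)
                    for i in range(n - 1) for j in range(i + 1, n)),
                   key=lambda t: t[0])
        if best[0] >= -1e-12:  # no improving move left: local optimum
            break
        tour = apply_two_opt(tour, best[1], best[2])
    return tour
```

In this classical baseline, each step exhaustively scores every candidate move; a learned improvement policy like the one described here instead scores candidates with a network, which is how it can be markedly more step-efficient.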

Abstract

Most neural solvers for the Traveling Salesperson Problem (TSP) are trained to output a single solution, even though practitioners rarely stop there: at test time, they routinely spend extra compute on sampling or post-hoc search. This raises a natural question: can the search procedure itself be learned? Neural improvement methods take this perspective by learning a policy that applies local modifications to a candidate solution, accumulating gains over an improvement trajectory. Yet learned improvement for TSP remains comparatively immature, with existing methods still falling short of robust, scalable performance. We argue that a key reason is design mismatch: many approaches reuse state representations, architectural choices, and training recipes inherited from single-solution methods, rather than being built around the mechanics of local search. This mismatch motivates NICO-TSP (Neural Improvement for Combinatorial Optimization): a 2-opt improvement framework for TSP. NICO-TSP represents the current tour with exactly n edge tokens aligned with the neighborhood operator, scores 2-opt moves directly without tour positional encodings, and trains via a two-stage procedure: imitation learning on short-horizon optimal trajectories, followed by critic-free group-based reinforcement learning over longer rollouts. Under compute-matched evaluations that measure improvement as a function of both search steps and wall-clock time, NICO-TSP delivers consistently stronger and markedly more step-efficient improvement than prior learned and heuristic search baselines, generalizes far more reliably to larger out-of-distribution instances, and serves both as a competitive replacement for classical local search and as a powerful test-time refinement module for constructive solvers.
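The edge-token state the abstract describes can be made concrete with a toy sketch (the learned features are of course richer than raw lengths; names here are illustrative, not the paper's): a tour over n nodes has exactly n edges, so the state is n tokens regardless of where the tour "starts", and each unordered pair of non-adjacent edges corresponds to one candidate 2-opt move.

```python
import math

def edge_tokens(tour, coords):
    """One token per tour edge. Here a token is just
    (endpoint_a, endpoint_b, length) as a stand-in for learned features."""
    n = len(tour)
    return [(tour[i], tour[(i + 1) % n],
             math.dist(coords[tour[i]], coords[tour[(i + 1) % n]]))
            for i in range(n)]

def candidate_moves(n):
    """Index pairs of non-adjacent tour edges; each pair is one 2-opt move.
    Adjacent edges share a node, so swapping them is a no-op. There are
    n*(n-3)/2 such pairs."""
    return [(i, j) for i in range(n) for j in range(i + 1, n)
            if j - i != 1 and not (i == 0 and j == n - 1)]
```

Scoring moves as pairs of edge tokens is what lets the policy act directly on the 2-opt neighborhood of size n(n-3)/2, with no tour positional encodings needed.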