Official Codebase for "Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights" (ICML 2026 Spotlight)

Python 53.3%
Jupyter Notebook 40%
Shell 6.7%

Find a file

Repository files (latest commit first)
Filename	Latest commit message	Latest commit date
Yulu Gan 1a838c871c Merge pull request #6 from zhxchd/main Update countdown reward function for baselines		2026-05-20 17:24:54 -04:00
analysis/one_step_gradient	Merge branch 'main' of https://github.com/sunrainyg/RandOpt	2026-05-15 00:08:11 -04:00
baselines	update countdown reward functions for baselines	2026-05-19 16:11:04 -07:00
core	initial commit, basic setup	2026-03-01 11:52:12 -05:00
data	initial commit, basic setup	2026-03-01 11:52:12 -05:00
data_handlers	Allow equal sign for countdown	2026-05-08 15:42:10 -04:00
diffusion	adding diffusion script	2026-03-08 12:24:43 -04:00
distillation	Add distillation code	2026-03-20 03:05:57 +01:00
docker	initial commit, basic setup	2026-03-01 11:52:12 -05:00
scripts	initial commit, basic setup	2026-03-01 11:52:12 -05:00
simple_1D_signals_expts	fixing name bug in 1D expts	2026-03-19 18:37:36 -04:00
utils	Allow equal sign for countdown	2026-05-08 15:42:10 -04:00
.gitignore	initial commit, basic setup	2026-03-01 11:52:12 -05:00
CUSTOM_DATASET_GUIDE.md	add CUSTOM_DATASET_GUIDE.md and Colab demo; update README.md	2026-03-12 00:15:16 -04:00
randopt.py	Fix base model test evaluation.	2026-05-08 15:39:43 -04:00
README.md	Add distillation code	2026-03-20 03:05:57 +01:00
requirements.txt	initial commit, basic setup	2026-03-01 11:52:12 -05:00

README.md

RandOpt

Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights

Yulu Gan, Phillip Isola

Paper | Project Page | Starting with a 1D Experiment:

Requirements

Option1: Python / Conda

(optional) conda activate your_env
pip install -r requirements.txt

Option2: Docker

From the directory containing RandOpt/:

Step	Command
Build	`docker build -f RandOpt/docker/Dockerfile_vllm -t randopt-vllm:latest .`
Run	`docker run -it --gpus all randopt-vllm:latest bash`
Run (with data)	`docker run -it --gpus all -v /path/to/RandOpt/data:/workspace/data randopt-vllm:latest bash`

Run RandOpt

Post-train on your own dataset

Please follow the instructions in CUSTOM_DATASET_GUIDE.md

Post-train on a standard dataset

First download the data here: data/README.md

Then, from the RandOpt directory:

Mode	Command
Single node	`sbatch scripts/single_node.sh`
Multiple nodes	`sbatch scripts/multiple_nodes.sh`
Local (no Slurm)	`bash scripts/local_run.sh`

Distill top-k models into a single model

Please follow the instructions in distillation/README.md.

Run Baselines

Please follow the instructions in baselines/README.md

Citation

@misc{gan2026neuralthickets,
      title={Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights}, 
      author={Yulu Gan and Phillip Isola},
      year={2026},
      eprint={2603.12228},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2603.12228}, 
}