Hyperopt sampling#2371

Open
tgiani wants to merge 27 commits into master from sampling

Conversation

@tgiani
Contributor

@tgiani tgiani commented Sep 23, 2025

this PR should enable n3fit to produce a fit using different hyperparameters for each replica, taking as input the results of a hyperopt run. I think we need to

  1. write a script implementing the sampling of the hyperopt trials. This should produce a file containing the hyperparameter settings for each replica
  2. enable n3fit to read the settings for each replica from this file
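The two steps above can be sketched as follows. This is a toy illustration, not the actual script: the trial layout (a list of dicts with a `loss` key) and the function name `sample_trials` are assumptions, not the n3fit schema.

```python
import json
import random

def sample_trials(trials, n_replicas, n_best=10, thermalization=0, seed=0):
    """Keep the n_best lowest-loss trials (after dropping the first
    `thermalization` trials of the scan) and assign one of them to each
    replica at random. Trial layout here is illustrative."""
    rng = random.Random(seed)
    best = sorted(trials[thermalization:], key=lambda t: t["loss"])[:n_best]
    # one (possibly repeated) best trial per replica, keyed by replica index
    return {rep: rng.choice(best) for rep in range(1, n_replicas + 1)}

# toy scan: 10 trials with decreasing loss
fake_trials = [{"loss": float(10 - i), "learning_rate": 1e-3 * (i + 1)} for i in range(10)]
per_replica = sample_trials(fake_trials, n_replicas=4, n_best=3, thermalization=2)
# this per-replica mapping is what the file read back by n3fit would contain
print(json.dumps(per_replica, indent=2))
```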

@tgiani tgiani changed the title reading different hyperparameters for each replica Hyperopt sampling Sep 23, 2025
```python
# different samples
else:
    with open(params['hyperopt_res'], 'r') as file:
        hyperopt_params = json.load(file)
```
Member

Suggested change

```diff
-        hyperopt_params = json.load(file)
+else:
+    hyperopt_params = [params]
```

@tgiani
Contributor Author

tgiani commented Feb 26, 2026

This is a possible way to select and use the best trials: one should add to the runcard

```yaml
trial_specs:
  hyperscan: 260204-jcm-hyperopt
  thermalization: 400
  number_of_trials: 10
```

which will download the full 260204-jcm-hyperopt hyperscan and select the 10 best trials, after dropping the first 400 for thermalization.
Using this info, the production rule produce_trials will create a dictionary containing the settings of the best trials, which will then be used in the fit. The results discussed so far for the nnpdf4.1 test have been produced using thermalization: 400 and number_of_trials: 10, which should then be used in the baseline runcard.

If trial_specs is not given in the runcard, the fit will use the settings specified in the runcard under parameters.

@tgiani tgiani marked this pull request as ready for review February 26, 2026 12:19
@scarlehoff
Member

I like this approach a lot. I had a quick look and seems fine to me. I'll try to have a deeper look later and then we can merge.

@scarlehoff
Member

We should add a test, at least to the regressions, for this. Same runcard, two replicas. Something like that.

And perhaps it would be wise to append the hyperparameters of the fit to the .json file at the end, so we always know the parameters of each replica.

```python
n_best = trial_specs['number_of_trials']
best = hyperopt_dataframe[n_termalization:].sort_values('loss')[:n_best].to_dict(orient='list')
best['number_of_trials'] = n_best
return best
```
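The selection above can be exercised on a toy dataframe. The columns besides `loss` are illustrative, not the actual hyperopt scan schema:

```python
import pandas as pd

# toy stand-in for the hyperopt dataframe: one row per trial
hyperopt_dataframe = pd.DataFrame(
    {
        "loss": [3.2, 1.1, 2.5, 0.9, 4.0, 1.7],
        "learning_rate": [1e-2, 1e-3, 5e-3, 1e-3, 2e-2, 3e-3],
    }
)

n_thermalization = 2  # drop the first trials of the scan
n_best = 2            # keep only the lowest-loss trials

# positional slice drops the thermalization trials, then sort by loss
best = (
    hyperopt_dataframe[n_thermalization:]
    .sort_values("loss")[:n_best]
    .to_dict(orient="list")
)
best["number_of_trials"] = n_best
# best now holds column-wise lists for the two lowest-loss surviving trials
```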
Member

Since this is part of the config, it should use the loader (or the fallbackloader)

I'm unsure whether this should be here or in the validphys config though.

The other problem I see is that this is not seen by setupfit so if you send many jobs in parallel all of them will try to download the same thing to the same place, which can lead to a very bad crash (just happened to me in the cluster 😅)
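One standard way to guard against that race (a sketch only, not the validphys loader machinery; `fetch` is an assumed callable that writes the target file) is an atomic lock file: the first job to create the lock performs the download, and the others wait for the result to appear.

```python
import os
import time

def download_once(target_path, fetch, poll=0.1):
    """Ensure only one of several parallel jobs downloads `target_path`.

    Hypothetical helper: os.O_CREAT | os.O_EXCL makes lock creation
    atomic, so exactly one process wins and calls `fetch`; the rest
    poll until the downloaded file shows up."""
    lock = target_path + ".lock"
    try:
        fd = os.open(lock, os.O_CREAT | os.O_EXCL | os.O_WRONLY)
    except FileExistsError:
        # another job holds the lock; wait for the result instead
        while not os.path.exists(target_path):
            time.sleep(poll)
        return target_path
    try:
        if not os.path.exists(target_path):
            fetch(target_path)
    finally:
        os.close(fd)
        os.remove(lock)
    return target_path
```

Calling it twice with the same path runs the download only once; the second call sees the existing file and returns immediately.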

Contributor Author

@scarlehoff does the last commit solve this issue?

Contributor Author

@tgiani tgiani Mar 4, 2026

Another, maybe better, option could be to do everything in the vp config: create the best trials there and save a json file with them in the table folder or something. n3fit would then read it, just like we do with stuff like the thcovmat, I guess. But how do I access the output folders from the config to save the json file...?

Member

I guess since you went for the option of downloading a scan, I think it is fine not to save the json file (since the fit would be in the server anyway).
But then you need to make sure the parameters are in the json of each replica.
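A minimal sketch of recording the hyperparameters in each replica's json, assuming a hypothetical `fit_info.json` inside each replica folder (the filename, the `hyperparameters` key, and the `nnfit/replica_<n>` layout are illustrative, not the actual n3fit output schema):

```python
import json
from pathlib import Path

def record_replica_params(fit_folder, replica_index, hyperparams):
    """Write the hyperparameters actually used by a replica into that
    replica's json output, so the settings of every replica are always
    recoverable from the fit folder itself."""
    replica_dir = Path(fit_folder) / "nnfit" / f"replica_{replica_index}"
    replica_dir.mkdir(parents=True, exist_ok=True)
    json_file = replica_dir / "fit_info.json"  # illustrative filename
    # merge into the existing per-replica json rather than overwrite it
    info = json.loads(json_file.read_text()) if json_file.exists() else {}
    info["hyperparameters"] = hyperparams
    json_file.write_text(json.dumps(info, indent=2))
    return json_file
```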

@tgiani
Contributor Author

tgiani commented Mar 3, 2026

@scarlehoff thank you, I will have a go at your comments this afternoon

@tgiani tgiani added the redo-regressions Recompute the regression data label Mar 5, 2026
