Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimal hyperparameters and optimal flow design #69

Open
dodreh opened this issue Dec 26, 2024 · 1 comment
Open

Optimal hyperparameters and optimal flow design #69

dodreh opened this issue Dec 26, 2024 · 1 comment

Comments

@dodreh
Copy link

dodreh commented Dec 26, 2024

Hello! Thank you for excellent framework!

For my bachelor thesis I have to adapt SAILOR (Link to project: https://github.com/malicd/sailor). I should implement NF instead of Gaussian Mixture Model.
Short about SAILOR: SAILOR is unsupervised algorithm for domain adoption. It uses GMM to compress big dataset and to train the model on target dataset.

How I used NF in SAILOR:
Architecture of transformation/flow:
Number of flows = 4
Screenshot from 2024-12-26 16-06-13
I am not sure if the number of hidden layers is optimally chosen or not?

number of epochs = 10 000
learning rate = 0.000002

Problem: I have tried to design flows in different ways (flows = 32, 16, 8, 4; changing lr and epochs many times, diff. number of hidden layers, different number of units in hidden layers and so on), but no version gave me good results. During the fitting the model in data I have most of the time case that my loss value is "inf" (or "Nan"). A value for loss starts from 365 and decreasing, and in one moment it starts to print "inf" or "nan". I have also printed loss in diagram (number of epochs = 15 000) and at around 2500 and 3000 epoch the diagram shows two very big spikes, after it continues to decrease.
Question: Is there some standard procedure how to choose hyperparameters (lr, number of epochs) and how to design flows? If no, can you give me advice how to create optimal flows with optimal parameters (lr, epochs)? Is 15 000 epochs too big or I can consider it as normal value for epochs?

Thank you in advance!

@dodreh
Copy link
Author

dodreh commented Dec 26, 2024

P.S. Here are two screenshots of loss diagram:
Screenshot from 2024-12-26 16-31-54
Screenshot from 2024-12-26 16-30-32

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant