Hello Everyone! Sorry for bothering you. NNI Team hopes to dig deeper about our user’s usage and scenario, as well as collecting feedback about NNI, to assess customer needs and provide effective solutions.
If any user of our Gitter Chat Group would like to join our interview, please send a message to Gitter Chat. We really appreciate your support. Thank you so much!
Hello!When creating a job on kubeflow, I met this error
command: python3 mnist.py
And then start the experiment
my nni Environment:
nni version: latest
python version: 3.6
is conda or virtualenv used?: NO
k8s version: 1.18.2
kubeflow version 1.0.1
my k8s cluster with 2 nodes (IP 10.50.200.190 as master and IP 10.50.200.200 as slave)
nfs server also running on 10.50.200.200
Hi Just realizing this Gitter. Forgot about it.
Please refer to by stack overflow question here https://stackoverflow.com/questions/62403788/multiple-host-multiple-gpu-trials
Basically, I am wondering, If
We are experimenting with NNI, specifically hyperparameter tuning, and running into one issue. I would appreciate any help.
We are running hyperparameter tuning with
--foreground flag as we want it not to shut down before all the trials completed. However, with the
--foreground flag set it never gets terminated. The log says "Experiment done", but the process never gets terminated.
Any help will be appreciated.