EC2EZ4RD
@EC2EZ4RD
@AndKram Hi, I've run into a problem: after training my agent for about 1-2 hours, base_env_builder reports that the mission ended with out_of_time, and then it fails to find a client for the next mission.
EC2EZ4RD
@EC2EZ4RD

something like this

INFO:marlo.base_env_builder:Agent missed 5 observation(s).
INFO:marlo.base_env_builder:Agent missed 1 observation(s).
INFO:marlo.base_env_builder:Agent missed 3 observation(s).
INFO:marlo.base_env_builder:Mission ended: out_of_time
INFO:marlo.base_env_builder:Mission ended: out_of_time
WARNING:marlo.base_env_builder:Error on attempting to start mission : There are not enough clients available in the ClientPool to start this 2 agent mission.
WARNING:marlo.base_env_builder:Will attempt again after 3 seconds.
WARNING:marlo.base_env_builder:Error on attempting to start mission : Failed to find an available client for this mission - tried all the clients in the supplied client pool.
WARNING:marlo.base_env_builder:Will attempt again after 3 seconds.

But I shouldn't be running out of time: during training, missions normally end with command_quota_reached.

Minecraft seems a little unstable, because I also trained with completely different code on a different computer and hit the same problem.
EC2EZ4RD
@EC2EZ4RD
I just observed the situation again: both agents simply stand still. There may be a fault in my code, something like a deadlock. Sorry to bother you.
Andre Kramer
@AndKram
@EC2EZ4RD It's possible that one of the Minecraft instances has hung, which would cause the mission to run out of time. Could you open an issue on GitHub malmo and attach the logs from both Minecraft instances (Minecraft/run/logs/latest.log), please? It's best to use two directories for Minecraft (copy the Minecraft directory to another location), and there are the kill_clients_after_num_rounds and kill_clients_retry env init arguments, which can help with long-running training.
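A minimal sketch of how the two init arguments named above might be collected into a params dict. It assumes a two-client setup on localhost; `build_params` is a hypothetical helper, the numeric values are placeholders rather than recommended settings, and the commented `marlo.make` call and environment name are assumptions about how the dict would be consumed.

```python
# Sketch only: build_params is a hypothetical helper, and the numeric values
# are placeholders rather than recommended settings.

def build_params(ports=(10000, 10001), kill_after=100, kill_retry=10):
    """Return a params dict for a multi-agent marLo environment."""
    # One (host, port) entry per Minecraft client; each client is assumed to
    # have been launched from its own copy of the Minecraft directory.
    client_pool = [("127.0.0.1", port) for port in ports]
    return {
        "client_pool": client_pool,
        "agent_names": ["MarLo-Agent-%d" % i for i in range(len(ports))],
        "kill_clients_after_num_rounds": kill_after,
        "kill_clients_retry": kill_retry,
    }


params = build_params()
# Assumed usage (needs marLo installed and both Minecraft clients running):
#   import marlo
#   join_tokens = marlo.make("MarLo-FindTheGoal-v0", params=params)
```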
Jumpei Hirono
@jhirono
Andre Kramer
@AndKram
@jhirono Yes, marLo GitHub contains the competition (https://www.crowdai.org/challenges/marlo-2018) multi-agent environments and you can run them with the test_random_3game_mission.py but they are documented on https://github.com/crowdAI/marlo-multi-agent-starter-kit/ and https://marlo.readthedocs.io/en/latest/available_envs.html (last 3 are the environments used for round 2).
@EC2EZ4RD I should also have mentioned that the restarts and possibly network errors can lead to very long periods of retries so it's always best to wait for a long time (3 to 5 mins) to see if mission start up times out or retries recover.
EC2EZ4RD
@EC2EZ4RD
@AndKram Thanks! I have addressed this problem.
Lennart
@biggzlar
@EC2EZ4RD hi there, mind linking the issue on github? I'm having the same problem - at some point the client just does not reset the mission.
Lennart
@biggzlar
@AndKram Hi there, I have a similar problem to EC2EZ4RD's. I commonly get errors on attempted mission starts even after running a single episode (sometimes it takes a while). The client tends to be in the respawn screen when this happens - in case that is any help.
Andre Kramer
@AndKram
@biggzlar Please feel free to create an issue. Copy the Minecraft dir and run the second Minecraft from that and attach the logs to the GitHub issue for both of the launchClient.{sh,bat} runs. Resets can take a while and print out errors while retrying but should eventually start a new mission (as long as there are two parallel calls to reset).
Lennart
@biggzlar
@AndKram thanks will do.
Matthew Lee
@mattthelee
Hi. I've been running the find the goal v0 mission in Marlo. I keep finding that the environment does not give the assigned reward (0.5) back when an action finishes the game. It seems to depend on when I run env.step afterwards. I've also noticed a lag between taking an action and the position returned by info. Is this related?
Matthew Lee
@mattthelee
nvm, found the solution is to put in a value for the step_sleep param to get Marlo to wait for Malmo
Shunta Saito
@mitmul
Hi, I'm running Malmo with the docker run command described here:
https://github.com/Microsoft/malmo/blob/master/scripts/python-wheel/README.md#using-the-prebuilt-docker-image
and I want to communicate with the Minecraft server from outside the docker container using Marlo (https://github.com/crowdAI/marLo). I opened port 10000 for the communication, but the agent doesn't seem to be able to reach the Minecraft server in the docker container... Do I need to open another port? How can I find out which ports need to be opened?
Andre Kramer
@AndKram
@mattthelee Rewards should be reported at mission end as they are sent with the mission end message but location and rewards during the mission are reported asynchronously. I think it would be an error to get any rewards after marlo env reports done == True but not an error to get a reward at some step after reaching a location.
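The point about the final reward arriving with the mission-end message can be illustrated with a small episode loop. This is a sketch under assumptions: `FakeEnv` is a hypothetical stand-in for a gym-style environment (it is not marLo), and it simply delivers its only reward on the step that sets done.

```python
class FakeEnv:
    """Hypothetical gym-style env: the only reward arrives together with done."""

    def __init__(self, steps=3, final_reward=0.5):
        self.steps = steps
        self.final_reward = final_reward
        self.t = 0

    def reset(self):
        self.t = 0
        return None

    def step(self, action):
        self.t += 1
        done = self.t >= self.steps
        reward = self.final_reward if done else 0.0
        return None, reward, done, {}


def run_episode(env, policy):
    """Step until done, counting the reward returned on the final step too."""
    obs = env.reset()
    total, done = 0.0, False
    while not done:
        obs, reward, done, info = env.step(policy(obs))
        total += reward  # add before checking done, so the last reward is kept
    return total


total = run_episode(FakeEnv(), lambda obs: 0)
```

The loop stops stepping as soon as done is True, which matches the expectation that no rewards should arrive after that point.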
Andre Kramer
@AndKram
@mitmul Malmo opens dynamic ports to talk back to the agent host from Minecraft (client) after the initial connection (here on port 10000), and unfortunately on Windows docker's --net=host mode does not automatically open these. Malmo (not marlo) allows you to specify the port number to use for the "command port", which is used to send commands during the mission, though you would need to modify marlo to set that port so that it is not chosen dynamically. But there are other connections back to the agent host for video, rewards and observations which are harder to set up (in the mission XML), so it may be simpler to try to open a range for dynamically allocated ports. The port range is 10000 to 11000, so you could look at opening all these ports. Longer term, there is a new implementation of malmo for gym environments that only uses a single connection (https://github.com/Microsoft/malmo/tree/master/MalmoEnv).
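When debugging which ports in that range are actually reachable, a plain TCP probe can help. The sketch below uses only the Python standard library; `open_ports` is a hypothetical helper, and it only reports ports that currently accept a connection. It does not predict which ports Malmo will allocate.

```python
import socket


def open_ports(host, ports, timeout=0.2):
    """Return the ports in `ports` that accept a TCP connection on `host`."""
    reachable = []
    for port in ports:
        with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
            sock.settimeout(timeout)
            # connect_ex returns 0 on success instead of raising an exception
            if sock.connect_ex((host, port)) == 0:
                reachable.append(port)
    return reachable


# Example call for the range mentioned above (can be slow if ports are filtered):
#   open_ports("127.0.0.1", range(10000, 11001))
```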
Lennart
@biggzlar
Hi there. I have a question about the availability of object labels. Microsoft/malmo#66 states that object labels were integrated in 0.31.0, but I can’t find any documentation on that.
In Microsoft/malmo#506 the poster is advised to use ObservationFromRay, an XML tag. Can anyone tell me how to use it, or alternatively how to use the object labels apparently integrated in 0.31.0?
Lennart
@biggzlar
To add to my question, is there a method in place to translate world coordinates to screen coordinates?
Jewel James
@jeweljames

hey @AndKram - I have observed that after 123 back-to-back episodes, the agents run into
INFO:marlo.base_env_builder:Mission ended: MALMO_AGENT_NOT_RESPONDING
and then enter a loop of
Generating join tokens locally...
INFO:marlo.base_env_builder:params.gameMode : Cannot force survival mode.
INFO:marlo.base_env_builder:params.gameMode : Cannot force survival mode.
INFO:marlo.base_env_builder:params.gameMode : Cannot force survival mode.
INFO:marlo.base_env_builder:Waiting for mission to start...
WARNING:marlo.base_env_builder:Error on attempting to start mission : Failed to find the server for this mission - you must start the agent that has role 0 first.
WARNING:marlo.base_env_builder:Will attempt again after 3 seconds.

from which there is no recovery. In between, it prints other log lines like
Pause for restarts ....
Stopping ClientInfo: 127.0.0.1:10001:0 ...
Stopping ClientInfo: 127.0.0.1:10000:0 ...
Restart Malmo Minecraft clients for experiment:<experiment_id>

Has this issue been observed before?

Jumpei Hirono
@jhirono
IMG_0135.JPG
Just sharing our malmo hands-on event pic. ;)
Andre Kramer
@AndKram
@jeweljames What OS are you on? There is a periodic restart of Minecraft as well as retries that should help if Minecraft becomes unstable or crashes. One issue on Windows is running both clients in the same Minecraft directory; copying the complete Minecraft dir (containing launchClient.bat) and running the second Minecraft client from the copy will help. If it happens again, could you attach the logs (in run/logs - most likely just latest.log) and, if Minecraft appears stuck, run a jstack command on the gradle process (you can find it using the jps command), please? One other thing: some timeouts on multi-agent missions are over 90 seconds, so it's best to wait ~3 mins to see if it recovers. Thanks, Andre.
@jhirono Great to see! Thanks for sharing.
Lennart
@biggzlar

Was trying to run launchClient.bat on Windows 10 and encountered this error. Any way to fix it?

Where:
Build file 'C:...\Desktop\malmo-master\Minecraft\build.gradle' line: 13

What went wrong:
An exception occurred applying plugin request [id: 'net.minecraftforge.gradle.forge', version: '2.0.1']
Failed to apply plugin [id 'net.minecraftforge.gradle.forge']
Unable to obtain url ([http://files.minecraftforge.net/maven/net/minecraftforge/forge/json]) with etag!

Jewel James
@jeweljames

@AndKram Thanks for answering.
I'm observing this on Ubuntu. I'm already running the Minecraft instances from two folders.
I'm using the following params:

params = {
    "client_pool": client_pool,
    "agent_names": [
        "MarLo-Agent-0",
        "MarLo-Agent-1"],
    "step_sleep": 0.01,
    "kill_clients_after_num_rounds": 100,
    "kill_clients_retry": 10
}

This is reproducible, as it happens around the same episode number each time. I will attach the log.

Jewel James
@jeweljames
@AndKram - you can find the logs here - https://gist.github.com/jeweljames/788a8517ea76b02fa8a40970ee934398
I have attached the logs from the agents, the two Minecraft instances (latest_m1.log and latest_m2.log) and two jstack logs.
ProQianXiao
@ProQianXiao
@AndKram @spMohanty Hello, is there any way to determine which mission is running?
Andre Kramer
@AndKram
@ProQianXiao My post from Nov 6th should provide a way to check which of the three games is being played. (Hope I understood the question correctly).
ProQianXiao
@ProQianXiao
@AndKram I tried that, but I got some errors:
Andre Kramer
@AndKram
@jeweljames Both Minecrafts seem to be running a mission while one agent host (called from Python) is waiting for the second to turn up and the second agent host is saying that all ports are in use. That would indicate that previous missions are not finishing. Are you calling reset() before the step() function returns "done"? I've opened issue #784 on GitHub malmo for this but have not seen this failure myself.
ProQianXiao
@ProQianXiao
QQ截图20181217163939.png
I added the code at the beginning of the 'run_agent' function. Is that right?
QQ截图20181217164050.png
Jewel James
@jeweljames
@AndKram I'm sometimes calling reset() before the step function returns "done", but only after calling close() on that env first.
Is that a problem?
Andre Kramer
@AndKram
@jeweljames Better not to call close, and to re-use the env with reset(). It looks like there is a port leak when closing while the mission is still running? Let's take this to #784.
@ProQianXiao etree is not defined. I think we had this in MarLo as well: from lxml import etree.
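The advice to re-use one env across episodes, calling reset() only once the previous episode has reported done, can be sketched as follows. `CountingEnv` is a hypothetical stand-in that only records lifecycle calls; it is not the marLo environment, and the episode length is arbitrary.

```python
class CountingEnv:
    """Hypothetical gym-style env that records reset/close calls."""

    def __init__(self, steps_per_episode=2):
        self.steps_per_episode = steps_per_episode
        self.calls = []
        self._t = 0

    def reset(self):
        self.calls.append("reset")
        self._t = 0
        return None

    def step(self, action):
        self._t += 1
        done = self._t >= self.steps_per_episode
        return None, 0.0, done, {}

    def close(self):
        self.calls.append("close")


def run_episodes(env, num_episodes):
    for _ in range(num_episodes):
        env.reset()          # same env object for every episode
        done = False
        while not done:      # keep stepping until the env reports done
            _, _, done, _ = env.step(0)
    env.close()              # close once, after the final episode


env = CountingEnv()
run_episodes(env, 3)
```

After the run, `env.calls` holds three resets followed by a single close, with no close between episodes.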
ProQianXiao
@ProQianXiao
@AndKram Thanks, that helps. I also want to know whether we can use this method to determine the running mission during evaluation.
andrekramer
@andrekramer
@ProQianXiao (Andre on non-work account:) I think so but have not tried by submitting a test.
ProQianXiao
@ProQianXiao
@andrekramer Thank you, and I will have a try.
iZuno-K
@iZuno-K

@andrekramer @AndKram I got errors during the auto-evaluation phase:
“Unable to orchestrate evaluation. Core services failed to start…”
and after 1 or 2 episodes:
“Unable to orchestrate submission, please contact Administrators.”
The submitted programs passed test_submission_locally.sh without errors.
(Issue url:
https://gitlab.crowdai.org/karino_isi/marlo2018/issues/8
https://gitlab.crowdai.org/karino_isi/marlo2018/issues/5
)

Could you check the cause and tell me how to fix it?
My CrowdAI account is karino_isi.

Lennart
@biggzlar
One more question: suppose I have several different models - is there a way to access the gametype during evaluation?
SP Mohanty
@spMohanty
@iZuno-K: We are investigating the error and will get back to you soon.
SP Mohanty
@spMohanty
I just tried to make a new submission with the starter kit, and it went through completely fine.
So there's a chance that this is caused by an issue in the submitted code itself.
iZuno-K
@iZuno-K

@spMohanty Thank you for investigating.

I’ve passed the local test following crowdAI’s instructions (https://github.com/crowdAI/marlo-multi-agent-starter-kit). I tested 1 game with 2 agents and 3 different games with 6 agents. Both tests succeeded without errors.

How can I check whether my submitted code has any issues?

SP Mohanty
@spMohanty
@iZuno-K: Did you test it for multiple episodes locally?
iZuno-K
@iZuno-K
@spMohanty I tested multiple episodes today. My code raised RuntimeError: Attempted to use a closed Session. The reason is that I called tf.Session().close() at the end of run_episode().
I then modified run.py by moving the tf.Session().close() line to the end of the main function. After that, I ran a 15-episode local test and it succeeded.
I submitted the modified code but still receive Unable to orchestrate evaluation. Core services failed to start....
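The session fix described above (closing once at the end of main rather than inside run_episode) can be sketched without TensorFlow. The `Session` class here is a hypothetical stand-in that only mimics the closed-session error; the structure of `main` is an assumption about how run.py might be organised, and this sketch says nothing about the orchestration error.

```python
class Session:
    """Hypothetical stand-in for a session object such as tf.Session."""

    def __init__(self):
        self.closed = False

    def run(self, value):
        if self.closed:
            raise RuntimeError("Attempted to use a closed Session.")
        return value

    def close(self):
        self.closed = True


def run_episode(sess, episode):
    # No sess.close() here: the session must outlive a single episode.
    return sess.run(episode)


def main(num_episodes):
    sess = Session()
    try:
        return [run_episode(sess, i) for i in range(num_episodes)]
    finally:
        sess.close()  # closed exactly once, after all episodes


results = main(3)
```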