docker network connect
to join the Zeppelin container to the Hive network. I then created a new Zeppelin JDBC interpreter that connects to the Hive instance started with docker-compose.
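For anyone following along, the network step can be sketched roughly as below. `docker network connect` takes the network name first and the container second. The helper name `join_network` and both arguments are placeholders for illustration, not the names used in this setup.

```shell
# Hypothetical helper: attach an already running container to an existing
# Docker network. Both arguments are placeholders, not names from this thread.
join_network() {
  network="$1"    # e.g. the network created by the Hive docker-compose project
  container="$2"  # e.g. the running Zeppelin container
  docker network connect "$network" "$container"
}
```

It would be called as `join_network <hive_network> <zeppelin_container>`. `docker network ls` shows the available network names; docker-compose usually names its default network `<project>_default`.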
@Atlantic777 the problem is indeed with the setup.
Right now Hive is deployed without an external MapReduce engine (i.e. no YARN is running). When Hive is not configured to use an external engine, it falls back to its internal local mode: https://cwiki.apache.org/confluence/display/Hive/GettingStarted#GettingStarted-Hive,Map-ReduceandLocal-Mode
From the docs: "local mode only runs with one reducer and can be very slow processing larger data sets."
So, to run Hive in Docker with YARN, we need to extend the docker-compose file to look like the one here: https://github.com/big-data-europe/docker-hadoop/tree/2.0.0-hadoop2.7.4-java8
And then configure Hive to point to YARN.
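As a rough sketch of what "pointing Hive to YARN" could look like for a single query, the two standard Hadoop properties `mapreduce.framework.name` and `yarn.resourcemanager.hostname` can be passed with `--hiveconf`. The helper name `hive_on_yarn` and the resourcemanager hostname are assumptions for illustration. A permanent setup would more likely put these properties into the Hadoop configuration files of the Hive container instead.

```shell
# Hypothetical helper: run one Hive statement with MapReduce jobs submitted
# to YARN instead of Hive's local mode.
hive_on_yarn() {
  rm_host="$1"   # hostname of the YARN resourcemanager service (assumed)
  query="$2"     # HiveQL statement to execute
  hive \
    --hiveconf mapreduce.framework.name=yarn \
    --hiveconf yarn.resourcemanager.hostname="$rm_host" \
    -e "$query"
}
```

For example, `hive_on_yarn resourcemanager "SELECT COUNT(*) FROM some_table"`, where `resourcemanager` and `some_table` are placeholder names. The Hive container must be able to resolve and reach that host for this to work.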
Hi @waterponey,
I have played around with Spark/YARN integration and have a working setup (not pushed to the BDE repos). It would be helpful if you could describe your use case, i.e. why you need YARN/Mesos when deploying Spark in Docker. In my case, I needed YARN for the history manager and that's pretty much it.
Datanode denied communication with namenode because hostname cannot be resolved (ip=10.0.0.10, hostname=10.0.0.10): DatanodeRegistration(0.0.0.0:50010, datanodeUuid=d64d014a-4467-4065-95e6-596590148f75, infoPort=50075, infoSecurePort=0, ipcPort=50020, storageInfo=lv=-56;cid=CID-ab7b488d-d3c3-470d-aea0-c6e6ac6708b9;nsid=2064164277;c=0)