This channel is rarely used. For other channels to contact the dask community, please see https://docs.dask.org/en/stable/support.html
User "system:serviceaccount:nublado-athornton:dask" cannot get resource "pods" in API group "" in the namespace "nublado-athornton"
but I have what look like the right rules in my role:
rules:
- apiGroups:
  - ""
  resources:
  - pods
  verbs:
  - list
  - create
  - delete
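Worth noting: the error is about the `get` verb, while the rules quoted above only grant `list`, `create`, and `delete`. A minimal sketch of that check in plain Python follows. The `allows` and `_matches` helpers are hypothetical (not part of Kubernetes or dask), and they simplify real RBAC evaluation by ignoring `resourceNames` and subresources.

```python
# Rules as quoted in the message above, hand-copied into Python data.
rules = [
    {
        "apiGroups": [""],
        "resources": ["pods"],
        "verbs": ["list", "create", "delete"],
    }
]


def _matches(granted, wanted):
    """True if `wanted` is listed explicitly or covered by the "*" wildcard."""
    return "*" in granted or wanted in granted


def allows(rules, api_group, resource, verb):
    """Return True if any single rule grants `verb` on `resource` in `api_group`.

    Hypothetical helper: a simplification of RBAC matching, for illustration only.
    """
    for rule in rules:
        if (
            _matches(rule.get("apiGroups", []), api_group)
            and _matches(rule.get("resources", []), resource)
            and _matches(rule.get("verbs", []), verb)
        ):
            return True
    return False


for verb in ("get", "list", "create", "delete"):
    print(verb, allows(rules, "", "pods", verb))
```

If this reading is right, adding `get` to the `verbs` list of the role would be the first thing to try.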
n_jobs=1 in the RandomForestRegressor() constructor, but still end up with some of my dask workers using 2000% CPU, which is looking really weird.
pyarrow
)
The last time I looked, pandas loaded everything. It would be reasonable to implement that iteratively, and fastparquet does have a specific method to do that.
Is there a way to do that query without knowing that row-group 1 is where you want to look?
Parquet optionally stores each column's max and min values for each row-group, so maybe
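To illustrate how per-row-group min/max values can answer the question above, here is a rough sketch in plain Python. The `row_group_stats` data and the `candidate_row_groups` helper are made up for illustration; in a real file the statistics would come from the Parquet footer metadata, and only if the writer stored them.

```python
# Hypothetical min/max statistics for one column, one entry per row-group.
row_group_stats = [
    {"min": 0, "max": 99},
    {"min": 100, "max": 199},
    {"min": None, "max": None},  # statistics were not written for this row-group
    {"min": 300, "max": 399},
]


def candidate_row_groups(stats, value):
    """Return indices of row-groups that could contain `value`.

    A row-group is skipped only when its statistics prove the value
    cannot be there; row-groups without statistics must still be read.
    """
    keep = []
    for index, entry in enumerate(stats):
        if entry["min"] is None or entry["max"] is None:
            keep.append(index)  # no statistics, so it cannot be ruled out
        elif entry["min"] <= value <= entry["max"]:
            keep.append(index)
    return keep


print(candidate_row_groups(row_group_stats, 150))
```

The statistics only narrow down where to look; the rows in the remaining row-groups still have to be filtered after loading.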
other and split_every equal or lower than length of other
...