User \"system:serviceaccount:nublado-athornton:dask\" cannot get resource \"pods\" in API group \"\" in the namespace \"nublado-athornton\""but I have what look like the right rules in my role:
rules: - apiGroups: - "" resources: - pods verbs: - list - create - delete
RandomForestRegressor()constructor, but still end up with some of my dask workers using 2000% CPU, which is looking really weird.
That last time I looked, pandas loaded everything. It would be reasonable to implement that iteratively, and fastparquet does have a specific method to do that.
Is there a way to do that query without knowing that row-group 1 is where you want to look
Parquet optionally stores columns max and min values for each row-group, so maybe
split_everyequal or lower than length of