I've been using this tool the past few days and it has been great so far. I work heavily with financial data and noticed that when I'm creating my EntitySet if I have column names as dtype int I get a failure message.
I was wondering if this is desired behavior that the column names need to be dtype str for the entity set to work.
Thank you for the awesome product.
locations.PERCENT_TRUE(appointments.sms_received)gives the percent of rows for which
True, given a single location. I'd expect that column to be the same for all rows of a single location, because that's what it was conditioned on, but I'm not finding that to be the case. Any ideas why?
fm.loc[fm.neighborhood == 'HORTO', 'locations.PERCENT_TRUE(appointments.sms_received)'].describe()I get:
cutoff_timeyou'd like to create features at and a
training_windowwhich specifies how much historical data to use. So, you can create the different time period features you want by make multiple calls to
ft.calculate_feature_matrixfor each window. you can read more about handling time here: https://docs.featuretools.com/automated_feature_engineering/handling_time.html