Another question that matters for time-series problems: how much history is pulled into dataframe transformations (the functions you decorate with df_transformation)? With a continuous time series at, say, minute or second resolution, a few years of history can easily mean millions of rows. Does a df_transformation compute the transformation only over exactly the relevant time periods, or will extra history be pulled in? And while too much history could carry a performance penalty, one might also deliberately include a certain amount of history in order to define time-lagged features, such as the value of a specific feature 24 hours, 1 week, 1 month, or 1 year back. So, is there some way to control how much history is included?
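(For concreteness, here is a small pandas sketch of the kind of lagged feature meant above, on a toy minute-resolution frame; the column names and data are illustrative only. At a fixed resolution, a 24-hour lag is just a row shift, which is why the relevant amount of prior history must be present in the dataframe.)

```python
import pandas as pd

# Toy minute-resolution series; "value" stands in for any feature column.
idx = pd.date_range("2023-01-01", periods=3 * 24 * 60, freq="min")
df = pd.DataFrame({"timestamp": idx, "value": range(len(idx))})

# A 24-hour lag: shift by the number of rows one day spans at this
# resolution (24 * 60 minutes). The first day of rows has no lagged
# value, so that much extra history must exist before the period
# you actually care about.
df["value_24h_ago"] = df["value"].shift(24 * 60)
print(df.tail(3))
```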
Replies: 1 comment
By default, FF will use all of the existing data. I wouldn't expect a few million rows to take very long, though; typically less than a couple of minutes. Another technique we've seen is to create a transformation on the main dataset that filters to only the rows before a specific point in time, then run your transformations/features/etc. on top of that table.
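A minimal sketch of that filtering technique, assuming Featureform's local pandas provider (as in the quickstart) and a hypothetical `transactions` source with `timestamp` and `value` columns; all names and the cutoff date are illustrative, not from this thread:

```python
import featureform as ff
from datetime import datetime

local = ff.register_local()

@local.df_transformation(variant="v1", inputs=[("transactions", "v1")])
def bounded_history(transactions):
    # Keep only rows before the cutoff; flip the comparison (>=)
    # to keep only recent history instead, e.g. the ~13 months
    # needed for 24h/1-week/1-month/1-year lagged features.
    cutoff = datetime(2023, 1, 1)
    return transactions[transactions["timestamp"] < cutoff]

# Downstream transformations/features are then registered on top of
# bounded_history rather than the full multi-year table.
@local.df_transformation(variant="v1", inputs=[("bounded_history", "v1")])
def daily_mean(df):
    return df.set_index("timestamp")["value"].resample("1D").mean().reset_index()
```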