Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

batched data #4049

Open
3 tasks done
aldopareja opened this issue May 2, 2024 · 2 comments
Open
3 tasks done

batched data #4049

aldopareja opened this issue May 2, 2024 · 2 comments
Labels
🌊 feat:workflow Workflow related stuff.

Comments

@aldopareja
Copy link

Self Checks

  • I have searched for existing issues search for existing issues, including closed ones.
  • I confirm that I am using English to submit report (我已阅读并同意 Language Policy).
  • Pleas do not modify this template :) and fill in all the required fields.

Provide a description of requested docs changes

Sometimes you generate workflows that should be applied to many different datapoints or a whole dataset simil to dataset.map(apply_workflow) that you do with hf Dataset. You load a datapoint and you want to apply the same workflow to each datapoint (specifying a column like prompt). I don't think that is doable at the moment and it's a very basic use case of a workflow for data manipulation.

@dosubot dosubot bot added the 🌊 feat:workflow Workflow related stuff. label May 2, 2024
@takatost
Copy link
Collaborator

takatost commented May 7, 2024

What's your use case and what do you need to do? 🤔

@galer7
Copy link

galer7 commented May 17, 2024

Not OP, but one of my use cases is:

  • text input -> LLM to split into multiple strings e.g. ToC as input and I get an array with the chapters/subchapters
  • I want to apply a workflow to each generated section individually

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
🌊 feat:workflow Workflow related stuff.
Projects
None yet
Development

No branches or pull requests

3 participants