Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

for 'other workflows' datalab tutorial, need to show how to detect all issue types and use a custom regression model for detecting label errors #993

Open
jwmueller opened this issue Feb 12, 2024 · 1 comment

Comments

@jwmueller
Copy link
Member

jwmueller commented Feb 12, 2024

ideal code would be (this is just pseudocode):

issue_types = get_default_issue_types(problem_type='regression')  # contains many Nones / missing keys (that are left at defaults used internally)
issue_types['label']['cleanlearning_kwargs']['model'] = MySklearnRegressor(...)

datalab.find_issues(features, ..., issue_types=issue_types) 
# will find label errors using my regression model, and simultanoeusly detect many other issue types
@jwmueller jwmueller changed the title for advanced datalab tutorial, need to show how to detect all issue types and use a custom regression model for detecting label errors for 'other workflows' datalab tutorial, need to show how to detect all issue types and use a custom regression model for detecting label errors Feb 12, 2024
@jwmueller
Copy link
Member Author

first need to address this:
#920

so we have a place to store the new code

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant