Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Experiment with a fixed threshold #7519

Draft
wants to merge 19 commits into
base: main
Choose a base branch
from
Draft

Conversation

@tiferet
Copy link

@tiferet tiferet commented Jan 5, 2022

After all the changes made on the modeling side, there is now better separation in scores between TPs and FPs.

This PR experiments with setting a hard-coded score threshold > 0.5 to determine which alerts to surface, rather than always selecting the max-likelihood class, thereby reducing FPs.

Replaces #7512, because it turns out I need to update the worse rather than current libraries, since the end-to-end evaluation runs the boosted queries on the worse modeling.

tiferet and others added 15 commits Jan 4, 2022
Make sure we get near-perfect recall (ATM-light) and bad precision.
Absent features are now represented implicitly by the absence of a row
in the `tokenFeatures` relation, rather than explicitly by an empty
string. This leads to improved runtime performance. To enable this
implicit representation, we pass the set of supported token features to
the `scoreEndpoints` HOP. Requires CodeQL CLI v2.7.4.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Linked issues

Successfully merging this pull request may close these issues.

None yet

4 participants